
The Use of Punctuations in Chinese Syntactic Parsing Techniques
Abstract: Most current syntactic parsers either ignore punctuation, which is an important syntactic feature, or handle it only in a very simple way. Based on the role punctuation plays in syntactic structure, this paper proposes a rule-layering method that integrates punctuation into Chinese syntactic parsing. The method uses the segmenting function of punctuation to split a long sentence into a sequence of shorter sentence units and analyzes the syntax and structure of each unit. A second parsing pass then applies previously extracted type rules to combine the units into a complete parse tree. Experiments show that this approach not only yields correct parse trees for some long sentences that previously could not be parsed correctly, but also reduces parsing ambiguity and improves efficiency.
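Source: Computer Engineering & Science (《计算机工程与科学》), CSCD, Peking University Core Journal, 2009, No. 1, pp. 145-147 (3 pages)
Funding: Special Research Project of the Shaanxi Provincial Department of Education (06JK246)
Keywords: syntactic parsing; punctuation; type rule
About the authors: Zhang Xiaoyan (b. 1967), female, from Xi'an, Shaanxi; associate professor; research interests include network integration and database technology, knowledge engineering and intelligent systems, and computer education technology. Address: School of Computer Science, Xi'an University of Science and Technology, 58 Yanta Middle Road, Xi'an, Shaanxi 710054; Tel: (029)85583722, 13572897811; E-mail: zhangxy@xust.edu.cn. Shao Gang, master's student; research interests include network integration and database technology, and natural language processing.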
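
The split-then-recombine pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the delimiter set `DELIMITERS`, the `Node` class, the labels `S`, `UNIT`, and `PU`, and the placeholder `toy_unit_parser` are all hypothetical. The abstract does not give the paper's type rules, so the second pass here simply attaches the parsed units and punctuation marks under one root.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, List, Tuple

# Assumed set of splitting punctuation; the paper's actual inventory
# is not stated in the abstract.
DELIMITERS = ",;:。!?"


@dataclass
class Node:
    """A hypothetical parse-tree node."""
    label: str
    text: str = ""
    children: List["Node"] = field(default_factory=list)


def split_units(sentence: str) -> List[Tuple[str, str]]:
    """Split a sentence into (unit_text, trailing_punctuation) pairs."""
    # The capturing group makes re.split keep the delimiters in the result,
    # so text and punctuation alternate in `parts`.
    parts = re.split("([" + re.escape(DELIMITERS) + "])", sentence)
    units = []
    for i in range(0, len(parts), 2):
        text = parts[i].strip()
        punct = parts[i + 1] if i + 1 < len(parts) else ""
        if text or punct:
            units.append((text, punct))
    return units


def toy_unit_parser(text: str) -> Node:
    """Stand-in for a real parser applied to one short unit."""
    return Node("UNIT", text=text)


def parse_with_punctuation(
    sentence: str, unit_parser: Callable[[str], Node] = toy_unit_parser
) -> Node:
    """First pass: parse each unit. Second pass (placeholder): join them."""
    root = Node("S")
    for text, punct in split_units(sentence):
        if text:
            root.children.append(unit_parser(text))
        if punct:
            root.children.append(Node("PU", text=punct))
    return root


tree = parse_with_punctuation("我来了,他走了。")
print([(child.label, child.text) for child in tree.children])
```

In a fuller version, the placeholder second pass would be replaced by rules that choose how adjacent units attach depending on the punctuation between them, and `toy_unit_parser` would be swapped for an actual parser.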