期刊文献+

一种基于短语的汉蒙统计机器翻译与调序模型 被引量:3

A phrase based statistical Chinese-Mongolian machine translation and reordering model
在线阅读 下载PDF
导出
摘要 根据蒙古语的一些特点,为基于短语的汉蒙统计机器翻译提出了一种适合于汉蒙统计机器翻译的调序模型,并给出了相应的训练和解码算法以及初步实验的结果。汉蒙双语语料库规模很小,数据稀疏问题严重,而在汉蒙翻译中,词序变化又非常明显,在汉英等机器翻译中使用的调序方法难于应用到汉蒙统计机器翻译中。通过对汉蒙翻译过程中词语顺序变化的正态分布假设,建立了一种概率调序模型。实验表明,这种概率调序模型好于 Moses系统中采用的调序方法。 Based on the phrase-based statistical Chinese-Mongolian machine translation, an ordering model is put forward according to the Mongolian features, together with the corresponding drills, the decoding algorithm as well as the results of the primary experiments. Currently, the Chinese-Mongolian bilingual corpus is on a relatively small scale and its data are seriously sparse. In addition, the word order changes are dramatic and prevalent in Chinese-Mongolian translations. Consequently, the reordering method used in Chinese-English translation can not be optimally applied to the Chinese-Mongolian translation. By the assumption of the normal distribution of word-order changes after the analyses of these changes in Chinese-Mongolian translations, a probabilistic reordering model is established in the paper. According to the experimental results, the probabilistic model is superior to the ordering method in the Moses.
出处 《高技术通讯》 EI CAS CSCD 北大核心 2009年第5期475-479,共5页 Chinese High Technology Letters
基金 973计划(2007CB316503) 内蒙古自然基金(200607010805)资助项目
关键词 机器翻译 统计方法 蒙古语 调序 概率 machine translation, statistical method, Mongolian, reorder, probability
作者简介 男,1972年生,博士,副教授;研究方向:自然语言处理;联系人,E-mail:eshhx@imu.edu.cn
  • 相关文献

参考文献15

  • 1Och F J, Tillman C, Ney H. Improved alignment models for statistical machine translation. In: Proceedings of the Conference on Empirical Methods of Natural Language Processing, College Park, Maryland, USA, 1999.20-28.
  • 2Koehn P, Och F J, Marcu D. Statistical phrase-based translation. In: Proceedings of the Homan Language Technology/ North American Chapter of the Association for Computing Linguistics 2003, Edmonton, Canada, 2003. 127-133.
  • 3侯宏旭,刘群,那顺乌日图.基于实例的汉蒙机器翻译[J].中文信息学报,2007,21(4):65-72. 被引量:16
  • 4Brown P F, Cocke J, Della Pietra S A, et al. A statistical approach to machine translation. ComputationalLinguistics, 1990, 16(2) :79-85.
  • 5Brown P F, Della Pierre S A, Della Pietm V J, et al. The mathematics of statistical machine translation parameter estimation. ComputationalLinguistics, 1993, 19(2) :263-311.
  • 6Liu Y, Liu Q, Lin S. Tree-to-string alignment template for statistical machine translation. In: Proceedings of the International Conference on Computing Linguistics/Annual Meeting of the Association for Computing Linguistics, Sydney, Australia, 2006. 609-616.
  • 7Xiong D, Liu Q, Lin S. Maximum entropy based phrase reordering model for statistical machine translation. In: Proceedings of the International Conference on Computing Iinguistics/Annual Meeting of the Association for Computing Linguistics, Sydney, Australia, 2006. 521-528.
  • 8Tillmann C, Zhang T. A localized prediction model for statistical machine translation. In: Proceedings of the Annual Meeting of the Association for Computing Linguistics, Ann Arbor, Michigan, USA, 2005. 557-564.
  • 9Watanabe T, Sumita E. Bidirectional decoding for statistical machine translation. In: Proceedings of the 19th International Conference on Computational Linguistic (COLING), Taipei, China, 2002. 1078-1085.
  • 10侯宏旭,刘群,刘志文,张国强.Skip-N蒙古文统计语言模型[J].内蒙古大学学报(自然科学版),2008,39(2):220-224. 被引量:8

二级参考文献54

共引文献30

同被引文献47

引证文献3

二级引证文献14

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部