
Improving Translation Selection with a New Translation Model Trained by Independent Monolingual Corpora

Abstract


We propose a novel statistical translation model to improve translation selection for collocations. The statistical approach commonly applied to translation selection trains the translation model on bilingual corpora. However, large-scale bilingual corpora are difficult to acquire, particularly for language pairs involving Chinese. In this paper, we propose a new approach that trains the translation model on unrelated monolingual corpora. First, a Chinese corpus and an English corpus are each parsed with a dependency parser, producing two dependency triple databases. Then, the similarity between a Chinese word and an English word is estimated from the two monolingual dependency triple databases with the help of a simple Chinese-English dictionary. This cross-language word similarity is used to approximate the word translation probability. Finally, the resulting translation model is combined with a language model trained on the English dependency database to translate Chinese collocations into English. To evaluate the method, we performed several experiments on verb-object collocation translation. The experiments produced promising results.
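
The pipeline described in the abstract can be illustrated with a minimal sketch. The abstract gives no formulas, so everything specific here is an assumption: the similarity measure (Jaccard overlap of dictionary-translated dependency contexts), the language model (relative frequency of English triples), the function names, and the toy triples and dictionary are all hypothetical and are not taken from the paper.

```python
from collections import Counter
from itertools import product

# Toy dependency triples (head, relation, dependent). All data is invented.
ZH_TRIPLES = [
    ("看", "obj", "電影"), ("看", "obj", "書"),
    ("讀", "obj", "書"), ("讀", "obj", "報紙"),
]
EN_TRIPLES = [
    ("watch", "obj", "film"), ("watch", "obj", "film"),
    ("see", "obj", "film"), ("see", "obj", "friend"),
    ("read", "obj", "book"), ("read", "obj", "newspaper"),
]
# Hypothetical "simple Chinese-English dictionary": word -> candidate translations.
DICTIONARY = {
    "看": {"see", "watch", "read", "look"},
    "讀": {"read"},
    "電影": {"film", "movie"},
    "書": {"book"},
    "報紙": {"newspaper"},
}

def features(word, triples):
    """Dependency contexts of a word: (relation, role of the word, other word)."""
    feats = set()
    for head, rel, dep in triples:
        if head == word:
            feats.add((rel, "head", dep))
        if dep == word:
            feats.add((rel, "dep", head))
    return feats

def cross_similarity(zh_word, en_word):
    """Assumed measure: Jaccard overlap after mapping Chinese contexts to English."""
    zh_feats = features(zh_word, ZH_TRIPLES)
    en_feats = features(en_word, EN_TRIPLES)
    translated = {(rel, role, t)
                  for rel, role, other in zh_feats
                  for t in DICTIONARY.get(other, ())}
    union = translated | en_feats
    return len(translated & en_feats) / len(union) if union else 0.0

EN_COUNTS = Counter(EN_TRIPLES)

def triple_probability(verb, obj):
    """Assumed language model: relative frequency of the English triple."""
    return EN_COUNTS[(verb, "obj", obj)] / len(EN_TRIPLES)

def translate_collocation(zh_verb, zh_obj):
    """Pick the English verb-object pair with the highest combined score."""
    candidates = product(sorted(DICTIONARY[zh_verb]), sorted(DICTIONARY[zh_obj]))
    def score(pair):
        verb, obj = pair
        return (triple_probability(verb, obj)
                * cross_similarity(zh_verb, verb)
                * cross_similarity(zh_obj, obj))
    return max(candidates, key=score)

print(translate_collocation("看", "電影"))  # ('watch', 'film')
```

In this toy setting the similarity score stands in for the word translation probability, and the triple frequency plays the role of the language model, so no bilingual corpus is needed; only the two monolingual triple lists and the dictionary are consulted.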

Cited by


蔣明撰 (2012). 混合單語與平行語料之搭配詞翻譯 [master's thesis, National Tsing Hua University]. Airiti Library. https://doi.org/10.6843/NTHU.2012.00510
蕭孟勤 (2007). 利用依存關係之詞彙翻譯 [master's thesis, National Tsing Hua University]. Airiti Library. https://www.airitilibrary.com/Article/Detail?DocID=U0016-0210200715255309
