Abstract
Multi-category words are widely distributed in Chinese patent documents, and identification of them has been one of the difficulties in machine translation (MT). This paper proposes a rule-based method for identifying verb and preposition multi-category words in Chinese-English patent machine translation. Based on principles of boundary perception and according to syntactic and semantic information of multi-category words as well as context information, some reliable disambiguation rules are designed to help the MT system analyze proper categories of words, then proposes adverbial and predicate identification rules to determine and identify the words further. Related experiments and BLEU evaluations show that the method is efficient to recognize verbs and prepositions better, and is also helpful to improve final translation quality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Jiang, H., Jiang, T., Zhang, K., et al.: Some Key Issues in Chinese-to-English Patent Machine Translation. In: Proceedings of the 9th China Workshop on Machine Translation, CWMT 2013 (2013)
Wang, D.: Chinese to English automatic patent machine translation at SIPO. J. World Patent Information. 31(2), 137–139 (2009)
Goto, I., Lu, B., Chow, K.P., Sumita, E., Tsou, B.K.: Overview of the patent machine translation task at the ntcir-9 workshop. In: Proceedings of 9th NTCIR Workshop Meeting, pp. 559–578 (2011)
Eneko, A., Edmonds, P.: Word sense disambiguation, algorithms and applications. Springer, Heidelberg (2006)
McCarthy, D.: Word Sense Disambiguation: An Overview. J. Language and Linguistics Compass (3/2), 537–558 (2009)
Xia, J.: Automatic Recognition Research on Syntactic Category of Common Words (2012) (in Chinese)
Zhang, L.: Multi-category words processing mechanism combined rule with statistical methods (2002) (in Chinese)
Gan, J., Huang, D.: Automatic Identification of Chinese Prepositional Phrase. J. Journal of Chinese Information Processing 19(4), 17–23 (2005) (in Chinese)
Wen, M., Wu, Y.: Feature-rich Prepositional Phrase Boundary Identification based on SVM. J. Journal of Chinese Information Processing. 23(5), 19–24 (2009) (in Chinese)
Zan, H., Zhang, T., Zhang, K.: Automatic recognition research on preposition’s usage based on combinations of rules and statistics. J. Computer Engineering and Design 34(6), 2153–2157 (2013) (in Chinese)
List, J.: Review of machine translation in patents–Implications for search. J. World Patent Information 34, 193–195 (2012)
Isozaki, H., Hirao, T., Duh, K., et al.: Automatic Evaluation of Translation Quality for Distant Language Pairs. In: Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing, pp. 944–952. MIT, Massachusetts (2010)
Wei, X., Zhang, Q.: Study on the computer-proceeding rules of V+V in Chinese. J. Application Research of Computers. 1, 43–46 (2006) (in Chinese)
Zhu, Y., Jin, Y.: A Chinese-English patent machine translation system based on the theory of hierarchical network of concepts. J. The Journal of China Universities of Posts and Telecommunications 19(Suppl. 2), 140–146 (2012)
Papineni, K., Roukos, S., Ward, T., et al.: BLEU: a Method for Automatic Evaluation of Machine Translation. In: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pp. 311–318 (2002)
Liu, Q., Qian, Y.: Summary on Chinese Information Processing Technology Evaluation. J. Communications of China Computer Federation 02, 11–18 (2008) (in Chinese)
Madnani, N.: iBLEU: Interactively Debugging & Scoring Statistical Machine Translation Systems. In: Proceedings of the Fifth IEEE International Conference on Semantic Computing, pp. 213–214. IEEE Press, USA (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this paper
Cite this paper
Li, H., Zhu, Y., Jin, Y. (2015). Identifying Verb-Preposition Multi-Category Words in Chinese-English Patent Machine Translation. In: Chalup, S.K., Blair, A.D., Randall, M. (eds) Artificial Life and Computational Intelligence. ACALCI 2015. Lecture Notes in Computer Science(), vol 8955. Springer, Cham. https://doi.org/10.1007/978-3-319-14803-8_32
Download citation
DOI: https://doi.org/10.1007/978-3-319-14803-8_32
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-14802-1
Online ISBN: 978-3-319-14803-8
eBook Packages: Computer ScienceComputer Science (R0)