DOI QR코드

DOI QR Code

Optimized Chinese Pronunciation Prediction by Component-Based Statistical Machine Translation

  • Zhu, Shunle (Donghai Science and Technology College, Zhejiang Ocean University)
  • Received : 2018.08.31
  • Accepted : 2019.05.29
  • Published : 2021.02.28

Abstract

To eliminate ambiguities in the existing methods to simplify Chinese pronunciation learning, we propose a model that can predict the pronunciation of Chinese characters automatically. The proposed model relies on a statistical machine translation (SMT) framework. In particular, we consider the components of Chinese characters as the basic unit and consider the pronunciation prediction as a machine translation procedure (the component sequence as a source sentence, the pronunciation, pinyin, as a target sentence). In addition to traditional features such as the bidirectional word translation and the n-gram language model, we also implement a component similarity feature to overcome some typos during practical use. We incorporate these features into a log-linear model. The experimental results show that our approach significantly outperforms other baseline models.

Keywords

References

  1. S. K. Hsieh, "Hanzi, concept and computation: a preliminary survey of Chinese Characters as a Knowledge Resource in NLP," PhD dissertation, Universitat Tubingen, Tubingen, Germany, 2006.
  2. R. J. Byrd and E. Tzoukermann, "Adapting an English morphological analyzer for French," in Proceedings of the 26th Annual Meeting of the Association for Computational Linguistics, Buffalo, NY, 1988, pp. 1-6.
  3. F. L. Huang, S. Y. Ke, and Q. W. Fan, "Predicting effectively the pronunciation of Chinese polyphones by extracting the lexical information," in Advances in Computer and Information Sciences and Engineering. Dordrecht, Germany: Springer, 2008, pp. 159-165.
  4. C. Mi, Y. Yang, X. Zhou, L. Wang, X. Li, and T. Jiang, "Exploiting Bishun to predict the pronunciation of Chinese," Computación y Sistemas, vol. 20, no. 3, pp. 541-549, 2016.
  5. P. Koehn, F. J. Och, and D. Marcu, "Statistical phrase-based translation," in Proceedings of Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), Edmonton, Canada, 2003.
  6. R. Zens, F. J. Och, and H. Ney, "Phrase-based statistical machine translation," in KI 2002: Advances in Artificial Intelligence. Heidelberg, Germany: Springer, 2002, pp. 18-32
  7. F. J. Och and H. Ney, "The alignment template approach to statistical machine translation," Computational Linguistics, vol. 30, no. 4, pp. 417-449, 2004. https://doi.org/10.1162/0891201042544884
  8. X. Shi, K. Knight, and H. Ji, "How to speak a language without knowing it," in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Baltimore, MD, 2014, pp. 278-282.
  9. C. C. Lin and R. T. H. & Tsai, "A generative data augmentation model for enhancing Chinese dialect pronunciation prediction," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 4, pp. 1109-1117, 2012. https://doi.org/10.1109/TASL.2011.2172424
  10. J. Hatori and H. Suzuki, "Predicting word pronunciation in Japanese," in Computational Linguistics and Intelligent Text Processing. Heidelberg, Germany: Springer, 2011 pp. 477-492.
  11. J. Hatori and H. Suzuki, "Japanese pronunciation prediction as phrasal statistical machine translation," in Proceedings of 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, 2011, pp. 120-128.
  12. R. Christensen, Log-Linear Models and Logistic Regression. New York, NY: Springer, 2006.
  13. F. J. Och and H. Ney, "Discriminative training and maximum entropy models for statistical machine translation," in Proceedings of the 40th Annual meeting of the Association for Computational Linguistics, Philadelphia, PA, 2002, pp. 295-302.
  14. P. F. Brown, V. J. Della Pietra, P. V. Desouza, J. C. Lai, and R. L. Mercer, "Class-based n-gram models of natural language," Computational Linguistics, vol. 18, no. 4, pp. 467-480, 1992.
  15. W. J. Heeringa, "Measuring dialect pronunciation differences using Levenshtein distance," Ph.D. dissertation, University Library Groningen, The Netherlands, 2004.
  16. F. J. Och, "Minimum error rate training in statistical machine translation," in Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, Sapporo, Japan, 2003, pp. 160-167.
  17. P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, et al., "Moses: open source toolkit for statistical machine translation," in Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions, Prague, Czech Republic, 2007, pp. 177-180.
  18. A. Stolcke, "SRILM-an extensible language modeling toolkit," in Proceedings of the 7th International Conference on Spoken Language Processing, Denver, CO, 2002, pp. 901-904.