Optimized Chinese Pronunciation Prediction by Component-Based Statistical Machine Translation

Zhu, Shunle;

doi:10.3745/JIPS.04.0208

Journal of Information Processing Systems

Volume 17 Issue 1
/
Pages.203-212
/
2021
/
1976-913X(pISSN)
/
2092-805X(eISSN)

Korea Information Processing Society (한국정보처리학회)

DOI QR Code

Optimized Chinese Pronunciation Prediction by Component-Based Statistical Machine Translation

Zhu, Shunle (Donghai Science and Technology College, Zhejiang Ocean University)

Received : 2018.08.31
Accepted : 2019.05.29
Published : 2021.02.28

https://doi.org/10.3745/JIPS.04.0208 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

To eliminate ambiguities in the existing methods to simplify Chinese pronunciation learning, we propose a model that can predict the pronunciation of Chinese characters automatically. The proposed model relies on a statistical machine translation (SMT) framework. In particular, we consider the components of Chinese characters as the basic unit and consider the pronunciation prediction as a machine translation procedure (the component sequence as a source sentence, the pronunciation, pinyin, as a target sentence). In addition to traditional features such as the bidirectional word translation and the n-gram language model, we also implement a component similarity feature to overcome some typos during practical use. We incorporate these features into a log-linear model. The experimental results show that our approach significantly outperforms other baseline models.

Keywords

References

S. K. Hsieh, "Hanzi, concept and computation: a preliminary survey of Chinese Characters as a Knowledge Resource in NLP," PhD dissertation, Universitat Tubingen, Tubingen, Germany, 2006.
R. J. Byrd and E. Tzoukermann, "Adapting an English morphological analyzer for French," in Proceedings of the 26th Annual Meeting of the Association for Computational Linguistics, Buffalo, NY, 1988, pp. 1-6.
F. L. Huang, S. Y. Ke, and Q. W. Fan, "Predicting effectively the pronunciation of Chinese polyphones by extracting the lexical information," in Advances in Computer and Information Sciences and Engineering. Dordrecht, Germany: Springer, 2008, pp. 159-165.
C. Mi, Y. Yang, X. Zhou, L. Wang, X. Li, and T. Jiang, "Exploiting Bishun to predict the pronunciation of Chinese," Computación y Sistemas, vol. 20, no. 3, pp. 541-549, 2016.
P. Koehn, F. J. Och, and D. Marcu, "Statistical phrase-based translation," in Proceedings of Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL), Edmonton, Canada, 2003.
R. Zens, F. J. Och, and H. Ney, "Phrase-based statistical machine translation," in KI 2002: Advances in Artificial Intelligence. Heidelberg, Germany: Springer, 2002, pp. 18-32
F. J. Och and H. Ney, "The alignment template approach to statistical machine translation," Computational Linguistics, vol. 30, no. 4, pp. 417-449, 2004. https://doi.org/10.1162/0891201042544884
X. Shi, K. Knight, and H. Ji, "How to speak a language without knowing it," in Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), Baltimore, MD, 2014, pp. 278-282.
C. C. Lin and R. T. H. & Tsai, "A generative data augmentation model for enhancing Chinese dialect pronunciation prediction," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 4, pp. 1109-1117, 2012. https://doi.org/10.1109/TASL.2011.2172424
J. Hatori and H. Suzuki, "Predicting word pronunciation in Japanese," in Computational Linguistics and Intelligent Text Processing. Heidelberg, Germany: Springer, 2011 pp. 477-492.
J. Hatori and H. Suzuki, "Japanese pronunciation prediction as phrasal statistical machine translation," in Proceedings of 5th International Joint Conference on Natural Language Processing, Chiang Mai, Thailand, 2011, pp. 120-128.
R. Christensen, Log-Linear Models and Logistic Regression. New York, NY: Springer, 2006.
F. J. Och and H. Ney, "Discriminative training and maximum entropy models for statistical machine translation," in Proceedings of the 40th Annual meeting of the Association for Computational Linguistics, Philadelphia, PA, 2002, pp. 295-302.
P. F. Brown, V. J. Della Pietra, P. V. Desouza, J. C. Lai, and R. L. Mercer, "Class-based n-gram models of natural language," Computational Linguistics, vol. 18, no. 4, pp. 467-480, 1992.
W. J. Heeringa, "Measuring dialect pronunciation differences using Levenshtein distance," Ph.D. dissertation, University Library Groningen, The Netherlands, 2004.
F. J. Och, "Minimum error rate training in statistical machine translation," in Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, Sapporo, Japan, 2003, pp. 160-167.
P. Koehn, H. Hoang, A. Birch, C. Callison-Burch, M. Federico, N. Bertoldi, et al., "Moses: open source toolkit for statistical machine translation," in Proceedings of the 45th Annual Meeting of the Association for Computational Linguistics Companion Volume Proceedings of the Demo and Poster Sessions, Prague, Czech Republic, 2007, pp. 177-180.
A. Stolcke, "SRILM-an extensible language modeling toolkit," in Proceedings of the 7th International Conference on Spoken Language Processing, Denver, CO, 2002, pp. 901-904.

Journal of Information Processing Systems

Optimized Chinese Pronunciation Prediction by Component-Based Statistical Machine Translation

Abstract

Keywords

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)