Acknowledgement
Supported by : 한국연구재단
References
- Teubert Wolfgang, "Comparable or parallel corpora?," International journal of lexicography, Vol. 9, No. 3, pp. 238-264, 1996. https://doi.org/10.1093/ijl/9.3.238
- Sunghyun Kim. Seon Yang and Youngjoong Ko, "Extracting Korean-English Parallel Sentences from Wikipedia," Journal of korean institute of information scientists and engineers (KIISE): software and applications, pp. 580-585, 2014.
- Dragos Stefan munteanu and Daniel Marcu, "Improving machine translation performance by exploiting non-parallel corpora," Computational linguistics, Vol. 31, No. 4, pp. 477-504, 1995.
- Tao Tao and ChengXiang Zhai, "Mining comparable bilingual text corpora for cross-language information integration," Proc. of the 19th ACM SIGKDD international conference on knowledge discovery in data mining (KDD-2005), pp. 691-696, 2005.
- Ramirez Jessica C and Yuji Matsumoto, "A Rule-Based Approach For Aligning Japanese-Spanish Sentences From A Comparable Corpora," arXiv preprint arXiv:1211.4488, 2012.
- Utiyama Masao and Hitoshi Isahara, "Reliable measures for aligning Japanese-English news articles and sentences," Proc. of ACL '03, pp. 72-79, 2003.
- Adafre Sisay Fissaha and Maarten De Rijke. "Finding similar sentences across multiple languages in wikipedia," Proc. of ACL '06, pp. 62-69, 2006.
- David M. Blei, Andrew Y. Ng and Michael I.Jordan, "Latent dirichlet allocation," The journal of machine learning research, 3, pp. 993-1022, 2003.
- Zede Zhu, Miao Li, Lei Chen and Zhenxin Yang, "Building Comparable Corpora Based on Bilingual LDA Model," Proc. of ACL '13, pp. 278-282, 2013.
- Ture Ferhan and jimmay Lin, "Why not grab a free lunch?: mining large corpora for parallel sentences to improve translation modeling," Proc. of the 2012 conference of the north american chapter of the association for computational linguistics: human language technologies, association for computational linguistics, pp. 626-630, 2012.
- Mallet toolkit, [Online]. Available: http://mallet.cs.umass.edu/download.php
- GIZA++ statistical translation models toolkit, [Online]. Available: http://code.google.com/p/giza-pp/