Project Information
Research project supervising institution: National Research Foundation of Korea
References
- Wolfgang Teubert, "Comparable or parallel corpora?," International Journal of Lexicography, vol.9, no.3, p.238, 1996. https://doi.org/10.1093/ijl/9.3.238
- Sisay Fissaha Adafre and Maarten de Rijke, "Finding similar sentences across multiple languages in Wikipedia," In Proceedings of EACL'06, p.62, 2006.
- Sanjika Hewavitharana and Stephan Vogel, "Extracting parallel phrases from comparable data," In Proceedings of BUCC'11, p.61, 2011.
- Ferhan Ture and Jimmy Lin, "Why not grab a free lunch?: mining large corpora for parallel sentences to improve translation modeling," In Proceedings of NAACL'12, p.626, 2012.
- Jeffrey Dean and Sanjay Ghemawat, "MapReduce: simplified data processing on large clusters," Communications of the ACM, vol.51, no.1, p.107, 2008.
- David M. Blei, Andrew Y. Ng and Michael I. Jordan, "Latent Dirichlet allocation," Journal of Machine Learning Research, vol.3, p.993, 2003.
- Zede Zhu, Miao Li, Lei Chen and Zhenxin Yang, "Building Comparable Corpora Based on Bilingual LDA Model," In Proceedings of ACL'13, p.278, 2013.
- Ivan Vulic, Wim De Smet, and Marie-Francine Moens, "Cross-language information retrieval with latent topic models trained on a comparable corpus," Information Retrieval Technology, Springer Berlin Heidelberg, p.37, 2011.
- Ivan Vulic and Marie-Francine Moens, "Cross-lingual semantic similarity of words as the similarity of their semantic word responses," In Proceedings of NAACL'13, p.106, 2013.