References
- Berry, M. W. and Castellanos, M. (2007), Survey of Text Mining : Clustering, Classification, and Retrieval, Springer, New York, NY, USA.
- Chen, S., Xu, Y., and Chang, H. (2011), A simple and effective unsupervised word segmentation approach, In proceedings of the 25th AAAI Conference on Artificial Intelligence, San Francisco, CA, USA.
- Cho, S. G. and Kim, S. B. (2012), Finding meaningful pattern of key words in IIE Transactions using text mining, Journal of the Korean Institute of Industrial Engineers, 38(1), 67-73. https://doi.org/10.7232/JKIIE.2012.38.1.067
- Fellbaum, C. (2005), WordNet and wordnets, In: Brown, Keith et al. (eds.), Encyclopedia of Language and Linguistics, Second Edition, Oxford: Elsevier, 665-670.
- Feng, H., Chen, K., Deng, X., and Zheng, W. (2004), Accessor variety criteria for Chinese word extraction. Computational Linguistics, 30(1), 75-93. https://doi.org/10.1162/089120104773633394
- Harris, Z. S. (1955), From phoneme to morpheme, Language, 31(2), 190-222. https://doi.org/10.2307/411036
- Hotho, A., Nurnberger, A., and Paass, Gerhard (2005), A brief survey of text mining, Ldv Forum, 20(1), 19-62.
- Jin, Z. and Tanaka-Ishii, K. (2006), Unsupervised segmentation of Chinese text by use of branching entropy, In Proceedings of the COLING/ACL on Main conference poster sessions, Association for Computational Linguistics.
- Jurafsky, D. and Martin, J. H. (2009), Speech and Language Processing : An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition, Prentice Hall.
- Kleinberg, J. M. (1999), Authoritative sources in a hyperlinked environment, Journal of ACM, 46(5), 604-632. https://doi.org/10.1145/324133.324140
- Lawrence, P., Brin, S., Rajeev, M., and Terry, W. (1999), The PageRank citation ranking: Bringing order to the web. Technical Report, Stanford InfoLab.
- Lee, D., Yeon, J., Hwang, I., and Lee, S.-G. (2010), KKMA : A tool for utilizing Sejong Corpus based on Relational Database, Journal of KIISE : Computing Practices and Letters, 16(11), 1046-1050.
- Lu, X., Zhang, L., and Hu, J. (2004), Statistical substring reduction in linear time, In proceedings of the 1st International Joint Conference on Natural Language Processing (IJCNLP), Hainan Island, China.
- Maosong, S. Dayang, S., and Tsou, B. K. (1998), Chinese word segmentation without using lexicon and hand-crafted training data, In proceedings of the 17th International Conference on Computational Linguistics (COLING), Stroudsburg, PA, USA.
- McKinsey Global Institute (2011), Big Data : The Next Frontier for Innovation, Competition, and Productivity.
- Mihalcea, R. and Tarau, P. (2004), TextRank : Bringing order into texts, In proceedings of 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP), Barcelona, Spain.
- Mochihashi, D. Yamada T. and Ueda N. (2009), Bayesian unsupervised word segmentation with nested Pitman-Yor language modeling, Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP.
- Petrovic, S., Snajder J., and Dalbelo B. (2010), Extending lexical association measures for collocation extraction, 24(2), 383-394. https://doi.org/10.1016/j.csl.2009.06.001
- Porter, M. F. (1980), An algorithm for suffix stripping, Program, 14(3), 130-137. https://doi.org/10.1108/eb046814
- Willett, P. (2006), The Porter stemming algorithm : then and now, Program : Electronic Library and Information Systems, 40(3), 219-223. https://doi.org/10.1108/00330330610681295
- Zhao, H. and Kit, C. (2007), Incorporating global information into supervised learning for Chinese word segmentation, In proceedings of the 10th Conference of the Pacifi c Association for Computational Linguistics (PCALING), Melbourne, Australia.