DOI QR코드

DOI QR Code

한국농촌계획 온톨로지 구축을 위한 상호정보 기반 단어연결망 분석

Word Network Analysis based on Mutual Information for Ontology of Korean Rural Planning

  • 이제명 (교토대학교 지역환경과학전공)
  • Lee, Jemyung (Division of Environmental Science and Technology, Kyoto University)
  • 투고 : 2017.07.12
  • 심사 : 2017.08.04
  • 발행 : 2017.08.30

초록

There has been a growing concern on ontology especially in recent knowledge-based industry and defining a field-customized semantic word network is essential for building it. In this paper, a word network for ontology is established with 785 publications of Korean Society of Rural Planning(KSRP), from 1995 to 2017. Semantic relationships between words in the publications were quantitatively measured with the 'normalized pointwise mutual information' based on the information theory. Appearance and co-appearance frequencies of nouns and adjectives in phrases are analyzed based on the assumption that a 'noun phrase' represents a single 'concept'. The word network of KSRP was compared with that of $WordNet^{TM}$, a world-wide thesaurus network, for the verification. It is proved that the KSRP's word network, established in this paper, provides words' semantic relationships based on the common concepts of Korean rural planning research field. With the results, it is expecting that the established word network can present more opportunity for preparation of the fourth industrial revolution to the field of the Korean rural planning.

키워드

참고문헌

  1. Baldridge, J., 2005, The opennlp project. URL:http://opennlp.apache.org/index.html.
  2. Bastian, M., Heymann, S. and Jacomy, M., 2009, Gephi: an open source software for exploring and manipulating networks, International AAAI Conference on Weblogs and Social Media.
  3. Benjamins, P.V., Fensel, D. and Gomez-Perez, A., 1998, Knowledge Management through Ontologies, International Conference on Practical Aspects of Knowledge Management (PAKM-98), pp.29-30.
  4. Berneers-Lee, T., 2001, The Semantic Web: A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities, Scientific American, 284(5), pp.34-43. https://doi.org/10.1038/scientificamerican0501-34
  5. Bouma, G., 2009, Normalized (Pointwise) Mutual Information in Collocation Extraction, Proceedings of the Biennial GSCL Conference, pp.31-40.
  6. Cambrosio, A., Limoges, C., Courtial, J.P. and Lavile, F. 1993, Historical scientometrics? Mapping over 70 years of biological safety research with coword analysis, Scientometrics, 27(2), pp.119-143. https://doi.org/10.1007/BF02016546
  7. Choi, Y., 2017, Legal Issues of Regional Informatization in the Fourth Industrial Revolution, The Journal of Public Policy & Governance, 10(4), pp.35-57.
  8. Choi, Y.J., Choi, S.J., 2016, Analysis of Buyer's Behavioral Difference by Types of Online Shopping - Utilizing online sales data of Yangpyeong Sumi Village -, Journal of Kyonggi Tourism Research, 26, pp.49-66.
  9. Church K.W. and Hanks P., 1990, Word association norms, mutual information, and lexicography, Computational Linguistics, 16(1), pp.22-29.
  10. Damani, O.P., 2013, Improving Pointwise Mutual Information (PMI) by Incorporating Significant Co-occurrence, Proceedings of the Seventeenth Conference on Computational Natural Language Learning, pp.20-28.
  11. Ding, Y., Chowdhury, G.G. and Foo, S., 2001, Bibliometric cartography of information retrieval research by using co-word analysis, Information Processing & Management, 37(6), pp.817-842. https://doi.org/10.1016/S0306-4573(00)00051-0
  12. Fano, R.M., 1961, Transmission of Information: A Statistical Theory of Communication, The MIT Press. pp.21-61.
  13. Grangel-González, I., Halilaj, L., Coskun, G., Auer, S., Collarana, D. and Hoffmeister, M., 2016, Towards a Semantic Administrative Shell for Industry 4.0 Components, 2016 IEEE Tenth International Conference on Semantic Computing (ICSC), pp.230-237.
  14. Gu M.S., Hwang J.H., Ryu, K.H. and Hong J.E., 2006, Semi-Automatic Ontology Generation about XML Documents using Data Mining Method, The KIPS Transactions : Part D, 13(3), pp.299-308.
  15. Hyun, S.H. and Ham, Y.S., 2017, Analysis on the Effects to Local Tax Income according to Regional Industrial Structure's Productivity: Focused on Local governments in GyeongGi Province, The Korean Journal of Local Government Studies, 20(4), pp.25-45. https://doi.org/10.20484/klog.20.4.2
  16. Hyvonen, E., Saarela, S., and Kim V., 2003, Ontogator: combining view- and ontology-based search with semantic browsing, Proceedings of XML Finland 2003, Open Standards, XML, and the Public Sec-tor, Kuopio.
  17. Jacomy, M., Venturini, T., Heymann, S. and Bastian, M., 2014, ForceAtlas2, a Continuous Graph Layout Algorithm for Handy Network Visualization Designed for the Gephi Software, PLoS ONE, 9(6): e98679, https://doi.org/10.1371/journal.pone.0098679
  18. Kang, S.J., 2004, Ontology Construction and Its Application to Disambiguate Word Senses, The KIPS transactions. Part B, 11(4), pp.491-500.
  19. Kim, H.S., Choi, I. and Kim, M., 2005, A Statistical Approach for Extracting and Naming Relation between Concepts, The KIPS(Korea Information Processing Society) Transactions : Part B, 12(4), pp.479-486.
  20. Lawrie D., Croft B., and Rosenberg, A., 2001, Finding topic words for hierarchical summarization, In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '01). ACM, New York, NY, USA, pp.349-357.
  21. Lee Y., 2016, Dynamic ontology construction algorithm from Wikipedia and its application toward real-time nation image analysis, Journal of the Korean Data & Information Science Society, 27(4),pp.979-991. https://doi.org/10.7465/jkdi.2016.27.4.979
  22. Lee, H.J., Lee, J.M., Park, M.J., Kim, H.J., Lee, J.J., 2006, Development of Rural Amenity resrouces Information System Using Ontology and Web-GIS, Journal of the Korean Society of Rural Planning, 12(4), pp.13-22.
  23. Lee, H.R., 2009, Implementation of Information Retrieval and Management System Based on Ontology Using Object Oriented Design Pattern, Journal of the Korean Association of Geographic Information Studies, 12(4), pp.146-157.
  24. Lee, H.R., Baek, J.H., Baek, J.H., 2008, Prototype of Crops Information System based on Ontology and WebGIS, Journal of the Korean Association of Geographic Information Studies, 11(3), pp.43-51.
  25. Lee, J., Kim, Y., Shin, H. and Song, K., 2014, A Study on Ontology Based Knowledge Representation Method with the Alzheimer Disease Related Articles, Journal of Internet Computing and Services, 15(3), pp.125-135. https://doi.org/10.7472/jksii.2014.15.3.125
  26. Lee, J.M., Lee, J.J., 2006, Integration with External Information Using Ontology for Rural Amenity Resources Information Service, Journal of the Korean Society of Rural Planning, 12(4), pp.53-61.
  27. Lee, J.M., Suh, K., Kim, H.J., Lee, J.J., 2005, Design of Integrated Database Schema for Imptoving Usability of Rural Information, Journal of the Korean Society of Rural Planning, 11(2), pp.43-49.
  28. Lee, S.H., 2010, Development of Subsurface Spatial Information Model with Cluster Analysis and Ontology Model, Journal of the Korean Association of Geographic Information Studies, 13(4), pp.170-180.
  29. Maedche, A. and Staab, S., 2000, Semi-Automatic Engineering of Ontologies from Text , In Proceedings of the 12th Internal Conference on Software and Knowledge Engineering, pp.231-239.
  30. Mashey, J.R., 1998, Big Data ... and the Next Wave of InfraStress, Usenix, Slides from invited talk.
  31. Miller, G.A., 1995, WordNet: a lexical database for English, Communications of the ACM, 38(11), pp.39-41. https://doi.org/10.1145/219717.219748
  32. Ministry of Agriculture, Food and Rural Affairs (MAFRA, 농식품부), 2017a, High-quality Bigdata-map construction for the fourth industrial revolution of agriculture and food sector(in Korean: 농식품분야 4차혁명 위해 고품질 빅데이터 지도 구축), A press releas, MAFRA.
  33. Ministry of Agriculture, Food and Rural Affairs (MAFRA, 농식품부), 2017b, Application of public databasse of agriculture and food, lead to new foundation in the era of the fourth industrial revolution (in Korean: 농식품 공공데이터 활용, 4차 산업 혁명시대 새로운 창업 선도), A press releas, MAFRA.
  34. Ministry of Science, ICT and Future PLanning(MSIP), 2016, A comprehensive mid- and long-term plan of inteligence and information society for the fourth industrial revolution (in Korean: 제4차 산업혁명에 대응한 지능정보사회 중장기 종합대책), MISP and ministries concernced.
  35. Mun, H.J. and Woo, Y.T., 2006, Concept Extraction Technique from Documents Using Domain Ontology, The KIPS Transactions : Part D, 13(3), pp.309-316.
  36. Nam, W.H., Kim, T., Choi, J.Y., Kim, J.T., La, M.C., 2011, Wireless Sensor Network Development using RFID for Agricultural Water Management, Journal of the Korean Society of Agricultural Engineers, 53(5), pp.43-51. https://doi.org/10.5389/KSAE.2011.53.5.043
  37. Park, J.G. and Chang, Y.C., 2001, A Contruction of Spatial Database for Precision Farming, Kon-Kuk Journal of Natural Science and Technology, 12, pp.61-72.
  38. Park, J.H., Hwang, J.H., Lee, S.W., 2014a, The effect of the 6th industrialization in agriculture on farm and off-farm income, Journal of the Korean Society of Rural Planning, 20(4), pp.193-208. https://doi.org/10.7851/ksrp.2014.20.4.193
  39. Park, M., Kim, S.B., Kim, E.J., Rhee, S., Song, Y., Lim, C.S., Choi, J.A. and Chin, H.S., 2014b, The Current State of the Korean Rural Amenity Resource Database, Journal of the Korean Society of Rural Planning, 20(4), pp.263-276. https://doi.org/10.7851/ksrp.2014.20.4.263
  40. Role F. and Nadif, M., 2011, Handling the Impact of Low Frequency Events on Co-occurrence based Measures of Word Similarity - A Case Study of Pointwise Mutual Information, Proceedings of KDIR 2011 : KDIR- International Conference on Knowledge Discovery and Information Retrieval, pp.226-231
  41. Sanderson, M. and Croft, B., 1999, Deriving concept hierarchies from text, In Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval (SIGIR '99), ACM, New York, NY, USA, pp.206-213.
  42. Schwab, K., 2015, The Fourth Industrial : what it means, how to respond, Foreign Affairs, Accessed July 6. 2017. from https://www.foreignaffairs.com/articles/2015-12-12/fourth-industrial-revolution.
  43. Shannon, C.E., 1948, A Mathematical Theory of Communication, The Bell System Technical Journal, 27, pp.379-423. https://doi.org/10.1002/j.1538-7305.1948.tb01338.x
  44. Shim, J.H. and Choi, M.G., 2013, An Analysis of the Intellectual Structure of Venture-Creation Studies to build an Entrepreneurship Ontology, Knowledge Management Research, 14(4), pp.75-86.
  45. Song, D.G., 2005, A Study of Methodology for Automatic Construction of OWL Ontologies from Sejong Electronic Dictionary, Language and Information, 9(1), pp.19-34. https://doi.org/10.29403/LI.9.1.2
  46. Staab S., Studer, R., Schnurr, H.P. and Y. Sure, 2001, Knowledge processes and ontologies, IEEE Intelligent Systems, 16(1), pp. 26-34.
  47. Thomas R.G., 1993, A Translation Approach to Portable Ontology Specifications, Stanford Knowledge System Laboratory Technique Report KSL-92-71, pp.1-2.
  48. Uschold, M. and Gruninger, M., 1996, Ontologies: Principles, methods and applications, Knowledge Engineering Review, 11(2), pp.93-136. https://doi.org/10.1017/S0269888900007797
  49. Weinstein, P.C., 1998, Ontology-based metadata: transforming the MARC legacy, In Proceedings of the third ACM conference on Digital libraries (DL '98), pp.254-263.
  50. Yang, J.I., Lee, J.H., Hwang, D.Y., 2014, Empirical Study on the Activation Plan for 6th Industrialization of Rural Agricultural Resources: Focus on the Field Experts and the Complementary Demand, Journal of the Korean Society of Rural Planning,20(3), pp.111-120. https://doi.org/10.7851/ksrp.2014.20.3.111
  51. Yang, K.A., Yang, H.J., Yang, J.D., 2004, Bio-Ontology Generation Using Object-Oriented Ontology Manager, The KIPS transactions. Part B, 11(4), pp.437-448.
  52. Yarowsky D., 1995, Unsupervised word sense disambiguation rivaling supervised methods, In Proceedings of the 33rd annual meeting on Association for Computational Linguistics (ACL '95), Association for Computational Linguistics, Stroudsburg, PA, USA, pp.189-196.
  53. Yoo, S.B. and Yoon, H.K., 2000, Searching River Information using Ontology, Journal of the Korea open GIS association, 2(2), pp.117-126.