DOI QR코드

DOI QR Code

Change Acceptable In-Depth Searching in LOD Cloud for Efficient Knowledge Expansion

효과적인 지식확장을 위한 LOD 클라우드에서의 변화수용적 심층검색

  • 김광민 (솔트룩스 인공지능연구센터) ;
  • 손용락 (서경대학교 컴퓨터공학과)
  • Received : 2018.01.15
  • Accepted : 2018.06.23
  • Published : 2018.06.30

Abstract

LOD(Linked Open Data) cloud is a practical implementation of semantic web. We suggested a new method that provides identity links conveniently in LOD cloud. It also allows changes in LOD to be reflected to searching results without any omissions. LOD provides detail descriptions of entities to public in RDF triple form. RDF triple is composed of subject, predicates, and objects and presents detail description for an entity. Links in LOD cloud, named identity links, are realized by asserting entities of different RDF triples to be identical. Currently, the identity link is provided with creating a link triple explicitly in which associates its subject and object with source and target entities. Link triples are appended to LOD. With identity links, a knowledge achieves from an LOD can be expanded with different knowledge from different LODs. The goal of LOD cloud is providing opportunity of knowledge expansion to users. Appending link triples to LOD, however, has serious difficulties in discovering identity links between entities one by one notwithstanding the enormous scale of LOD. Newly added entities cannot be reflected to searching results until identity links heading for them are serialized and published to LOD cloud. Instead of creating enormous identity links, we propose LOD to prepare its own link policy. The link policy specifies a set of target LODs to link and constraints necessary to discover identity links to entities on target LODs. On searching, it becomes possible to access newly added entities and reflect them to searching results without any omissions by referencing the link policies. Link policy specifies a set of predicate pairs for discovering identity between associated entities in source and target LODs. For the link policy specification, we have suggested a set of vocabularies that conform to RDFS and OWL. Identity between entities is evaluated in accordance with a similarity of the source and the target entities' objects which have been associated with the predicates' pair in the link policy. We implemented a system "Change Acceptable In-Depth Searching System(CAIDS)". With CAIDS, user's searching request starts from depth_0 LOD, i.e. surface searching. Referencing the link policies of LODs, CAIDS proceeds in-depth searching, next LODs of next depths. To supplement identity links derived from the link policies, CAIDS uses explicit link triples as well. Following the identity links, CAIDS's in-depth searching progresses. Content of an entity obtained from depth_0 LOD expands with the contents of entities of other LODs which have been discovered to be identical to depth_0 LOD entity. Expanding content of depth_0 LOD entity without user's cognition of such other LODs is the implementation of knowledge expansion. It is the goal of LOD cloud. The more identity links in LOD cloud, the wider content expansions in LOD cloud. We have suggested a new way to create identity links abundantly and supply them to LOD cloud. Experiments on CAIDS performed against DBpedia LODs of Korea, France, Italy, Spain, and Portugal. They present that CAIDS provides appropriate expansion ratio and inclusion ratio as long as degree of similarity between source and target objects is 0.8 ~ 0.9. Expansion ratio, for each depth, depicts the ratio of the entities discovered at the depth to the entities of depth_0 LOD. For each depth, inclusion ratio illustrates the ratio of the entities discovered only with explicit links to the entities discovered only with link policies. In cases of similarity degrees with under 0.8, expansion becomes excessive and thus contents become distorted. Similarity degree of 0.8 ~ 0.9 provides appropriate amount of RDF triples searched as well. Experiments have evaluated confidence degree of contents which have been expanded in accordance with in-depth searching. Confidence degree of content is directly coupled with identity ratio of an entity, which means the degree of identity to the entity of depth_0 LOD. Identity ratio of an entity is obtained by multiplying source LOD's confidence and source entity's identity ratio. By tracing the identity links in advance, LOD's confidence is evaluated in accordance with the amount of identity links incoming to the entities in the LOD. While evaluating the identity ratio, concept of identity agreement, which means that multiple identity links head to a common entity, has been considered. With the identity agreement concept, experimental results present that identity ratio decreases as depth deepens, but rebounds as the depth deepens more. For each entity, as the number of identity links increases, identity ratio rebounds early and reaches at 1 finally. We found out that more than 8 identity links for each entity would lead users to give their confidence to the contents expanded. Link policy based in-depth searching method, we proposed, is expected to contribute to abundant identity links provisions to LOD cloud.

본 연구는 시멘틱 웹의 실질적 구현체인 LOD 클라우드에서 연결정책을 활용함으로써 LOD들간 연결을 효과적으로 제공하고 LOD의 변경된 내용을 검색결과에 빠짐없이 반영할 수 있는 방안을 제시한다. 현재 LOD 클라우드에서는 개체간 연결은 를 이용하여 개체들이 동일함을 명시적으로 기술하는 방식으로 이루어져 있다. 하지만, 이러한 명시적 연결방식은 LOD 클라우드 규모의 방대함에도 불구하고 개체간 동일성을 개체단위에서 파악하여야 하는 어려움이 있으며 주기적으로 LOD에 추가하여야 함에 따라 검색 시 개체들이 누락되는 한계가 있다. 이를 극복하기 위하여 본 연구에서는 명시적 연결을 생성하는 대신 LOD별로 연결하고자 하는 LOD와의 연결정책을 수립하여 LOD와 함께 공개하는 방식을 제안한다. 연결정책을 활용함으로써 연결하여야 할 동일개체를 검색시점에서 파악할 수 있으므로 추가되었던 개체들을 누락됨 없이 검색결과에 포함시킬 수 있고 LOD 클라우드에서의 연결성도 효과적으로 확충할 수 있다. 확충된 연결성은 정보의 지능적 처리의 선행과정인 지식확장의 근간이 된다. 연결정책은 연결하고자 하는 소스와 타겟 LOD의 주어 개체들간의 동일성을 평가하는데 도움이 되는 술어 쌍을 명세하는 방식으로 수립하며 검색 시 이러한 술어쌍에 대응하는 RDF 트리플을 검색하고 이들의 목적어들이 충분히 동일한 것인가를 평가하여 주어개체들의 동일수준을 판단한다. 본 연구에서는 이러한 연결정책을 이용하여 여러 LOD들을 심층적으로 검색하는 시스템을 구현하였다. 검색과정에서는 기존 명시적 연결들도 함께 활용하도록 구현하였다. 검색시스템에 대한 실험은 DBpedia의 주요 LOD들을 대상으로 진행하였다. 실험결과 연결대상 개체들의 목적어들이 0.8 ~ 0.9의 유사수준을 가지는 경우 적정한 확장성을 가지고 충분히 신뢰적인 개체들을 적절하게 포함하는 것으로 확인하였다. 또한, 개체들은 8개 이상의 동일연결을 제공하여야 검색결과가 신뢰적으로 활용될 수 있을 것으로 파악되었다.

Keywords

References

  1. Abele A. and McCrae J., The Linked Open Data cloud diagram, 2017. Available at http://lod-cloud.net/ (Downloaded 21 December, 2017)
  2. Auer, S., et al., T., Linked open data - creating knowledge out of interlinked data, Springer, U.S.A., 2014
  3. Bill, C. and Mary, K., Towards a semantic web, Chandos Publishing, U.K., 2011
  4. Bizer C., Is the Semantic Web what we expected, 2017. Available at https://www.slideshare.net/bizer/is-the-semantic-web-what-we-expected-adoption-patterns-and-contentdriven-challenges-iswc-2016-keynote (Downloaded 21 December, 2017)
  5. Bob, D., Learning SPARQL, O'REILLY, U.S.A., 2013
  6. Brown, Peter F., et al, "Class-based n-gram models of natural language" Computational linguistics, Vol.18, No.4(1992), 467-479.
  7. Carol, G. and Shenghui, W., Library linked data in the cloud, Morgan & Claypool Publishers, U.S.A., 2015
  8. Choi, Y., Park, J., "The Need for Paradigm Shift in Semantic Similarity and Semantic Relatedness : From Cognitive Semantics Perspective", Journal of Intelligence and Information Systems, Vol.19, No.1(2013), 111-123. https://doi.org/10.13088/jiis.2013.19.1.111
  9. David, W., et al., Linked data: A Structured Data on the Web, Manning Publication, U.S.A., 2014
  10. Dean, A. and James, H., Semantic Web for the Working Ontologist: Effective Modeling in RDFS and OWL, Elsevier, 2011
  11. Erik, M., Library Linked Data: Research and Adoption, ALA TechSource, U.S.A., 2014
  12. Frank, C. and Lars, S., Linked Data and User Interaction, ILFA Publications, 2016
  13. Grigoris, A., et al., A Semantic Web Primer, MIT Press, 2012
  14. Hart, G. and Catherine, D., Linked data: A Geographic Perspective, CRC Press, U.S.A., 2013
  15. Harth, A., et al., Linked Data Management, 1st Ed., 20-25. CRC Press, U.S.A., 2014
  16. Heath, T. and Bizer, C. Linked Data: Evolving the Web into a Global Data Space, Morgan & Claypool, U.S.A., 2011.
  17. Hitzler P., et al., Foundation of Semantic Web Technologies, CRC Press, U.S.A., 2009
  18. Jeff, Z. and Guido, V., Exploiting Linked Data and Knowledge Graphs in Large Organizations, Springer, Switzerland, 2017
  19. Jeong, H., "A Study on Ontology and Topic Modeling-based Multi-dimensional Knowledge Map Services", Journal of Intelligence and Information Systems, Vol.21, No.4(2015), 79-92. https://doi.org/10.13088/JIIS.2015.21.4.079
  20. Konstantinou, N., Materializing the Web of Linked Data, 1st Ed., 51. Springer, U.S.A., 2015
  21. Michael, D., The Great Cloud Migration: Your Roadmap to Cloud Computing, Big Data and Linked Data, Outskirts Press, U.S.A., 2013
  22. Martin H., et al., Ontology Management: Semantic Web, Semantic Web Services, and Business Applications (Semantic Web and Beyond), Springer, U.S.A., 2007
  23. Ngonga, A. and Auer, S., "LIMES - A Time-Effi cient Approach for Large-Scale Link Discovery on the Web of Data", Proc. of the 22nd IJCAI, (2011), 2312-2317.
  24. Park, H. J., Lee, H. J., Kim, J. W., "Participation Level in Online Knowledge Sharing: Behavioral Approach on Wikipedia", Journal of Intelligence and Information Systems, Vol.19, No.4(2013), 97-121. https://doi.org/10.13088/jiis.2013.19.4.097
  25. Park J. and Sohn, Y., "A Syntax Added Link Evaluation Technique for Improving Trustworthiness of LOD's Linkages", Journal of KIISE: Databases, Vol. 41, No. 1 (2014), 45-61.
  26. Park J. and Sohn, Y., "Trustworthiness Improving Link Evaluation Technique for LOD Linkages giving Considerations to the Syntactic Properties of RDFS, OWL, and OWL2", Journal of KIISE: Databases, Vol. 41, No. 4 (2014), 226-241.
  27. Robert, A. and Barry S., Ontologies with Basic Formal Ontology, MIT Press, 2015
  28. Schmachtenberg, M. and Bizerand, H., State of the LOD Cloud, 2017, Available at http://linkeddatacatalog.dws.informatik.uni-mannheim.de/state/ (Downloaded 21 December, 2017)
  29. Sikos, F., Mastering Structured Data on the Semantic Web, Apress, U.S.A., 2015
  30. Volz J., et al., "Silk - A Link Discovery Framework for the Web of Data", Proc. of the 2nd Workshop on Linked Data on the Web 2009 (LDOW2009), (2009), 238-247.
  31. W3C, What is Linked Data, 2017, Available at https://www.w3.org/standards/semanticweb/data (Downloaded 21 December, 2017).