DOI QR코드

DOI QR Code

Change Acceptable In-Depth Searching in LOD Cloud for Efficient Knowledge Expansion

효과적인 지식확장을 위한 LOD 클라우드에서의 변화수용적 심층검색

  • 김광민 (솔트룩스 인공지능연구센터) ;
  • 손용락 (서경대학교 컴퓨터공학과)
  • Received : 2018.01.15
  • Accepted : 2018.06.23
  • Published : 2018.06.30

Abstract

LOD(Linked Open Data) cloud is a practical implementation of semantic web. We suggested a new method that provides identity links conveniently in LOD cloud. It also allows changes in LOD to be reflected to searching results without any omissions. LOD provides detail descriptions of entities to public in RDF triple form. RDF triple is composed of subject, predicates, and objects and presents detail description for an entity. Links in LOD cloud, named identity links, are realized by asserting entities of different RDF triples to be identical. Currently, the identity link is provided with creating a link triple explicitly in which associates its subject and object with source and target entities. Link triples are appended to LOD. With identity links, a knowledge achieves from an LOD can be expanded with different knowledge from different LODs. The goal of LOD cloud is providing opportunity of knowledge expansion to users. Appending link triples to LOD, however, has serious difficulties in discovering identity links between entities one by one notwithstanding the enormous scale of LOD. Newly added entities cannot be reflected to searching results until identity links heading for them are serialized and published to LOD cloud. Instead of creating enormous identity links, we propose LOD to prepare its own link policy. The link policy specifies a set of target LODs to link and constraints necessary to discover identity links to entities on target LODs. On searching, it becomes possible to access newly added entities and reflect them to searching results without any omissions by referencing the link policies. Link policy specifies a set of predicate pairs for discovering identity between associated entities in source and target LODs. For the link policy specification, we have suggested a set of vocabularies that conform to RDFS and OWL. Identity between entities is evaluated in accordance with a similarity of the source and the target entities' objects which have been associated with the predicates' pair in the link policy. We implemented a system "Change Acceptable In-Depth Searching System(CAIDS)". With CAIDS, user's searching request starts from depth_0 LOD, i.e. surface searching. Referencing the link policies of LODs, CAIDS proceeds in-depth searching, next LODs of next depths. To supplement identity links derived from the link policies, CAIDS uses explicit link triples as well. Following the identity links, CAIDS's in-depth searching progresses. Content of an entity obtained from depth_0 LOD expands with the contents of entities of other LODs which have been discovered to be identical to depth_0 LOD entity. Expanding content of depth_0 LOD entity without user's cognition of such other LODs is the implementation of knowledge expansion. It is the goal of LOD cloud. The more identity links in LOD cloud, the wider content expansions in LOD cloud. We have suggested a new way to create identity links abundantly and supply them to LOD cloud. Experiments on CAIDS performed against DBpedia LODs of Korea, France, Italy, Spain, and Portugal. They present that CAIDS provides appropriate expansion ratio and inclusion ratio as long as degree of similarity between source and target objects is 0.8 ~ 0.9. Expansion ratio, for each depth, depicts the ratio of the entities discovered at the depth to the entities of depth_0 LOD. For each depth, inclusion ratio illustrates the ratio of the entities discovered only with explicit links to the entities discovered only with link policies. In cases of similarity degrees with under 0.8, expansion becomes excessive and thus contents become distorted. Similarity degree of 0.8 ~ 0.9 provides appropriate amount of RDF triples searched as well. Experiments have evaluated confidence degree of contents which have been expanded in accordance with in-depth searching. Confidence degree of content is directly coupled with identity ratio of an entity, which means the degree of identity to the entity of depth_0 LOD. Identity ratio of an entity is obtained by multiplying source LOD's confidence and source entity's identity ratio. By tracing the identity links in advance, LOD's confidence is evaluated in accordance with the amount of identity links incoming to the entities in the LOD. While evaluating the identity ratio, concept of identity agreement, which means that multiple identity links head to a common entity, has been considered. With the identity agreement concept, experimental results present that identity ratio decreases as depth deepens, but rebounds as the depth deepens more. For each entity, as the number of identity links increases, identity ratio rebounds early and reaches at 1 finally. We found out that more than 8 identity links for each entity would lead users to give their confidence to the contents expanded. Link policy based in-depth searching method, we proposed, is expected to contribute to abundant identity links provisions to LOD cloud.

References

  1. Abele A. and McCrae J., The Linked Open Data cloud diagram, 2017. Available at http://lod-cloud.net/ (Downloaded 21 December, 2017)
  2. Auer, S., et al., T., Linked open data - creating knowledge out of interlinked data, Springer, U.S.A., 2014
  3. Bill, C. and Mary, K., Towards a semantic web, Chandos Publishing, U.K., 2011
  4. Bizer C., Is the Semantic Web what we expected, 2017. Available at https://www.slideshare.net/bizer/is-the-semantic-web-what-we-expected-adoption-patterns-and-contentdriven-challenges-iswc-2016-keynote (Downloaded 21 December, 2017)
  5. Bob, D., Learning SPARQL, O'REILLY, U.S.A., 2013
  6. Brown, Peter F., et al, "Class-based n-gram models of natural language" Computational linguistics, Vol.18, No.4(1992), 467-479.
  7. Carol, G. and Shenghui, W., Library linked data in the cloud, Morgan & Claypool Publishers, U.S.A., 2015
  8. Choi, Y., Park, J., "The Need for Paradigm Shift in Semantic Similarity and Semantic Relatedness : From Cognitive Semantics Perspective", Journal of Intelligence and Information Systems, Vol.19, No.1(2013), 111-123. https://doi.org/10.13088/jiis.2013.19.1.111
  9. David, W., et al., Linked data: A Structured Data on the Web, Manning Publication, U.S.A., 2014
  10. Dean, A. and James, H., Semantic Web for the Working Ontologist: Effective Modeling in RDFS and OWL, Elsevier, 2011
  11. Erik, M., Library Linked Data: Research and Adoption, ALA TechSource, U.S.A., 2014
  12. Frank, C. and Lars, S., Linked Data and User Interaction, ILFA Publications, 2016
  13. Grigoris, A., et al., A Semantic Web Primer, MIT Press, 2012
  14. Hart, G. and Catherine, D., Linked data: A Geographic Perspective, CRC Press, U.S.A., 2013
  15. Harth, A., et al., Linked Data Management, 1st Ed., 20-25. CRC Press, U.S.A., 2014
  16. Heath, T. and Bizer, C. Linked Data: Evolving the Web into a Global Data Space, Morgan & Claypool, U.S.A., 2011.
  17. Hitzler P., et al., Foundation of Semantic Web Technologies, CRC Press, U.S.A., 2009
  18. Jeff, Z. and Guido, V., Exploiting Linked Data and Knowledge Graphs in Large Organizations, Springer, Switzerland, 2017
  19. Jeong, H., "A Study on Ontology and Topic Modeling-based Multi-dimensional Knowledge Map Services", Journal of Intelligence and Information Systems, Vol.21, No.4(2015), 79-92. https://doi.org/10.13088/JIIS.2015.21.4.079
  20. Konstantinou, N., Materializing the Web of Linked Data, 1st Ed., 51. Springer, U.S.A., 2015
  21. Michael, D., The Great Cloud Migration: Your Roadmap to Cloud Computing, Big Data and Linked Data, Outskirts Press, U.S.A., 2013
  22. Martin H., et al., Ontology Management: Semantic Web, Semantic Web Services, and Business Applications (Semantic Web and Beyond), Springer, U.S.A., 2007
  23. Ngonga, A. and Auer, S., "LIMES - A Time-Effi cient Approach for Large-Scale Link Discovery on the Web of Data", Proc. of the 22nd IJCAI, (2011), 2312-2317.
  24. Park, H. J., Lee, H. J., Kim, J. W., "Participation Level in Online Knowledge Sharing: Behavioral Approach on Wikipedia", Journal of Intelligence and Information Systems, Vol.19, No.4(2013), 97-121. https://doi.org/10.13088/jiis.2013.19.4.097
  25. Park J. and Sohn, Y., "A Syntax Added Link Evaluation Technique for Improving Trustworthiness of LOD's Linkages", Journal of KIISE: Databases, Vol. 41, No. 1 (2014), 45-61.
  26. Park J. and Sohn, Y., "Trustworthiness Improving Link Evaluation Technique for LOD Linkages giving Considerations to the Syntactic Properties of RDFS, OWL, and OWL2", Journal of KIISE: Databases, Vol. 41, No. 4 (2014), 226-241.
  27. Robert, A. and Barry S., Ontologies with Basic Formal Ontology, MIT Press, 2015
  28. Schmachtenberg, M. and Bizerand, H., State of the LOD Cloud, 2017, Available at http://linkeddatacatalog.dws.informatik.uni-mannheim.de/state/ (Downloaded 21 December, 2017)
  29. Sikos, F., Mastering Structured Data on the Semantic Web, Apress, U.S.A., 2015
  30. Volz J., et al., "Silk - A Link Discovery Framework for the Web of Data", Proc. of the 2nd Workshop on Linked Data on the Web 2009 (LDOW2009), (2009), 238-247.
  31. W3C, What is Linked Data, 2017, Available at https://www.w3.org/standards/semanticweb/data (Downloaded 21 December, 2017).