DOI QR코드

DOI QR Code

A Common Shape Model To Obtain Multiple Schema Representations For RDF Knowledge Graphs Derived From RML Mappings

  • Ji-Woong Choi (School of Computer Science and Engineering, Soongsil University)
  • 투고 : 2024.10.17
  • 심사 : 2024.11.06
  • 발행 : 2024.11.29

초록

본 논문에서는 RML 매핑으로 유도된 RDF 지식 그래프에 대한 스키마를 공통 형상 모델이라는 구조를 통해 여러 언어로 자동 생성해 주는 시스템을 제안한다. 형상이란 그래프 스키마 언어에서 그래프가 갖추어야 할 제약 조건의 집합을 뜻한다. 공통 형상 모델의 구조는 특정 스키마 언어의 구문에 독립적이다. 이 모델을 구성하는 데이터는 사용자가 RML 문법에 따라 정의한 매핑 규칙 그리고 RML 매핑에서 입력 자격인 다양한 포맷의 원천 데이터를 정해진 알고리즘에 따라 분석 및 가공 후 수집된 것들이다. 제안하는 시스템은 개별 스키마 언어들의 구문에 맞게 모델 데이터를 배열하는 방법을 정의할 수 있게 하는 공통 인터페이스를 제공한다. 즉, 개별 언어는 이 인터페이스를 구현하기만 하면 자신의 구문으로 스키마를 출력할 수 있다. 본 논문에서는 제안하는 방법의 실용성을 보이기 위해 대표적인 RDF 그래프 스키마 언어인 SHACL과 ShEx로 스키마를 출력하도록 구현하였으며 출력 스키마가 유효함을 보이는 테스트 결과를 제시한다.

In this paper, we propose a system to provide schemas written in multiple languages for RDF knowledge graphs derived from RML mappings. The core of the system is a common shape model designed to support both SHACL and ShEx, which are all languages for representing the structure of RDF graphs, called "shapes". The model is syntax-neutral, but includes enough data to represent constraints with the two schema languages. The data in the model are automatically collected from RML mapping rules themselves and metadata about input data sources in RML mappings. Also, the presented system provides a common interface to support translation of the model to multiple languages. The translation to each language is performed by each syntax-specific implementation of the interface, so the model data are rearranged according to each language's syntax. This paper describes the proposed method in detail and also presents implementation results.

키워드

참고문헌

  1. A. Hogan, E. Blomqvist, M. Cochez, C. D'amato, G. De Melo, C. Gutierrez, S. Kirrane, J. E. L. Gayo, R. Navigli, S. Neumaier, A. N. Ngomo, A. Polleres, S. M. Rashid, A. Rula, L. Schmelzeisen, J. Sequeda, S. Staab, and A. Zimmermann, "Knowledge Graphs," ACM Computing Surveys, Vol. 54, No. 4, pp. 1-37, May 2022. DOI: 10.1145/3447772 
  2. N. Noy, Y. Gao, A. Jain, A. Narayanan, A. Patterson, and J. Taylor, "Industry-Scale Knowledge Graphs: Lessons and Challenges," Communications of the ACM, Vol. 62, No. 8, pp. 36-43, August 2019. DOI: 10.1145/3331166 
  3. H. Arnaout, and S. Elbassuoni, "Effective searching of RDF knowledge graphs," Journal of Web Semantics, Vol. 48, pp. 66-84, June 2018. DOI: 10.1016/j.websem.2017.12.001 
  4. A. Dimou, "High-quality knowledge graphs generation: R2rml and rml comparison, rules validation and inconsistency resolution," Applications and Practices in Ontology Design, Extraction, and Reasoning, Vol. 49, No. 4, pp. 55-72, November 2020. DOI: 10.3233/SSW200035 
  5. V. Janev, D. Graux, H. Jabeen, and E. Sallinger, "Knowledge Graphs and Big Data Processing," Springer Cham, pp. 59-72, July 2020. 
  6. H. Knublauch, and D. Kontokostas, Shapes Constraint Language (SHACL), http://www.w3.org/TR/shacl/
  7. E. Prud'hommeaux, I. Boneva, J. E. L. Gayo, and G. Kellogg, Shape Expressions Language 2.1, https://shex.io/shex-semantics/index.html
  8. M. Arenas, A. Bertails, E. Prud'hommeaux, and J. Sequeda, A Direct Mapping of Relational Data to RDF, http://www.w3.org/TR/rdb-direct-mapping/
  9. S. Das, S. Sundara, and R. Cyganiak, R2RML: RDB to RDF Mapping Language, http://www.w3.org/TR/r2rml/
  10. R. B. Thapa, and M. Giese, "A Source-to-Target Constraint Rewriting for Direct Mapping," Proceedings of 20th International Semantic Web Conference, pp. 21-38, Virtual Event, October 2021. DOI: 10.1007/978-3-030-88361-4_2 
  11. J. Choi, "Automatic Construction of SHACL Schemas for RDF Knowledge Graphs Generated by Direct Mappings," Journal of the Korea Society of Computer and Information Vol. 25, No. 10, pp. 23-34, October 2020. DOI: 10.9708/JKSCI.2020.25.10.023 
  12. I. Boneva, J. Lozano, and S. Staworko, "Relational to RDF Data Exchange in Presence of a Shape Expression Schema," Proceedings of the 12th Alberto Mendelzon International Workshop on Foundations of Data Management, pp. 1-16, Cali, Colombia, May 2018. DOI: 10.48550/arXiv.1804.11052 
  13. J. Choi, "ShEx Schema Generator for RDF Graphs Created by Direct Mapping," Journal of the Korea Society of Computer and Information Vol. 23, No. 10, pp. 33-43, October 2018. DOI: 10.9708/JKSCI.2018.23.10.033 
  14. I. Boneva, J. Lozano, and S. Staworko, "Consistency and Certain Answers in Relational to RDF Data Exchange with Shape Constraints," Proceedings of New Trends in Databases and Information Systems, pp. 97-107, Lyon, France, August 2020. DOI: 10.1007/978-3-030-54623-6_9 
  15. J. Choi, "Automatic Construction of SHACL Schemas for RDF Knowledge Graphs Generated by R2RML Mappings," Journal of the Korea Society of Computer and Information Vol. 25, No. 8, pp. 9-21, August 2020. DOI: 10.9708/JKSCI.2020.25.08.009 
  16. J. Choi, "R2RML Based ShEx Schema," Journal of the Korea Society of Computer and Information Vol. 23, No. 10, pp. 45-55, October 2018. DOI: 10.9708/JKSCI.2018.23.10.045 
  17. T. Delva, B. D. Smedt, S. M. Oo, D. V. Assche, S. Lieber, and A. Dimou, "RML2SHACL: RDF Generation Is Shaping Up," Proceedings of the 11th on Knowledge Capture Conference (K-CAP '21), pp. 153-160, New York, USA, December 2021. DOI: 10.1145/3460210.3493562 
  18. G. Carothers, RDF 1.1 N-Quads, https://www.w3.org/TR/n-quads/
  19. P. Heyvaert, A. Dimou and B. D. Meester, "RML Test Cases," https://rml.io/test-cases/
  20. G. Carothers, RDF 1.1 N-Quads, https://www.w3.org/TR/n-quads/