DOI QR코드

DOI QR Code

Empirical Analysis on the Shortcut Benefit Function and its Factors for Triple Database

트리플 데이터베이스 단축 경로 이득 함수와 구성 인자 실험 분석

  • Received : 2014.01.24
  • Accepted : 2014.02.17
  • Published : 2014.02.28

Abstract

A triple database consisting of a number of three-column tables require high cost of query processing, whereby building a shortcut is known as an effective way to reduce the cost. It is important to figure out what shortcuts needs to be selectively built. Most shortcut selection algorithms make use of a benefit model that considers the query frequency. However they work poor to reflect the database update. In this paper, we consider a benefit model for triple databases. The model considers not only the profit of query response times but also the building and maintenance costs of the shortcuts. We apply the model to design a benefit function which can be plugged in a greedy-based shortcut selection algorithm. We perform the empirical experiments on a real-world dataset and analyze the effect of each factor employed in the benefit function.

3-컬럼의 트리플 테이블로 구성되는 트리플 데이터베이스의 질의 처리는 고비용이 드는데, 단축 경로는 그 비용을 감소시키는 방법으로 알려졌다. 어떠한 단축 경로를 선택 구성할지는 주요한 문제이며, 질의 빈도를 기반으로 단축 경로 이득을 계산하는 방식이 주로 사용된다. 하지만 이러한 방식은 트리플 데이터의 추가 혹은 변경을 적절히 반영하지 못한다. 본 논문에서는 질의 처리 시간 단축 측면뿐 아니라 경로 구축 및 유지 비용도 고려하는 이득 모델을 다룬다. 이득 모델은 이득 함수로 설계되어 단축 경로 선택 기법에 적용된다. 이득 함수 구성 인자가 미치는 영향을 실세계 트리플 데이터를 사용해 실험 분석한다.

Keywords

References

  1. Abadi, D. J., Marcus, A., Madden, S. R., and Hollenbach, K., Scalable semantic web data management using vertical partitioning. In Proceedings of the 33rd international conference on Very large data bases (VLDB ʼ07), pp. 411-422. VLDB Endowment, 2007.
  2. Abadi, D. J., Marcus, A., Madden, S. R., and Hollenbach, K., SW-Store : a vertically partitioned DBMS for Semantic Web data management, The VLDB Journal, Vol. 18, No. 2, pp. 385-406, 2009. https://doi.org/10.1007/s00778-008-0125-y
  3. Agrawal, S., Chaudhuri, S., and Narasayya, V. R., Automated Selection of Materialized Views and Indexes in SQL Databases. In Proceedings of the 26th International Conference on Very Large Data Bases (VLDB ʼ00), pp. 496-505. Morgan Kaufmann Publishers Inc., 2000.
  4. Arias, M., Fernandez, J. D., Martinez- Prieto, M. A. and de la Fuente, P., An Empirical Study of Real-World SPARQL Queries, In proceedings of the 1st International Workshop on Usage Analysis and the Web of Data (USEWOD2011) in the 20th International World Wide Web Conference (WWW2011). 2011.
  5. Constantopoulos, P., Dritsou, V., and Foustoucos, E., Developing query patterns. In Proceedings of the 13th European conference on Research and advanced technology for digital libraries (ECDLʼ09), pp. 119-124. Springer-Verlag, 2009.
  6. Dritsou, V., Constantopoulos, P., Deligiannakis, A., and Kotidis, Y., Optimizing query shortcuts in RDF databases. In proceedings of the 8th extended semantic web conference on the semantic web : research and applications-Volume Part II (ESWC'11), pp. 77-92. Springer-Verlag, 2011.
  7. Huang, J., Abadi, D. J., and Ren, K., Scalable SPARQL Querying of Large RDF Graphs. In Proceedings of the VLDB Endowment, Vol. 4, No. 11, pp. 1123-1134, 2011.
  8. Kang, S., An Indexing Framework for Improving Data Consistency of Triple Database, Ph. D. Thesis, Seoul National University. 2013.
  9. Kang, S., Shim, J., and Lee, S.-g., Tridex : A Lightweight Triple Index for Relational Database-Based Semantic Web Data Management, Expert Systems with Applications, Vol. 40, No. 9, pp. 3421-3431, Elsevier, 2013. https://doi.org/10.1016/j.eswa.2012.12.050
  10. Karloff, H. and Mihail, M., On the complexity of the view-selection problem. In Proceedings of the eighteenth ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems (PODS ʼ99), pp. 167-173. ACM, 1999.
  11. Lee, H., Shim, J., and Kim, D., Ontological modeling of e-catalogs using EER and description logics. In Proceedings of International Workshop on Data Engineering Issues in E-Commerce (DEECʼ05), IEEE, 2005.
  12. Lee, M., Lee, H., and Shim, J., Analysis and Modeling of Semantic Relationships in e-Catalog Domain. The Journal of Society for e-Business Studies, Society for e-Business Studies, Vol. 9, No. 3, pp. 243-258, 2004.
  13. Ley, M. The DBLP computer science bibliography. http://www.informatik.unitrier.de/-ley/db/. Nov 15, 2012.
  14. Page, L., Brin, S., Motwani, R., Winograd, T., The PageRank citation ranking : bringing order to the Web. In proceedings of the 7th International World Wide Web Conference, pp. 161-172. 1998.
  15. Scheuermann, P., Shim, J., and Vingralek, R. WATCHMAN : A data warehouse intelligent cache manager. In Proceedings of the 22th International Conference on Very Large Data Bases (VLDB ʼ96), pp. 51-62. Morgan Kaufmann Publishers Inc., 1996.

Cited by

  1. RDF 질의 처리 성능 향상을 위한 실체 뷰 선택 기법 vol.15, pp.12, 2015, https://doi.org/10.5392/jkca.2015.15.12.024