Retrieval Performance of XML Documents Using Object-Relational Databases

Evaluation of the Retrieval Performance of XML Documents Using an Object-Relational Database


Abstract

The purpose of this study is to evaluate the performance of XML retrieval based on an ORDBMS (Object-Relational Database Management System) approach. This paper describes indexing and retrieval methods for XML documents and the experimental methodology of INEX (Initiative for the Evaluation of XML Retrieval). As in any traditional information retrieval experiment, the test collection consisted of documents, topics/queries, a task, relevance assessments, and evaluation. EXIMA$^{TM}$ Supply, a kind of native XML DB based on ORDBMS technologies, was used for this experiment. Although this approach has many benefits, for example no delay in storing and searching XML documents, it showed relatively disappointing retrieval performance at INEX 2002. This result may have arisen because the given topics had to be decomposed and modified to be processed by the XPath processor; during this modification the original meaning of a topic can inevitably change, and some important information may be overlooked.

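The purpose of this study is to evaluate the retrieval performance of XML documents under an object-relational database approach. This paper describes the indexing and retrieval methods for XML documents at INEX (Initiative for the Evaluation of XML Retrieval) and the experimental methodologies. As in most traditional information retrieval evaluation experiments, the test collection used in this study consisted of documents (that is, XML documents), topics, ad hoc retrieval, relevance assessments, and evaluation. EXIMA$^{TM}$ Supply, a kind of native XML database developed on ORDBMS technologies, was used to store and search the large-scale XML documents provided by INEX. This paper discusses an overview of the functions of the system used in the experiment, the indexing and retrieval process, the performance evaluation results at INEX 2002, and the functions that need to be improved in the future.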
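
The abstract attributes the weak INEX 2002 results to rewriting each topic into a form an XPath processor can run. The following is a minimal sketch of how such a rewrite can lose information. It is not the EXIMA Supply implementation: the sample document, the element names, the `decompose` and `strict_search` helpers, and the Boolean AND matching rule are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical miniature article; element names are illustrative only.
DOC = """
<article>
  <sec><st>Indexing</st><p>Inverted files support fast text retrieval.</p></sec>
  <sec><st>Storage</st><p>XML retrieval on relational storage.</p></sec>
  <sec><st>Summary</st><p>Closing remarks.</p></sec>
</article>
"""

def decompose(topic):
    """Split a topic into a structural path and a list of required terms."""
    return topic["target"], [t.lower() for t in topic["terms"]]

def strict_search(root, topic):
    """Return titles of target elements that contain every topic term."""
    path, terms = decompose(topic)
    hits = []
    for elem in root.findall(path):
        # Concatenate all text below the element and match case-insensitively.
        text = " ".join(elem.itertext()).lower()
        if all(term in text for term in terms):
            hits.append(elem.findtext("st"))
    return hits

root = ET.fromstring(DOC)
topic = {"target": ".//sec", "terms": ["text", "retrieval"]}
print(strict_search(root, topic))  # ['Indexing']
```

The "Storage" section also discusses retrieval, but the exact-match rewrite drops it because it lacks the second term. A ranked retrieval model could instead return it with a lower score.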
