Semi Automatic Ontology Generation about XML Documents

  • Gu Mi Sug (Database Laboratory, Chungbuk National University, Korea) ;
  • Hwang Jeong Hee (Database Laboratory, Chungbuk National University, Korea) ;
  • Ryu Keun Ho (Database Laboratory, Chungbuk National University, Korea) ;
  • Jung Doo Yeong (Database Laboratory, Chungbuk National University, Korea) ;
  • Lee Keum Woo (Database Laboratory, Chungbuk National University, Korea)
  • Published : 2004.10.01

Abstract

XML (eXtensible Markup Language) is becoming the standard for exchanging documents on the web. As the amount of information grows with the development of Internet technology, the semantic web is emerging to provide more accurate information retrieval results than the existing web. An ontology, which is the basis of the semantic web, provides a basic knowledge system for expressing particular knowledge, and can therefore yield more accurate retrieval results. An ontology defines the concepts of a specific domain and the relationships between those concepts, and it has a hierarchy similar to a taxonomy. In this paper, we propose a method for semi-automatically generating an ontology from XML documents, which interest many researchers as a means of knowledge representation. To construct an ontology for a particular domain, we suggest an algorithm for determining the domain, and we chose the extraction of movie information on the web as the domain of our ontology. To generate the ontology, we applied generalized association rules, a data mining method, to the tags and contents of XML documents. XTM (XML Topic Maps), an ISO standard, is used as the ontology language. The advantage of this method is that, because the ontology is built from terms frequently used in documents related to the domain, it is useful for querying and retrieving information in that domain.
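
The abstract does not spell out the mining procedure, so the following is only a minimal sketch of how generalized association rules could be mined from XML tags, assuming that each document is treated as one transaction of tag names and that a small term taxonomy is already available. The sample documents, the `TAXONOMY` mapping, the function names, and the thresholds are all hypothetical and are not taken from the paper.

```python
import xml.etree.ElementTree as ET
from collections import Counter
from itertools import combinations

# Hypothetical sample documents from a movie domain (not from the paper).
DOCS = [
    "<movie><title>A</title><director>Kim</director><actor>Lee</actor></movie>",
    "<movie><title>B</title><director>Park</director></movie>",
    "<movie><title>C</title><actor>Choi</actor></movie>",
]

# Hypothetical taxonomy: child term -> parent term.
TAXONOMY = {"director": "person", "actor": "person"}


def ancestors(item):
    """Return every ancestor of an item in the taxonomy, nearest first."""
    result = []
    parent = TAXONOMY.get(item)
    while parent is not None:
        result.append(parent)
        parent = TAXONOMY.get(parent)
    return result


def transaction(xml_text):
    """Return the tag names of one document, extended with their ancestors."""
    items = {element.tag for element in ET.fromstring(xml_text).iter()}
    for item in list(items):
        items.update(ancestors(item))
    return items


def mine_rules(docs, min_support=0.5, min_confidence=0.8):
    """Return (lhs, rhs, support, confidence) for single-item rules lhs -> rhs."""
    transactions = [transaction(doc) for doc in docs]
    total = len(transactions)
    single_counts = Counter()
    pair_counts = Counter()
    for items in transactions:
        single_counts.update(items)
        pair_counts.update(combinations(sorted(items), 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / total
        if support < min_support:
            continue
        for lhs, rhs in ((a, b), (b, a)):
            # A rule whose consequent is an ancestor of its antecedent
            # always holds, so it carries no information and is skipped.
            if rhs in ancestors(lhs):
                continue
            confidence = count / single_counts[lhs]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    return rules


if __name__ == "__main__":
    for lhs, rhs, support, confidence in sorted(mine_rules(DOCS)):
        print(f"{lhs} -> {rhs}  support={support:.2f}  confidence={confidence:.2f}")
```

In this sketch, extending each transaction with taxonomy ancestors is what makes the rules "generalized": a rule can relate a specific tag such as `director` to a broader concept, or relate two broader concepts. Rules that survive the thresholds could then serve as candidate concept relationships for an ontology; how the paper maps them to XTM constructs is not stated here.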

Keywords