XML Clustering Technique by Genetic Algorithm

Kim, Woo-Saeng

Journal of the Institute of Electronics Engineers of Korea CI, Vol. 49, No. 3, pp. 1-7, 2012, pISSN 1229-6376

The Institute of Electronics and Information Engineers


Kim, Woo-Saeng (Dept. of Computer Software, Kwangwoon Univ.)

Received: 2012.03.15
Reviewed: 2012.05.04
Published: 2012.05.25


Abstract

Methods for efficiently accessing, querying, and managing XML documents, which are widely used on the Internet, have recently been an active research topic. In this paper, we propose a new method for clustering XML documents efficiently. Each element of an XML document corresponds to a node of the document's tree, and each nesting relationship in the document corresponds to a parent-child relationship between nodes of that tree. Similar XML documents therefore have similar node names and node levels in their corresponding trees. We encode this property as the evaluation function of a genetic algorithm and use it to cluster XML documents. Experimental results show that the proposed method performs better than existing methods.
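The abstract does not spell out the evaluation function or the genetic operators, so the following Python sketch is only an illustration of the general idea, not the paper's method. It assumes that each document is summarized as a set of (node name, level) pairs, that two documents are compared with Jaccard similarity, and that a chromosome assigns one cluster label per document. The function names, the fitness definition (mean similarity within clusters minus mean similarity across clusters), and all parameter values are hypothetical choices made for this example.

```python
import random
import xml.etree.ElementTree as ET
from itertools import combinations


def features(xml_text):
    """Return the set of (tag, level) pairs for every element in the document."""
    pairs = set()

    def walk(node, level):
        pairs.add((node.tag, level))
        for child in node:
            walk(child, level + 1)

    walk(ET.fromstring(xml_text), 0)
    return pairs


def jaccard(a, b):
    """Jaccard similarity of two non-empty sets (an assumed similarity measure)."""
    return len(a & b) / len(a | b)


def mean(values):
    return sum(values) / len(values) if values else 0.0


def fitness(labels, sim):
    """Assumed evaluation function: intra-cluster minus inter-cluster similarity."""
    intra, inter = [], []
    for i, j in combinations(range(len(labels)), 2):
        (intra if labels[i] == labels[j] else inter).append(sim[i][j])
    return mean(intra) - mean(inter)


def ga_cluster(docs, k=2, pop_size=30, generations=50, mutation_rate=0.1, seed=0):
    """Search for a cluster label per document with a simple genetic algorithm."""
    rng = random.Random(seed)
    feats = [features(d) for d in docs]
    n = len(docs)
    sim = [[jaccard(feats[i], feats[j]) for j in range(n)] for i in range(n)]

    def score(chromosome):
        return fitness(chromosome, sim)

    population = [[rng.randrange(k) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        children = [max(population, key=score)[:]]  # elitism: keep the best as-is
        while len(children) < pop_size:
            # Tournament selection of two parents, three candidates each.
            p1 = max(rng.sample(population, 3), key=score)
            p2 = max(rng.sample(population, 3), key=score)
            cut = rng.randrange(1, n)  # one-point crossover
            child = p1[:cut] + p2[cut:]
            # Mutation: reassign a gene to a random cluster with small probability.
            child = [rng.randrange(k) if rng.random() < mutation_rate else g
                     for g in child]
            children.append(child)
        population = children
    return max(population, key=score)


docs = [
    "<book><title/><author/></book>",
    "<book><title/><author/><year/></book>",
    "<person><name/><email/></person>",
    "<person><name/><phone/></person>",
]
print(ga_cluster(docs))  # one cluster label per document
```

In this toy input the two `book` documents share three (name, level) pairs and the two `person` documents share two, while a `book` and a `person` document share none, so the assumed fitness is highest when each pair is placed in its own cluster. Cluster labels are arbitrary, so only the grouping is meaningful, not the label values.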


