Knowledge Mining from Many-valued Triadic Dataset based on Concept Hierarchy

  • Suk-Hyung Hwang (Department of AI Software, Sun Moon University) ;
  • Young-Ae Jung (Department of AI Software, Sun Moon University) ;
  • Se-Woong Hwang (Department of AI Software, Sun Moon University)
  • Received : 2024.03.26
  • Accepted : 2024.04.25
  • Published : 2024.06.30

Abstract

Knowledge mining is a research field that applies techniques such as data modeling, information extraction, analysis, visualization, and result interpretation to find valuable knowledge in large, diverse datasets. It plays a crucial role in transforming raw data into useful knowledge across domains such as business, healthcare, and scientific research. In this paper, we extend Formal Concept Analysis to propose analytical techniques for knowledge discovery and data mining from diverse data. We define models for representing the various formats and structures of the data to be analyzed (many-valued data tables and triadic data tables), together with algorithms for data processing (dyadic scaling and flattening), concept hierarchy construction, and association rule extraction. The usefulness of the proposed technique is demonstrated empirically through experiments that apply it to public open data.
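
The pipeline named in the abstract (scaling, flattening, concept construction, rule extraction) can be illustrated with a minimal sketch. This is not the paper's algorithm: the toy table, the nominal scaling scheme, the "attribute=value@condition" naming, and every function name below are assumptions made for illustration, and the concepts are enumerated by brute force, which only works for very small contexts.

```python
from itertools import combinations

# Hypothetical many-valued triadic table (toy data, not from the paper):
# (object, attribute, condition) -> value
TRIADIC = {
    ("s1", "grade", "2022"): "high",
    ("s1", "grade", "2023"): "high",
    ("s2", "grade", "2022"): "low",
    ("s2", "grade", "2023"): "high",
    ("s3", "grade", "2022"): "low",
    ("s3", "grade", "2023"): "low",
}


def scale_and_flatten(table):
    """Assumed nominal scaling plus flattening: every (attribute, value,
    condition) triple becomes one binary attribute of a dyadic context."""
    context = {}
    for (obj, attr, cond), value in table.items():
        context.setdefault(obj, set()).add(f"{attr}={value}@{cond}")
    return context


def extent(context, attrs):
    """Objects that have every attribute in attrs."""
    return frozenset(o for o, has in context.items() if set(attrs) <= has)


def intent(context, objs):
    """Attributes shared by every object in objs (all attributes if objs is empty)."""
    all_attrs = set().union(*context.values())
    return frozenset(all_attrs.intersection(*(context[o] for o in objs)))


def concepts(context):
    """Brute-force enumeration of formal concepts as (extent, intent) pairs."""
    objs = list(context)
    found = set()
    for size in range(len(objs) + 1):
        for subset in combinations(objs, size):
            shared = intent(context, subset)
            found.add((extent(context, shared), shared))
    return found


def confidence(context, premise, conclusion):
    """Confidence of the association rule premise -> conclusion."""
    covered = extent(context, premise)
    if not covered:
        return 0.0
    both = extent(context, set(premise) | set(conclusion))
    return len(both) / len(covered)


context = scale_and_flatten(TRIADIC)
lattice = concepts(context)
rule_conf = confidence(context, {"grade=low@2022"}, {"grade=low@2023"})
```

Ordering the resulting (extent, intent) pairs by extent inclusion gives the concept hierarchy. A practical implementation would replace the exhaustive subset search with a dedicated lattice construction algorithm, since the number of subsets grows exponentially with the number of objects.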
