DOI QR코드

DOI QR Code

Usability Analysis of Structured Abstracts in Journal Articles for Document Clustering

문서 클러스터링을 위한 학술지 논문의 구조적 초록 활용성 연구

  • 최상희 (대구가톨릭대학교 도서관학과) ;
  • 이재윤 (경기대학교 문헌정보학과)
  • Received : 2012.03.10
  • Accepted : 2012.03.21
  • Published : 2012.03.30

Abstract

Structured abstracts have been regarded as an essential information factor to represent topics of journal articles. This study aims to provide an unconventional view to utilize structured abstracts with the analysis on sub fields of a structured abstract in depth. In this study, a structured abstract was segmented into four fields, namely, purpose, design, findings, and values/implications. Each field was compared in the performance analysis of document clustering. In result, the purpose statement of an abstract affected on the performance of journal article clustering more than any other fields. Furthermore, certain types of keywords were identified to be excluded in the document clustering to improve clustering performance, especially by Within group average clustering method. These keywords had stronger relationship to a specific abstract field such as research design than the topic of an article.

구조적 초록은 학술 논문의 주제를 표현하는 역할을 하여 학술 논문을 처리하는데 중요한 요소로 인식되어왔다. 이 연구에서는 구조적 초록을 구성하는 세부 필드의 속성을 4개로 분석하고 초록의 구조를 활용하여 문서 클러스터링에 적용할 수 있는 가능성을 고찰고자 하였다. 구조적 초록의 필드 속성을 문서 클러스터링에 적용한 결과 클러스터링 기법간의 편차가 있었으나 연구 목적이 제공하는 정보량에 비해 주제성이 커서 클러스터링 성능에 가장 큰 영향을 미치고 있는 것으로 나타났다. 또한 분석 결과 특정 필드에 특화되어 출현하는 필드 종속적인 단어가 발생하는 것으로 나타나 필드 종속적인 단어를 배제하고 집단내 평균연결 기법을 적용하였을 때는 클러스터링의 성능이 개선되는 것으로 분석되었다.

Keywords

References

  1. 고영만, 송인석 (2011). 연구문헌의 지식구조를 반영하는 의미기반의 지식조직체계에 관한 연구. 정보관리학회지, 28(1), 145-170. doi: 10.3743/KOSIM.2011.28.1.145(Ko, Young-Man, & Song, Inseok (2011). A study on the knowledge organizing system of research papers based on semantic relation of the knowledge structure. Journal of the Korean Society for Information Management, 28(1), 145-170. doi: 10.3743/KOSIM.2011.28.1.145)
  2. 윤보현, 오효정 (2011). 개체명 기반 웹 문서 클러스터링에서 자질 조합 분석. 한국정보기술학회논문지, 9(3), 199-206.(Yun, Bo-Hyun, & Oh, Hyo-Jung (2010). Analysis of feature combination in document clustering using named entities. The Journal of Korean Institute of Information Technology, 9(3), 199-206.)
  3. 정영미, 이재윤 (2001). 클러스터링 성능 평가를 위한 비편향적 척도의 개발. 제8회 한국 정보관리학회 학술대회 논문집, 167-172.(Chung, Young Mee, & Lee, Jae Yun (2001). Development of an unbiased measure for clustering performance. Proceedings of the 7th Conference of Korean Society for Information Management, 167-172.)
  4. 조현양, 최성필 (2004). 계층적 결합형 문서 클러스터링 시스템과 복합명사 색인방법과의 연관관계 연구. 한국문헌정보학회지, 38(4), 179-192.(Cho, Hyun-Yang, & Choi, Sung-Pil (2004). The experimental study on the relationship between hierarchical agglomerative clustering and compound nouns indexing. The Journal of Korean Society for Library and Information Science, 38(4), 179-192.)
  5. Chen, C., Frank, S. C., & Tseng, T. (2010). An integration of WordNet and fuzzy association rule mining for multi-label document clustering. Data & Knowledge Engineering, 69(11), 1208-1226. doi: 10.1016/j.datak.2010.08.003
  6. Choi, Sang Hee (2010). Document clustering using reference titles. Journal of the Korean Society for Information Management, 27(2), 241-252. doi: 10.3743/KOSIM.2010.27.2.241
  7. Hahs-Vaughn, D. L., & Onwuegbuzie, A. J. (2009). Quality of abstracts in articles submitted to a scholarly journal: A mixed methods case study of the journal Research in the Schools. Library & Information Science Research, 32(1), 53-61. doi:10.1016/j.lisr.2009.08.004
  8. Hartley, J. (1997). Is it appropriate to use structured abstracts in social science journals? Learned Publishing, 10(4), 313-317. https://doi.org/10.1087/09531519750146789
  9. Hartley, J. (1998). Is it appropriate to use structured abstracts in non-medical science journals? Journal of Information Science, 24(5), 359-364.
  10. Hartley, J. (1999). Applying ergonomics to Applied Ergonomics. Applied Ergonomics, 30(6), 535-541. https://doi.org/10.1016/S0003-6870(99)00004-6
  11. Hartley, J. (2000). Clarifying the abstracts of systematic reviews. Bulletin of the Medical Library Association, 88(4), 332-337.
  12. Hartley, J. (2003). Improving the clarity of journal abstracts in psychology. Science Communication, 24(3), 366-379. https://doi.org/10.1177/1075547002250301
  13. Nakayama, T, Hirai, N., Yamazaki, S., & Naito, M. (2005). Adoption of structured abstracts by general medical journals and format for a structured abstract. Journal of the Medical Library Association, 93(2), 237-242.
  14. Sharma, S., & Harrison, J. E. (2006). Structured abstracts: Do they improve the quality of information in abstracts? American Journal of Orthodontics and Dentofacial Orthopedics, 130(4), 523-530. https://doi.org/10.1016/j.ajodo.2005.10.023
  15. Stevenson, H. A., & Harrison, J. E. (2009). Structured abstracts: Do they improve citation retrieval from dental journals? Journal of Orthodontics, 36(1), 52-60. https://doi.org/10.1179/14653120722932
  16. Zhu, S., Takigawa, I., & Mamitsuka, H. (2009). Field independent probabilistic model for clustering multi-field documents. Information Processing and Management, 45(5), 555-570. doi:10.1016./j.jpm.2009.03.005