Analysis of Research Topics among Library, Archives and Museums using Topic Modeling

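  • Kim, Heesop (Department of Library and Information Science, Kyungpook National University) ;
  • Kang, Bora (Department of Library and Information Science, Kyungpook National University)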
  • Received : 2019.11.20
  • Accepted : 2019.12.12
  • Published : 2019.12.30

Abstract

This study aims to identify research trends concerning the establishment of a cooperative platform among libraries, archives, and museums, which share the broad mission of providing knowledge and information. To this end, bibliographic records for 637 papers that deal with all three institutions together were collected from the web version of the Scopus database. From the abstracts in those records, 5,218 words were extracted with NetMiner V.4 and analyzed by topic modeling. The results are as follows. First, when word frequencies were weighted by tf-idf, 'Preservation' ranked highest. Second, topic modeling with the LDA (Latent Dirichlet Allocation) algorithm yielded 13 topic areas. Third, when the 13 topic areas were represented as a network, 'Repository Construction' was the central topic, and it was closely related to cooperation among institutions, the conservation environment for collections, system and policy discovery at the government level, the life cycle of collections, the exhibition of information resources, and information retrieval. Fourth, in the yearly trend of the 13 topic areas, studies before 1998 were limited to specific subjects such as system and policy discovery, information retrieval, and the life cycle of collections, whereas later studies addressed a more diverse range of topics.
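
To make the tf-idf weighting mentioned in the abstract concrete, the following is a minimal sketch in plain Python. It is an illustration only: the study used NetMiner V.4, whose exact weighting formula is not stated here, so the common form tf × ln(N / df) is assumed. The function name `tf_idf` and the toy documents are hypothetical and are not data from the study.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed formula (not confirmed for NetMiner): tf * ln(N / df),
    where tf is the term's share of the document's tokens.
    """
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        total = len(doc)
        weights.append({
            term: (count / total) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return weights


# Toy stand-ins for abstracts; hypothetical, not taken from the study.
abstracts = [
    "digital preservation of museum collections".split(),
    "library archive museum cooperation".split(),
    "preservation policy for archive collections".split(),
]

weights = tf_idf(abstracts)

# Sum each term's weight over the corpus to obtain a simple ranking.
totals = Counter()
for doc_weights in weights:
    totals.update(doc_weights)
ranking = totals.most_common()
```

A term that occurs in every document receives a weight of zero under this formula, which is why tf-idf favors words that distinguish some abstracts from others over words that are merely frequent.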
