Research on Improving the Identification Accuracy of Knowledge Production Institutions in the Digital Health Field

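  • 최성윤 (Interdisciplinary Program in Management of Technology, Graduate School, Sogang University);
  • 문성욱 (Graduate School of Management of Technology, Sogang University)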
  • Received : 2024.03.07
  • Accepted : 2024.04.26
  • Published : 2024.05.31

Abstract

Institutions and their collaboration play important roles in producing knowledge for innovation, yet the lack of accurate methods for identifying knowledge-producing institutions has restricted empirical research on the role of institutions in innovation. This study explores methods to enhance the accuracy of identifying the institutions involved in the innovation process. To this end, we propose ways to improve accuracy on two fronts, data and algorithms, using bibliographic information in the digital health field. Specifically, in the data processing stage, before algorithms are applied, we address contextual inaccuracies in the bibliographic information; in the algorithm application stage, we propose methods for resolving the ambiguity of institution names (institution name disambiguation, IND). Compared with the PKG dataset, a publicly available dataset based on the same bibliographic information, our methods doubled the number of cases available for subsequent analysis. We also found that the contribution of Korean institutions in the digital health field is either underestimated or overestimated. The method presented in this study is expected to contribute to empirical research on the role of knowledge-producing institutions in the innovation process and ecosystem.

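Today most individual researchers are affiliated with knowledge-producing institutions, so the type of institution and the collaboration among institutions strongly influence the production of scientific knowledge. Because these institutions are not identified accurately, however, the process of scientific knowledge production can be understood empirically only to a limited extent. This study proposes a method for improving the accuracy with which knowledge-producing institutions are identified. Specifically, after collecting PubMed bibliographic records in the digital health field, we use 'contextual linkage' in the data processing stage, before any algorithm is applied, to resolve the incompleteness of institution information; in the algorithm application stage, we present a method for reducing institution name ambiguity (IND). When the 'knowledge-producing institution dataset' produced in this study is compared with the 'PKG datasets', an existing public dataset built on the same bibliographic information, the proposed method doubles the number of records included in the dataset and also reflects country rankings more accurately. We also found that the contribution of Korean knowledge-producing institutions to the digital health field has been underestimated or overestimated. The method presented here is expected to contribute to future empirical research on the role of knowledge-producing institutions in producing scientific knowledge and achieving scientific innovation.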
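
To make the idea of institution name disambiguation (IND) concrete, the following is a minimal Python sketch that groups spelling variants of an institution name under one normalized key. It is an illustration only and not the method used in this study: the abbreviation table `ABBREVIATIONS` and the functions `normalize_institution` and `group_variants` are hypothetical names introduced here, and a real pipeline would likely need a much larger curated dictionary plus handling of departments, addresses, and translated names.

```python
import re
import unicodedata
from collections import defaultdict

# Hypothetical abbreviation table (an assumption for illustration only).
ABBREVIATIONS = {
    "univ": "university",
    "hosp": "hospital",
    "inst": "institute",
    "natl": "national",
}


def normalize_institution(name: str) -> str:
    """Return a crude canonical key for an institution name string."""
    # Decompose accented characters and drop the combining marks.
    text = unicodedata.normalize("NFKD", name)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    # Lowercase and replace anything that is not a letter, digit, or space.
    text = re.sub(r"[^a-z0-9\s]", " ", text.lower())
    # Expand known abbreviations token by token.
    tokens = [ABBREVIATIONS.get(tok, tok) for tok in text.split()]
    return " ".join(tokens)


def group_variants(names):
    """Group raw name strings that share the same normalized key."""
    groups = defaultdict(list)
    for name in names:
        groups[normalize_institution(name)].append(name)
    return dict(groups)


if __name__ == "__main__":
    raw = ["Sogang Univ.", "Sogang University", "SOGANG UNIVERSITY"]
    for key, variants in group_variants(raw).items():
        print(key, "<-", variants)
```

String normalization of this kind only merges surface variants; it cannot tell apart distinct institutions with similar names or link a hospital to its parent university, which is where the data-side corrections described in the abstract would matter.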

Acknowledgement

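This research was supported by a 2023 Sogang University Research Grant (No. 202312032.01) and by the Competency Development Program for Industry Specialists of the Ministry of Trade, Industry and Energy (No. P0012783).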
