DOI QR코드

DOI QR Code

A Study on Big Data Analysis of Related Patents in Smart Factories Using Topic Models and ChatGPT

토픽 모형과 ChatGPT를 활용한 스마트팩토리 연관 특허 빅데이터 분석에 관한 연구

  • Sang-Gook Kim (Korea Institute of Science and Technology Information) ;
  • Minyoung Yun (Korea Institute of Science and Technology Information) ;
  • Taehoon Kwon (Korea Institute of Science and Technology Information) ;
  • Jung Sun Lim (Korea Institute of Science and Technology Information)
  • Received : 2023.09.26
  • Accepted : 2023.10.24
  • Published : 2023.12.31

Abstract

In this study, we propose a novel approach to analyze big data related to patents in the field of smart factories, utilizing the Latent Dirichlet Allocation (LDA) topic modeling method and the generative artificial intelligence technology, ChatGPT. Our method includes extracting valuable insights from a large data-set of associated patents using LDA to identify latent topics and their corresponding patent documents. Additionally, we validate the suitability of the topics generated using generative AI technology and review the results with domain experts. We also employ the powerful big data analysis tool, KNIME, to preprocess and visualize the patent data, facilitating a better understanding of the global patent landscape and enabling a comparative analysis with the domestic patent environment. In order to explore quantitative and qualitative comparative advantages at this juncture, we have selected six indicators for conducting a quantitative analysis. Consequently, our approach allows us to explore the distinctive characteristics and investment directions of individual countries in the context of research and development and commercialization, based on a global-scale patent analysis in the field of smart factories. We anticipate that our findings, based on the analysis of global patent data in the field of smart factories, will serve as vital guidance for determining individual countries' directions in research and development investment. Furthermore, we propose a novel utilization of GhatGPT as a tool for validating the suitability of selected topics for policy makers who must choose topics across various scientific and technological domains.

Keywords

Acknowledgement

This research received partial funding from two programs (K-23-L05-C02-S16, K-23-L03-C04-S01) provided by the Korea Institute of Science and Technology Information (KISTI) in South Korea.

References

  1. Abdirad, M. and Krishnan, K., Industry 4.0 in Logistics and Supply Chain Management: A Systematic Literature Review, EMJ - Engineering Management Journal, 2020, Vol. 33, No. C., pp. 1-15. https://doi.org/10.1080/10429247.2020.1783935
  2. Ampornphan, P. and Tongngam, S., Exploring technology influencers from patent data using association rule mining and social network analysis, Information (Switzerland), 2020, Vol. 11, No. 6.
  3. Blei, D. M., Ng, A. Y., and Jordan M. I., Latent Dirichlet allocation, Journal ofMachine Learning Research, 2003 Vol. 3, No. 4-5, pp. 993-1022.
  4. Cao, L. and Fei-Fei, L., Spatially coherent latent topic model for concurrent segmentation and classification of objects and scenes, in Proceedings of the 2007 IEEE 11th International Conference on Computer Vision, ICCV, 2007, pp. 1-8.
  5. Chang, T. and Yang, H., Latent semantic analysis of research papers on smart factory, ICIC Express Letters, 2017, Vol. 11, No. 4, pp. 899-904. https://doi.org/10.3144/expresspolymlett.2017.86
  6. Choi, D. and Song, B., Exploring technological trends in logistics: Topic modeling-based patent analysis, Sustainability (Switzerland), 2018, Vol. 10, No. 8.
  7. Daim, T., Bukhari, E., Bakry, D., Vanhuis, J., Yalcin, H., and Wang, X., Forecasting technology trends through the gap between science and technology: The Case of software as an e-commerce service, Foresight and STI Governance, 2021, Vol. 15, No. 2, pp. 12-24. https://doi.org/10.17323/2500-2597.2021.2.12.24
  8. Govindarajan, U., Trappey, A., and Trappey, C., Immersive Technology for Human-Centric Cyberphysical Systems in Complex Manufacturing Processes: A Comprehensive Overview of the Global Patent Profile Using Collective Intelligence, Complexity, 2018, Vol. 2018.
  9. Griffiths, T. L. and Steyvers, M., Finding scientific topcis, Proceedings of the National Academy of Sciences, April 2004.
  10. Jung, Y. and Chang, T., Text mining based online news analysis about smart factory, ICIC Express Letters, Part B: Applications, 2018, Vol.9, No. 6, p.559-565.
  11. Kim, M., Park, Y., and Yoon, J., Generating patent development maps for technology monitoring using semantic patent-topic analysis, Computers and Industrial Engineering, 2016, Vol. 98, pp. 289-299. https://doi.org/10.1016/j.cie.2016.06.006
  12. Kwon, K., Jun, S., Lee, Y., Choi, S., and Lee, C., Logistics Technology Forecasting Framework Using Patent Analysis for Technology Roadmap, Sustainability (Switzerland), 2022, Vol. 14, No. 9.
  13. Lee, C., Liu, C., Trappey, A., Mo, J., and Desouza, K., Understanding digital transformation in advanced manufacturing and engineering: A bibliometric analysis, topic modeling and research trend discovery, Advanced Engineering Informatics, 2021, Vol. 50.
  14. Lee, J., Park, S., and Lee, J., Study on the Technology Trend Screening Framework Using Unsupervised Learning, Applied Sciences (Switzerland), 2022, Vol.12, No. 17.
  15. Li, R., Wang, X., Liu, Y., Zhang, S., and Hanif, O., Research status and collaboration analysis based on big data mining: an empirical study of Alzheimer's disease, Technology Analysis and Strategic Management, 2021, Vol.33, No. 4, pp. 379-395. https://doi.org/10.1080/09537325.2020.1815693
  16. Nam, D. and Choi, G., The Identification of Emerging Technologies of Automotive Semiconductor, KSII Transactions on Internet and Information Systems, 2023, Vol.17, No. 2, pp. 663-677. https://doi.org/10.3837/tiis.2023.02.021
  17. Park, J., Kim, N., Choi, H., and Han, E., A new forecasting system using the latent dirichlet allocation (LDA) topic modeling technique, WSEAS Transactions on Environment and Development, 2018, Vol.14, p.363-373.
  18. Wang, J. and Hsu, C., A topic-based patent analytics approach for exploring technological trends in smart manufacturing, Journal of Manufacturing Technology Management, 2021, Vol.32, No. 1, pp. 110-135. https://doi.org/10.1108/JMTM-03-2020-0106
  19. Wang, J., Li, K., and Feng, L., Tracing the technological trajectory of coal slurry pipeline transportation technology: An HMM-based topic modeling approach, Frontiers in Energy Research, 2022, Vol. 10.
  20. Wang, Y., Feng, L., Wang, J., Zhao, H., and Liu, P., Technology Trend Forecasting and Technology Opportunity Discovery Based on Text Mining: The Case of Refrigerated Container Technology, Processes, 2022, Vol. 10, No. 3.
  21. Wang, Z., Gao, L., and Suo, H., Designing petrochemical smart plant of the future: State of the art, comparison and prospects;未来石化智能工厂顶层设计:现状, 对比及展望, Huagong Jinzhan/Chemical Industry and Engineering Progress, 2022, Vol. 41, No. 7, pp. 3387-3401.
  22. Yang, H., Chang, T., and Choi, Y., Exploring the research trend of smart factory with topic modeling, Sustainability (Switzerland), 2018, Vol. 10, No. 8.