
A Topic Modeling-based Recommender System Considering Changes in User Preferences (고객 선호 변화를 고려한 토픽 모델링 기반 추천 시스템)

  • Kang, So Young;Kim, Jae Kyeong;Choi, Il Young;Kang, Chang Dong
    • Journal of Intelligence and Information Systems / v.26 no.2 / pp.43-56 / 2020
  • Recommender systems help users choose the best option among many alternatives. They play a particularly important role on internet sites, where vast amounts of digital information are generated every second. Many studies on recommender systems have focused on recommendation accuracy. However, several problems must be overcome for a recommender system to be commercially successful. First, recommender systems lack transparency: users cannot tell why products are recommended. Second, recommender systems cannot immediately reflect changes in user preferences: although a user's product preferences change over time, the model must be rebuilt to reflect them. Therefore, in this study, we propose a recommendation methodology that uses topic modeling and sequential association rule mining on review data to address these problems. Product reviews provide useful information for recommendation because they include not only product ratings but also varied content such as user experiences and emotional states, and therefore imply user preferences for products. Topic modeling is thus useful for explaining why items are recommended to users, and sequential association rule mining is useful for identifying changes in user preferences. The proposed methodology consists of two phases. The first phase creates user profiles based on topic modeling: topics are extracted from user reviews of products, and a user profile over those topics is created. The second phase recommends products using sequential rules that appear in users' buying behavior over time. The buying behaviors are derived from changes in each user's topics.
A collaborative filtering-based recommendation system was developed as a benchmark, and we compared its performance with that of the proposed methodology using Amazon's review dataset. Accuracy, recall, precision, and F1 were used as evaluation metrics. For topic modeling, collapsed Gibbs sampling was used, and 15 topics were extracted. Among the main topics, topic 1, topic 3, topic 4, topic 7, topic 9, topic 13, and topic 14 relate to "comedy shows", "high-teen drama series", "crime investigation drama", "horror theme", "British drama", "medical drama", and "science fiction drama", respectively. In the comparative analysis, the proposed methodology outperformed the collaborative filtering-based recommendation system. The results show that the period just prior to the recommendation is very important for inferring changes in user preference. The proposed methodology therefore both secures the transparency of the recommender system and reflects user preferences that change over time. However, it has some limitations. It cannot make fine-grained recommendations when a topic includes a large number of products. In addition, the number of sequential patterns is small because the number of topics is small. Future research needs to address these limitations.
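
The second phase described in this abstract can be illustrated with a minimal sketch. It assumes each user's history has already been reduced to a time-ordered list of dominant topic IDs (the hypothetical `histories` dict), mines one-step rules of the form "topic a, then topic b" from consecutive pairs, and scores each rule by confidence. The function names, the one-step rule form, and the thresholds are assumptions made for illustration, not the authors' implementation.

```python
from collections import Counter

def mine_sequential_rules(histories, min_support=2, min_confidence=0.5):
    """Mine one-step rules (a -> b) from time-ordered topic sequences.

    histories: dict mapping a user ID to a list of topic IDs ordered by time
    (a hypothetical input format). Returns {(a, b): confidence}.
    """
    pair_counts = Counter()   # how often topic b directly follows topic a
    first_counts = Counter()  # how often topic a is followed by any topic
    for sequence in histories.values():
        for a, b in zip(sequence, sequence[1:]):
            pair_counts[(a, b)] += 1
            first_counts[a] += 1
    rules = {}
    for (a, b), count in pair_counts.items():
        confidence = count / first_counts[a]
        if count >= min_support and confidence >= min_confidence:
            rules[(a, b)] = confidence
    return rules

def recommend_next_topic(rules, last_topic):
    """Return the most confident successor of the user's latest topic, or None."""
    candidates = {b: conf for (a, b), conf in rules.items() if a == last_topic}
    if not candidates:
        return None
    return max(candidates, key=candidates.get)

# Toy data: topic IDs are arbitrary labels, not the topics reported in the paper.
histories = {
    "u1": [1, 4, 7],
    "u2": [1, 4, 9],
    "u3": [1, 4, 7],
    "u4": [3, 1, 13],
}
rules = mine_sequential_rules(histories)
next_topic = recommend_next_topic(rules, last_topic=4)
```

Because only the most recent topic is used to look up a rule, the sketch reflects the abstract's observation that the period just before the recommendation carries most of the signal; products would then be chosen from within the predicted topic.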

A study on the classification of research topics based on COVID-19 academic research using Topic modeling (토픽모델링을 활용한 COVID-19 학술 연구 기반 연구 주제 분류에 관한 연구)

  • Yoo, So-yeon;Lim, Gyoo-gun
    • Journal of Intelligence and Information Systems / v.28 no.1 / pp.155-174 / 2022
  • From January 2020 to October 2021, more than 500,000 academic studies related to COVID-19 (the disease caused by severe acute respiratory syndrome coronavirus 2) were published. This rapid growth imposes time and technical constraints on healthcare professionals and policy makers who need to find important research quickly. Therefore, in this study, we propose a method of extracting useful information from the text of a large body of literature using the LDA and Word2vec algorithms. Papers related to the search keywords were extracted from the COVID-19 literature, and detailed topics were identified. The data came from the CORD-19 dataset on Kaggle, a free academic resource prepared by major research groups and the White House in response to the COVID-19 pandemic and updated weekly. The research method has two main parts. First, 41,062 articles were obtained by filtering and pre-processing the abstracts of 47,110 academic papers that included full text. The number of COVID-19 publications per year was analyzed through exploratory data analysis in Python, and the top 10 journals with the most active research were identified. LDA and Word2vec were used to derive research topics related to COVID-19, and similarity was measured after analyzing related words. Second, papers containing 'vaccine' and 'treatment' were extracted from the topics derived from all papers, yielding 4,555 papers related to 'vaccine' and 5,971 papers related to 'treatment'. For each set of collected papers, detailed topics were analyzed using LDA and Word2vec, and clustering with PCA dimension reduction was applied so that groups of papers with similar themes could be visualized using the t-SNE algorithm. A noteworthy result is that topics that did not emerge from topic modeling over all COVID-19 papers did emerge from the topic modeling results for each research topic. For example, topic modeling of the papers related to 'vaccine' extracted a new topic, Topic 05 'neutralizing antibodies'. A neutralizing antibody is an antibody that protects cells from infection when a virus enters the body, and it is said to play an important role in the production of therapeutic agents and in vaccine development. Likewise, extracting topics from the papers related to 'treatment' revealed a new topic, Topic 05 'cytokine'. A cytokine storm occurs when the body's immune cells attack normal cells instead of defending against the infection. Hidden topics that could not be found across the entire corpus were uncovered by classifying papers by keyword and performing topic modeling on each subset. In this study, we proposed a method of extracting topics from a large amount of literature using the LDA algorithm and extracting similar words using Skip-gram, the Word2vec model that predicts surrounding words from a central word. Combining the LDA and Word2vec models was intended to improve performance by identifying both the relationships between documents and LDA topics and the relationships between words captured by Word2vec. In addition, a clustering method based on PCA dimension reduction was presented that uses the t-SNE technique to group documents with similar themes, giving an intuitive, structured organization of the documents. In a situation where researchers' efforts to overcome COVID-19 cannot keep pace with the rapid publication of related papers, we hope this method will save the precious time and effort of healthcare professionals and policy makers and help them gain new insights quickly. It is also expected to serve as basic data for researchers exploring new research directions.
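
As a rough illustration of the Skip-gram idea mentioned in this abstract, the sketch below generates the (center word, context word) training pairs that a Skip-gram model learns from. The function name `skipgram_pairs`, the window size, and the toy sentence are assumptions for illustration; the sketch covers only pair generation, not the authors' pipeline or the training of word vectors.

```python
def skipgram_pairs(tokens, window=2):
    """Return (center, context) pairs for every token within `window` positions."""
    pairs = []
    for i, center in enumerate(tokens):
        start = max(0, i - window)
        stop = min(len(tokens), i + window + 1)
        for j in range(start, stop):
            if j != i:  # a word is never its own context
                pairs.append((center, tokens[j]))
    return pairs

# Toy sentence, already tokenized and lowercased (hypothetical example text).
tokens = ["neutralizing", "antibodies", "block", "viral", "entry"]
pairs = skipgram_pairs(tokens, window=1)
```

A Skip-gram model is trained to predict the second element of each pair from the first, so words that share many contexts end up with similar vectors, which is what makes the similarity measurement described above possible.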

  • Entitymetrics Analysis of the Research Works of Dong-ju Yun using Textmining (텍스트마이닝을 이용한 윤동주 연구의 개체계량학적 분석)

    • Park, Jinkyeun;Kim, Taekyoun;Song, Min
      • Journal of the Korean BIBLIA Society for library and Information Science / v.28 no.1 / pp.191-207 / 2017
    • This paper applies entitymetrics analysis to research on Dong-ju Yun, a Korean poet whose works, religion, and life have been studied by many researchers. We collected 1,076 papers about Dong-ju Yun and applied various approaches, including co-author citation analysis and topic modeling analysis, to identify topic trends in the study of Dong-ju Yun. We also extracted entities such as personal names and titles of literary works from abstracts to examine the relationships among them. The results enable us to objectively identify topic trends and to infer implicit relationships between key concepts associated with Dong-ju Yun based on text data. Moreover, we observed research sub-topics such as life, poem, aesthetic existence, comparative literature, literary translation, and religious beliefs. This paper shows how entitymetrics can be used to study intellectual structures in the humanities.

    Topic Model Analysis of Research Trend on Renewable Energy (신재생에너지 동향 파악을 위한 토픽 모형 분석)

    • Shin, KyuSik;Choi, HoeRyeon;Lee, HongChul
      • Journal of the Korea Academia-Industrial cooperation Society / v.16 no.9 / pp.6411-6418 / 2015
    • Studies on renewable energy policy are increasing in response to climate change and environmental pollution. Renewable energy is a new growth-engine technology that represents green industry and green technology. At present, Korea is investing in renewable energy supply and technology development projects in three main strategic sectors (solar power, wind power, and hydrogen fuel cells), but these efforts are still at an early stage, so reducing uncertainty about research directions and investment fields is the most urgent issue. This study therefore applied text mining and a multinomial topic model, among big data analysis methods, to Korean newspaper articles on renewable energy from the last 10 years, analyzed the core issues and global research trends, and forecast the renewable energy fields with growth potential. The results of this study, which is based on information and communication technology, are expected to be actively applied in renewable energy fields.

    The Stream of Uncertainty in Scientific Knowledge using Topic Modeling (토픽 모델링 기반 과학적 지식의 불확실성의 흐름에 관한 연구)

    • Heo, Go Eun
      • Journal of the Korean Society for information Management / v.36 no.1 / pp.191-213 / 2019
    • Scientific knowledge is obtained through research. Researchers deal with the uncertainty of science and establish the certainty of scientific knowledge. In other words, working through uncertainty is an essential step in obtaining scientific knowledge. Existing studies have mostly examined hedging from a linguistic perspective or, in computational linguistics, manually constructed corpora of uncertainty words. They have only been able to identify the characteristics of uncertainty in a particular research field based on simple frequency. Therefore, in this study, we examine how scientific knowledge, as reflected in uncertainty words, changes over time in the biomedical literature, where biomedical claims in sentences play an important role. For this purpose, biomedical propositions are analyzed based on semantic predications provided by UMLS, and DMR topic modeling, a useful method for identifying patterns in disciplines, is applied to understand trends in entity-based topics with uncertainty. The results confirm that, as research develops over time, uncertainty in scientific knowledge tends to decrease.

    A Trend Analysis of Radiological Research in Korea using Topic Modeling (토픽모델링을 이용한 국내 방사선 학술연구 트렌드 분석)

    • Hong, Dong-Hee
      • Journal of the Korean Society of Radiology / v.16 no.3 / pp.343-349 / 2022
    • We use topic modeling to identify radiation-themed papers published from 1989 to 2022 and to analyze the relevance and weight between topics. To contribute to the revitalization of research in the field of radiation, this study analyzed topics derived from 717 domestic papers published through 2022. Through text mining, overall research trends in the topic distribution were analyzed, and five topics were derived through topic modeling. First, after preprocessing the keywords of the 717 papers, a total of 1,675 words were analyzed by frequency. Second, analysis of the five topics based on the associations among their constituent words showed that studies focused on minimizing dose within the range that does not degrade image quality in the areas of radiation, imaging, CT, and clinical practice. In addition, various studies were conducted mainly on MRI, and ultrasound studies were actively attempted in various areas of disease analysis.

    National Petition Analysis Related to Nursing: Text Network Analysis and Topic Modeling (간호관련 국민청원 분석: 텍스트네트워크 분석 및 토픽모델링)

    • Ko, HyunJung;Jeong, Seok Hee;Lee, Eun Jee;Kim, Hee Sun
      • Journal of Korean Academy of Nursing / v.53 no.6 / pp.635-651 / 2023
    • Purpose: This study aimed to identify the main keywords, network structure, and main topics of national petitions related to "nursing" in South Korea. Methods: Data were gathered from national petitions submitted to the Korean Blue House on the topics "nursing" or "nurse" from August 17, 2017, to May 9, 2022. A total of 5,154 petitions were retrieved, and 995 were selected for the final analysis. Text network analysis and topic modeling were performed using the Netminer 4.5.0 program. Results: Regarding network characteristics, a density of 0.03, an average degree of 144.483, and an average distance of 1.943 were found. Compared with the degree centrality and betweenness centrality results, keywords such as "work environment," "nursing university," "license," and "education" appeared characteristically in the eigenvector centrality analysis. Topic modeling derived four topics: (1) "Improving the working environment and dealing with nursing professionals," (2) "requesting investigation and punishment related to medical accidents," (3) "requiring clear role regulation and legislation of medical and nonmedical professions," and (4) "demanding improvement of healthcare-related systems and services." Conclusion: This is the first study to analyze Korea's national petitions in the field of nursing. The results confirmed both the internal needs and external demands for nurses in South Korea. Policies and laws that reflect these results should be developed.

    Exploring Key Topics and Trends of Government-sponsored R&D Projects in Future Automotive Fields: LDA Topic Modeling Approach (미래 자동차 분야 국가연구개발사업의 주요 연구 토픽과 투자 동향 분석: LDA 토픽모델링을 중심으로)

    • Ma Hyoung Ryul;Lee Cheol-Ju
      • Journal of Korea Society of Industrial Information Systems / v.29 no.1 / pp.31-48 / 2024
    • The domestic automotive industry must consider a strategic shift from traditional automotive component manufacturing to align with future trends such as connectivity, autonomous driving, sharing, and electrification. This research conducted topic modeling on R&D projects in the future automotive sector funded by the Ministry of Trade, Industry, and Energy from 2013 to 2021. We found that topics such as sensors, communication, driver assistance technology, and battery and power technology remained consistently prominent throughout the entire period. Conversely, topics like high-strength lightweight chassis were observed only in the first period, while topics like AI, big data, and hydrogen fuel cells gained increasing importance in the second and third periods. Furthermore, this research analyzed the areas of concentrated investment for each period based on topic-specific government investment amounts and investment growth rates.

    Text Mining Driven Content Analysis of Ebola on News Media and Scientific Publications (텍스트 마이닝을 이용한 매체별 에볼라 주제 분석 - 바이오 분야 연구논문과 뉴스 텍스트 데이터를 이용하여 -)

    • An, Juyoung;Ahn, Kyubin;Song, Min
      • Journal of the Korean Society for Library and Information Science / v.50 no.2 / pp.289-307 / 2016
    • Infectious diseases such as Ebola virus disease become social issues and draw public attention, making them major topics in news and research. As a result, there have been many studies on infectious diseases using text-mining techniques. However, no research has analyzed the content of two media channels with distinct characteristics. Accordingly, in this study, we conduct a topic analysis comparing news (representing a social perspective) with academic research papers (representing the perspectives of bio-professionals). As text-mining techniques, topic modeling is applied to extract the topics of each type of material, and a word co-occurrence map based on selected bio entities is used to compare the perspectives of the materials in detail. For network analysis, a topic map is built using Gephi. These approaches uncovered differences in topics between the two materials and the characteristics of each. In the word co-occurrence map, however, most entities are shared by both materials. These results indicate that there are both differences and commonalities between social and academic materials.

    An Analysis of the Research Trend on Smart Mobility : Topic Modeling Approach (스마트 모빌리티 연구 동향에 관한 분석 : 토픽 모델링의 적용)

    • Park, Jungtae;Kim, Choongyoung;Kim, Taejong
      • The Journal of The Korea Institute of Intelligent Transport Systems / v.21 no.2 / pp.85-100 / 2022
    • Recently, with the widespread expansion of convergence based on digital connectivity, the transportation and mobility fields are changing rapidly, and related research is diversifying. This study aims to analyze research trends in the mobility field and identify key research areas and topics. Topic modeling has proven to be a useful approach for analyzing research trends. The abstracts of 142 research papers on mobility from the Korean academic citation index were analyzed, and 9 research topics were derived and linked to 6 key elements of a research framework. The results showed that 'Advanced vehicle and transportation technology' and 'Linkage and integrated services among means for mobility' were the most actively studied research fields. They also showed that research has been conducted on insurance, law, and regulation for securing user safety and on resolving conflicts with existing industries.

