• Title/Summary/Keyword: Topic Modeling(LDA)


Analysis of Research Trends Related to Drug Repositioning Based on Machine Learning (머신러닝 기반의 신약 재창출 관련 연구 동향 분석)

  • So Yeon Yoo;Gyoo Gun Lim
    • Information Systems Review / v.24 no.1 / pp.21-37 / 2022
  • Drug repositioning, one of the methods of developing new drugs, discovers new indications for drugs that have already been approved for use in people. With recent advances in machine learning, vast amounts of biological information are increasingly being analyzed and used for new drug development. Applying machine learning to drug repositioning will help find effective treatments quickly. Currently, the world is struggling with COVID-19, a new severe acute respiratory disease caused by a coronavirus. Repositioning drugs that have already been clinically approved could offer an alternative route to therapeutics for COVID-19 patients. This study examines research trends in drug repositioning using machine learning techniques. A total of 4,821 papers were collected from PubMed with the keyword 'Drug Repositioning' using web scraping. After data preprocessing, frequency analysis, LDA-based topic modeling, random forest classification analysis, and prediction performance evaluation were performed on 4,419 papers. Associated words were analyzed with a Word2vec model; after dimensionality reduction with PCA, K-Means clustering was used to generate labels, and the structure of the literature was then visualized using the t-SNE algorithm. Hierarchical clustering was applied to the LDA results and visualized as a heat map. This study identified research topics related to drug repositioning and presented a method for deriving and visualizing meaningful topics from a large body of literature using machine learning algorithms. The results are expected to serve as basic data for establishing research or development strategies in the field of drug repositioning.

A Study on Trends of Key Issues in Port Safety at Busan Port (부산항 항만안전 주요 이슈 동향에 관한 연구)

  • Jeong-Min Lee;Do-Yeon Ha;Joo-Hye Kim
    • Journal of Navigation and Port Research / v.48 no.1 / pp.34-48 / 2024
  • As global supply chain risks proliferate unpredictably, the high interdependence of the port and logistics industries intensifies the risk burden. This study conducted fundamental research to explore diverse safety issues in domestic ports. Using news article data about Busan Port, we employed LDA topic modeling and time-series linear regression to understand key safety trends. Over the past 30 years, Busan Port faced nine major safety issues; maritime safety, import cargo inspection, labor strikes, and natural disasters emerged cyclically. The major safety issues at Busan Port are primarily unpredictable in nature and fall under the socio-environmental and natural phenomena types, indicating a significant impact of global uncertainty. Therefore, systematic policies need to be formulated based on the identified issues to enhance port safety at Busan Port. Additionally, the resilience of port safety against unpredictable risk situations needs to be strengthened. In conclusion, advanced research is necessary to promote port safety in response to dynamically changing social conditions.
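The abstract above pairs LDA with time-series linear regression to read trends in each issue. As a rough illustration only, the sketch below fits an ordinary least squares line to one topic's yearly share and reports the slope. The function name `ols_trend` and the yearly shares are hypothetical and are not taken from the paper; a real analysis would also test whether the slope is statistically significant.

```python
def ols_trend(years, shares):
    """Fit shares = intercept + slope * year by ordinary least squares."""
    n = len(years)
    mean_x = sum(years) / n
    mean_y = sum(shares) / n
    # Sum of squared deviations of x, and cross-deviations of x and y.
    sxx = sum((x - mean_x) ** 2 for x in years)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(years, shares))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Made-up yearly share of documents assigned to one topic (illustration only).
years = [2019, 2020, 2021, 2022, 2023]
shares = [0.10, 0.12, 0.14, 0.16, 0.18]

slope, intercept = ols_trend(years, shares)
trend = "rising" if slope > 0 else "falling" if slope < 0 else "flat"
print(f"slope per year: {slope:.4f} ({trend})")
```

A positive slope suggests the topic is gaining share over time and a negative slope that it is losing share; the sign alone says nothing about cyclical patterns such as those the abstract reports.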

Text Mining-Based Analysis of Hyundai Automobile Consumer Satisfaction and Dissatisfaction Factors in the Chinese Market: A Comparison with Other Brands (텍스트 마이닝을 이용한 현대 자동차 중국시장 소비자의 만족 및 불만족 요인 분석 연구: 다른 브랜드와의 비교)

  • Cui Ran;Inyong Nam
    • The Journal of the Convergence on Culture Technology / v.10 no.1 / pp.539-549 / 2024
  • This study employed text mining techniques like frequency analysis, word clouds, and LDA topic modeling to assess consumer satisfaction and dissatisfaction with Hyundai Motor Company in the Chinese market, compared to brands such as Toyota, Volkswagen, Buick, and Geely. Focusing on compact vehicles from these brands between 2021 and 2023, this study analyzed customer reviews. The results indicated Hyundai Avante's positive factors, including a long wheelbase. However, it also highlighted dissatisfaction aspects like Manipulate, engine performance, trunk space, chassis and suspension, safety features, quantity and brand of audio speakers, music membership service, separation band, screen reflection, CarLife, and map services. Addressing these issues could significantly enhance Hyundai's competitiveness in the Chinese market. Previous studies mainly focused on literature research and surveys, which only revealed consumer perceptions limited to the variables set by the researchers. This study, through text mining and comparing various car brands, aims to gain a deeper understanding of market trends and consumer preferences, providing useful information for marketing strategies of Hyundai and other brands in the Chinese market.

Research Topics in Industrial Engineering 2001~2015 (국내 산업공학 연구 주제 2001~2015)

  • Jeong, Bokwon;Lee, Hakyeon
    • Journal of Korean Institute of Industrial Engineers / v.42 no.6 / pp.421-431 / 2016
  • Over the last four decades, industrial engineering (IE) research in Korea has continued to evolve and expand in response to social needs. This paper aims to identify research topics in IE and explore their dynamic changes over time. The topic modeling approach, which automatically discovers the topics that pervade a large, unstructured collection of documents, is adopted to identify research topics in domestic IE research. A total of 1,242 articles published from 2001 to 2015 in two IE journals issued by the Korean Institute of Industrial Engineers were collected, and their English abstracts were analyzed. Applying the Latent Dirichlet Allocation model uncovered 50 topics of domestic IE research. The 10 most popular topics are revealed, and topic trends are explored by examining changes over time. Four topics (technology management, financial engineering, data mining (supervised learning), and efficiency analysis) are identified as hot topics, while several traditional topics related to manufacturing are identified as cold topics. The findings are expected to provide useful implications for IE researchers.
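To make concrete what "automatically discovers topics" means, here is a minimal sketch of Latent Dirichlet Allocation fitted by collapsed Gibbs sampling, using only the Python standard library. It is not the implementation used in the paper, which the abstract does not describe; the function name `lda_gibbs`, the hyperparameters `alpha` and `beta`, and the toy documents are all assumptions chosen for illustration.

```python
import random


def lda_gibbs(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampler for LDA on tokenized documents.

    Returns (doc_topic, topic_word, vocab): per-document topic proportions,
    per-topic word distributions, and the sorted vocabulary.
    """
    rng = random.Random(seed)
    vocab = sorted({w for doc in docs for w in doc})
    word_id = {w: i for i, w in enumerate(vocab)}
    n_words = len(vocab)

    ndk = [[0] * n_topics for _ in docs]             # document-topic counts
    nkw = [[0] * n_words for _ in range(n_topics)]   # topic-word counts
    nk = [0] * n_topics                              # tokens per topic
    assignments = []

    # Give every token a random initial topic.
    for d, doc in enumerate(docs):
        z_doc = []
        for w in doc:
            k = rng.randrange(n_topics)
            z_doc.append(k)
            ndk[d][k] += 1
            nkw[k][word_id[w]] += 1
            nk[k] += 1
        assignments.append(z_doc)

    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                v = word_id[w]
                k = assignments[d][i]
                # Remove this token's current assignment from the counts.
                ndk[d][k] -= 1
                nkw[k][v] -= 1
                nk[k] -= 1
                # Unnormalized conditional probability of each topic.
                weights = [
                    (ndk[d][t] + alpha) * (nkw[t][v] + beta) / (nk[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = k
                ndk[d][k] += 1
                nkw[k][v] += 1
                nk[k] += 1

    # Smoothed estimates of the two distributions LDA is defined by.
    doc_topic = [
        [(ndk[d][t] + alpha) / (len(doc) + n_topics * alpha) for t in range(n_topics)]
        for d, doc in enumerate(docs)
    ]
    topic_word = [
        [(nkw[t][v] + beta) / (nk[t] + n_words * beta) for v in range(n_words)]
        for t in range(n_topics)
    ]
    return doc_topic, topic_word, vocab


# Toy corpus (made up for illustration), already tokenized.
docs = [
    "scheduling production inventory manufacturing scheduling".split(),
    "inventory manufacturing production planning".split(),
    "patent technology innovation management patent".split(),
    "technology innovation management forecasting".split(),
]
doc_topic, topic_word, vocab = lda_gibbs(docs, n_topics=2)

for t, dist in enumerate(topic_word):
    top = sorted(range(len(vocab)), key=lambda v: dist[v], reverse=True)[:3]
    print(f"topic {t}:", [vocab[v] for v in top])
```

In practice a library implementation would normally be used for a corpus of this size, and the number of topics (50 in the study above) is a modeling choice rather than something the algorithm determines on its own.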

Changes and Applications of Rural Tourism in the Post-COVID-19 Era through Social Data Analysis (소셜데이터 분석을 통한 포스트 코로나 시대 농촌관광의 변화와 적용방안)

  • Kim, Young-Jin;Lee, Sung-hee;Son, Yong-hoon
    • Journal of Korean Society of Rural Planning / v.27 no.4 / pp.43-54 / 2021
  • This study analysed changes in rural tourism before and after COVID-19 using LDA topic analysis. To understand these changes, blog data including the keyword 'Gochang-gun travel' was used. From LDA topic analysis of the retrieved blog data, the study found nine topics in 2019 and 2020. The topics in 2019 and 2020 are generally consistent, but the three topics related to rural experiential tourism that appeared in 2019 did not appear in 2020. In 2020, three new topics emerged: beach vacations and camping; new travel activities without contact with other people ('untact' tourism in the Korean context) in the COVID-19 era; and the negative impacts of COVID-19 on travel businesses and behaviours. In particular, the adverse effects of COVID-19 caused an enormous decline in rural experience tourism destinations and the cancellation of local festivals. On the other hand, new tourism activities emerged due to COVID-19, including camping, drive-thru destinations, and cycling. Ecological and natural tourist sites such as Ungok Wetland, Seonunsan Mountain, Seonunsa Temple, and Gusipo Beach appeared. These destinations have a quiet atmosphere and low visitor density, so visitors can avoid contact with other people. Also, because overseas travel became difficult, long-term stays in rural areas appeared. This study indicates that, with these positive and negative impacts, COVID-19 has affected rural tourism less than other tourism destinations.

Research Trends Analysis on ESG Using Unsupervised Learning

  • Woo-Ryeong YANG;Hoe-Chang YANG
    • The Journal of Economics, Marketing and Management / v.11 no.3 / pp.47-66 / 2023
  • Purpose: The purpose of this study is to identify trends in ESG research by domestic and overseas researchers to date and, by comparing the derived topics, to present research directions and clues for applying ESG to Korean companies and for ESG practice in the future. Research design, data and methodology: As of October 20, 2022, a search for the keyword 'ESG' in 'scienceON' yielded 341 domestic papers with English abstracts and 1,173 overseas papers. Word frequency analysis, word co-occurrence frequency analysis, BERTopic, LDA, and OLS regression analysis were performed using Python 3.7 to confirm trends for each topic. Results: Word frequency analysis showed that words such as management, company, performance, and value were commonly used in both domestic and overseas papers. Words such as activity and responsibility appeared in the top 20 words of domestic papers, and words such as sustainability, impact, and development in those of overseas papers. Co-occurrence frequency analysis confirmed that domestic papers were related mainly to words such as company, management, and activity, and overseas papers to words such as investment, sustainability, and performance. Topic modeling derived 3 topics for domestic papers, such as one named 'ESG from the corporate perspective', and a total of 7 topics for overseas papers, such as one named 'sustainable investment'. In the annual trend analysis, no topic showed a relatively increasing or decreasing tendency, confirming that all topics were neutral. Conclusions: The results of this study confirmed that, although it is desirable that domestic papers have recently started research on consumers, their subject diversity is lower than that of overseas papers. Therefore, it is suggested that future research needs to approach various topics such as forecasting future risks related to ESG and corporate evaluation methods.

Text Network Analysis on Stalking-Related News Articles (스토킹 관련 언론기사에 대한 텍스트네트워크분석)

  • Eun-Sun Ji;Sang-Hee Jeong
    • The Journal of the Convergence on Culture Technology / v.9 no.3 / pp.579-585 / 2023
  • The purpose of this study is to explore keywords in stalking-related news articles according to political orientation through text network analysis, and then to examine their implicit intentions. A total of 1,607 articles reported from January 1, 2018 to December 31, 2022 were selected, comprising 824 articles from the conservative press (The Chosun Ilbo, The Joongang Ilbo) and 783 articles from the progressive press (The Hankyoreh, The Kyunghyang Shinmun), and the topic categories were explored using topic modeling based on LDA (Latent Dirichlet Allocation). The topics common to the conservative and progressive press were improvement of the perception of gender-based violence, personal protection and intensity of punishment, and disclosure of stalkers' personal information. As for the topics that differed between the two, the conservative press covered stalkers' harmful acts and the outline of the 'murder case at Sindang Station', while the progressive press covered requests for aggravated punishment in the 'murder case at Sindang Station' and eradication of sexual exploitation crime (in cyberspace). The results imply that the type of reporting on stalking in news articles varies according to ideological orientation.

Research trend analysis of Korean new graduate nurses using topic modeling (토픽모델링을 활용한 신규간호사 관련 국내 연구동향 분석)

  • Park, Seungmi;Lee, Jung Lim
    • The Journal of Korean Academic Society of Nursing Education / v.27 no.3 / pp.240-250 / 2021
  • Purpose: The aim of this study is to analyze research trends in articles on new graduate nurses in Korea over the past 10 years in order to explore strategies for clinical adaptation. Methods: Topics concerning new graduate nurses were extracted from 110 articles published in Korean journals between January 2010 and July 2020. Abstracts were retrieved from 4 databases (DBpia, RISS, KISS, and Google Scholar). Keywords were extracted from the abstracts and cleaned using semantic morphemes. Network analysis and topic modeling were performed using the NetMiner program. Results: The core keywords included 'education', 'training', 'program', 'skill', 'care', 'performance', and 'satisfaction'. In recent articles on new graduate nurses, three major topics were extracted using Latent Dirichlet Allocation (LDA): 'turnover', 'adaptation', and 'education'. Conclusion: Previous articles focused on exploring the factors related to the adaptation and turnover intentions of new graduate nurses. Further research is needed on interventions at the individual, task, and organizational levels to improve the retention of new graduate nurses.

Changes in the Perception of Second-hand Fashion Consumption in the Post-pandemic Era (포스트 팬데믹 시대의 중고 패션 소비 인식 변화)

  • Kim, Habin;Lee, Ha Kyung
    • Fashion & Textile Research Journal / v.24 no.1 / pp.66-80 / 2022
  • Even before the Covid-19 outbreak, the second-hand fashion market had been growing as the fashion industry strove towards sustainability. Its growth has also accelerated due to the economic contraction caused by the pandemic. The second-hand market has been studied steadily; however, the research is insufficient relative to how diversified the market has become. Therefore, this study investigates changes in consumers' perception of the second-hand fashion market affected by Covid-19. This study collected text data with the keyword 'second-hand fashion' from various blogs. We analyzed 24,000 posts from before and after the Covid-19 outbreak by applying the LDA algorithm for topic modeling and content analysis. Seven topics were derived for the period before the pandemic and nine for the period after. The results revealed that during the pandemic consumers realized the practical value of sustainability in their daily lives more than they did before the pandemic. Furthermore, they tried to minimize transaction anxiety by using diverse platforms with advanced technology. They also realized economic value by buying and selling sneakers in the popular sneaker resale market. The results could help in understanding the rapidly growing second-hand fashion market during Covid-19.

Topic change monitoring study based on Blue House national petition using a control chart (관리도를 활용한 국민청원 토픽 모니터링 연구)

  • Lee, Heeyeon;Choi, Jieun;Lee, Sungim;Son, Won
    • The Korean Journal of Applied Statistics / v.34 no.5 / pp.795-806 / 2021
  • Recently, as text data from online channels have grown vast, there is growing interest in research that summarizes and analyzes them. One of the fundamental analyses of text data is the extraction of latent topics. Although a researcher may read all the data and summarize the contents one by one, this is not practical for large amounts of data. Blei and Lafferty (2007) and Blei et al. (2003) proposed topic modeling methods for extracting topics using a statistical model. Since text data is generally collected over time, it is worthwhile to monitor how topics change. In this study, we propose a topic index based on the results of the topic model. In addition, a control chart, a representative tool for statistical process control, is applied to monitor the topic index over time. As a practical example, we use text data collected from Blue House National Petition boards between March 5, 2018, and March 5, 2020.
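The idea of monitoring a topic index with a control chart can be sketched as follows. This is a generic Shewhart-style chart with limits at the baseline mean plus or minus three standard deviations; the abstract does not say which chart type or index definition the authors used, so the functions `control_limits` and `out_of_control` and the weekly index values are hypothetical.

```python
from statistics import mean, stdev


def control_limits(baseline, k=3.0):
    """Return (lower, center, upper) limits from an in-control baseline."""
    center = mean(baseline)
    spread = stdev(baseline)  # sample standard deviation
    return center - k * spread, center, center + k * spread


def out_of_control(series, lower, upper):
    """Indices of observations that fall outside the control limits."""
    return [i for i, x in enumerate(series) if x < lower or x > upper]


# Made-up weekly values of a topic index (e.g. one topic's share of petitions).
baseline = [0.10, 0.12, 0.11, 0.09, 0.10, 0.11, 0.12, 0.10]
new_weeks = [0.11, 0.10, 0.25, 0.12]

lower, center, upper = control_limits(baseline)
signals = out_of_control(new_weeks, lower, upper)
print(f"limits: [{lower:.3f}, {upper:.3f}], signals at weeks: {signals}")
```

A flagged week indicates that the topic's index moved further from its baseline than ordinary variation would explain, which is the kind of change the study proposes to monitor.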