• Title/Summary/Keyword: Topic Modeling(LDA)

Search Result 292, Processing Time 0.03 seconds

Sentiment Analysis of Foot-and-Mouth Disease Using Tweet Text-Mining Technique (트윗 텍스트 마이닝 기법을 이용한 구제역의 감성분석)

  • Chae, Heechan;Lee, Jonguk;Choi, Yoona;Park, Daihee;Chung, Yongwha
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.11
    • /
    • pp.419-426
    • /
    • 2018
  • Due to the FMD(foot-and-mouth disease), the domestic animal husbandry and related industries suffer enormous damage every year. Although various academic researches related to FMD are ongoing, engineering studies on the social effects of FMD are very limited. In this study, we propose a systematic methodology to analyze emotional responses of regular citizens on FMD using text mining techniques. The proposed system first collects data related to FMD from the tweets posted on Twitter, and then performs a polarity classification process using a deep-learning technique. Second, keywords are extracted from the tweet using LDA, which is one of the typical techniques of topic modeling, and a keyword network is constructed from the extracted keywords. Finally, we analyze the various social effects of regular citizens on FMD through keyword network. As a case study, we performed the emotional analysis experiment of regular citizens about FMD from July 2010 to December 2011 in Korea.

Researcher and Research Area Recommendation System for Promoting Convergence Research Using Text Mining and Messenger UI (텍스트 마이닝 방법론과 메신저UI를 활용한 융합연구 촉진을 위한 연구자 및 연구 분야 추천 시스템의 제안)

  • Yang, Nak-Yeong;Kim, Sung-Geun;Kang, Ju-Young
    • The Journal of Information Systems
    • /
    • v.27 no.4
    • /
    • pp.71-96
    • /
    • 2018
  • Purpose Recently, social interest in the convergence research is at its peak. However, contrary to the keen interest in convergence research, an infrastructure that makes it easier to recruit researchers from other fields is not yet well established, which is why researchers are having considerable difficulty in carrying out real convergence research. In this study, we implemented a researcher recommendation system that helps researchers who want to collaborate easily recruit researchers from other fields, and we expect it to serve as a springboard for growth in the convergence research field. Design/methodology/approach In this study, we implemented a system that recommends proper researchers when users enter keyword in the field of research that they want to collaborate using word embedding techniques, word2vec. In addition, we also implemented function of keyword suggestions by using keywords drawn from LDA Topicmodeling Algorithm. Finally, the UI of the researcher recommendation system was completed by utilizing the collaborative messenger Slack to facilitate immediate exchange of information with the recommended researchers and to accommodate various applications for collaboration. Findings In this study, we validated the completed researcher recommendation system by ensuring that the list of researchers recommended by entering a specific keyword is accurate and that words learned as a similar word with a particular researcher match the researcher's field of research. The results showed 85.89% accuracy in the former, and in the latter case, mostly, the words drawn as similar words were found to match the researcher's field of research, leading to excellent performance of the researcher recommendation system.

Metaverse App Market and Leisure: Analysis on Oculus Apps (메타버스 앱 시장과 여가: 오큘러스 앱 분석)

  • Kim, Taekyung;Kim, Seongsu
    • Knowledge Management Research
    • /
    • v.23 no.2
    • /
    • pp.37-60
    • /
    • 2022
  • The growth of virtual reality games and the popularization of blockchain technology are bringing significant changes to the formation of the metaverse industry ecosystem. Especially, after Meta acquired Oculus, a VR device and application company, the growth of VR-based metaverse services is accelerating. In this study, the concept that supports leisure activities in the metaverse environment is explored realting to game-like features in VR apps, which differentiates traditional mobile apps based on a smart phone device. Using exploratory text mining methods and network analysis approches, 241 apps registed in the Oculus Quest 2 App Store were analyzed. Analysis results from a quasi-network show that a leisure concept is closely related to various genre features including a game and tourism. Additionally, the anlaysis results of G & F model indicate that the leisure concept is distictive in the view of gateway brokerage role. Those results were also confirmed in LDA topic modeling analysis.

Analysis of the ESG Research Trend : Focusing on SCOPUS DB (ESG 주요 연구 동향 분석: SCOPUS DB를 중심으로)

  • Kyoo-Sung Noh
    • Journal of Digital Convergence
    • /
    • v.21 no.2
    • /
    • pp.9-16
    • /
    • 2023
  • The purpose of this study is to analyze research trends on ESG (Environmental, Social, and Governance), and to present a direction for companies and investors to use ESG information. To this end, text mining, one of the atypical data mining techniques, was used for analysis. Thesis abstracts from January 2014 to February 2023 were collected from the SCOPUS database, and Economics, Econometrics and Finance were the most common. The United States and China published the most ESG papers, and Korea published the 6th most papers in the world. This study is meaningful in that it analyzed the main research trends of ESG using text mining techniques such as LDA and topic modeling. It was confirmed that ESG is being conducted in various fields, not in a specific field, and it is differentiated from previous studies in that it analyzed various influencing factors and ripple effects of ESG.

Classifying and Characterizing the Types of Gentrified Commercial Districts Based on Sense of Place Using Big Data: Focusing on 14 Districts in Seoul (빅데이터를 활용한 젠트리피케이션 상권의 장소성 분류와 특성 분석 -서울시 14개 주요상권을 중심으로-)

  • Young-Jae Kim;In Kwon Park
    • Journal of the Korean Regional Science Association
    • /
    • v.39 no.1
    • /
    • pp.3-20
    • /
    • 2023
  • This study aims to categorize the 14 major gentrified commercial areas of Seoul and analyze their characteristics based on their sense of place. To achieve this, we conducted hierarchical cluster analysis using text data collected from Naver Blog. We divided the districts into two dimensions: "experience" and "feature" and analyzed their characteristics using LDA (Latent Dirichlet Allocation) of the text data and statistical data collected from Seoul Open Data Square. As a result, we classified the commercial districts of Seoul into 5 categories: 'theater district,' 'traditional cultural district,' 'female-beauty district,' 'exclusive restaurant and medical district,' and 'trend-leading district.' The findings of this study are expected to provide valuable insights for policy-makers to develop more efficient and suitable commercial policies.

Analysis of the Knowledge Structure of Research related to Reality Shock Experienced by New Graduate Nurses using Text Network Analysis (텍스트네트워크분석을 활용한 신규간호사가 경험하는 현실충격 관련 연구의 지식구조 분석)

  • Heejang Yun
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.1
    • /
    • pp.463-469
    • /
    • 2023
  • The aim of this study is to provide basic data that can contribute to improving successful clinical adaptation and reducing turnover of new graduate nurses by analyzing research related to reality shock experienced by new graduate nurses using text network analysis. The topics of reality shock experienced by new graduate nurses were extracted from 115 papers published in domestic and foreign journals from January 2002 to December 2021. Articles were retrieved from 6 databases (Korean DB: DBpia, KISS, RISS /International DB: Web of science, Springer, Scopus). Keywords were extracted from the abstract and organized using semantic morphemes. Network analysis and topic modeling for subject knowledge structure analysis were performed using NetMiner 4.5.0 program. The core keywords included 'new graduate nurses', 'reality shock', 'transition', 'student nurse', 'experience', 'practice', 'work environment', 'role', 'care' and 'education'. In recent articles on reality shock experienced by new graduate nurses, three major topics were extracted by LDA (Latent Dirichlet Allocation) techniques: 'turnover', 'work environment', 'experience of transition'. Based on this research, the necessity of interventional research that can effectively reduce the reality shock experienced by new graduate nurses and successfully help clinical adaptation is suggested.

Adaptive User and Topic Modeling based Automatic TV Recommendation (적응적 사용자 및 토픽 모델링 기반의 자동 TV 프로그램 추천)

  • Kim, EunHui;Pyo, Shinjee;Kim, Munchurl
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.431-434
    • /
    • 2012
  • 시간 흐름에 따라 TV 프로그램 스케줄은 변화하고 스케줄의 변화는 사용자 선호에 영향을 미친다. 이러한 스케줄 변화에 따른 토픽의 흐름이 사용자 선호도에 미치는 영향 외에도, 개성에 따른 선호도의 변화는 개인별 차이가 크다. 본 논문은 사용자 선호도 변화에 적응적으로 대응하면서 시간 변화에도 일정한 관심을 보이는 사용자의 선호도에는 가중치를 더한 모델을 목표로 한다. 따라서 제안 모델은 현재의 시청 데이터를 기준으로 한 사용자별 선호도의 선행 정보(prior)로 이전 시청선호를 두었고, 선호도 변화와 일관성을 고려하여 하나의 시청길이에 대한 선호도뿐만 아니라 여러 시청 길이의 선호도를 결합한 선호도를 구성할 수 있는 확장성 있는 모델을 제시한다. 선호도의 일관성에 대한 가중치 연산에 있어 전체 확률모델의 확률을 향상시키는 연산을 통해 정교성을 더한 모델을 제시한다. 실제 사용자들이 시청한 데이터인 2011 TNMS데이터를 기준으로 제안 모델의 성능을 확인한 결과, 기존의 LDA, MDTM모델 보다 나은 성능을 보임을 확인할 수 있었으며, 1주일 단위 추천결과, 5개 추천 시, 최대 67.9%의 추천 정확도를 확인할 수 있었다.

  • PDF

A Study on the User Perception in Fashion Design through Social Media Text-Mining (소셜미디어 텍스트마이닝을 통한 패션디자인 사용자 인식 조사)

  • An, Hyosun;Park, Minjung
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.41 no.6
    • /
    • pp.1060-1070
    • /
    • 2017
  • This study seeks methods to analyze users' perception in fashion designs shown in social media using textmining analysis methods. The research methods selected 'men's stripe shirts' as subjects and collected texts related to the subject mainly from blogs. Texts from 13,648 posts from November 1st, 2015 to October 31st, 2016 were analyzed by applying the LDA algorithm and content analysis. As a result, the wearing status per season and subjects of men's stripe shirts were derived. Across the entire period, the main topics discussed by users to be pattern, customized suits, brands, coordination and purchase information. In terms of seasons, spring time showed the sharing of information on coordinating daily looks or boyfriend looks, and during the winter season the information shared were about shirts suitable for special occasions such as job interviews and stripe shirts that match suits. The study results showed that text-mining analysis is capable of analyzing the context and provide a user-centered index responding to demands newly mentioned by users along with the rapid changes in fashion design trends.

A Comparative Analysis of Travelers' Online Reviews among China, USA, and South Korea using Sentiment Analysis in the Era of the COVID-19 Pandemic (코로나19 팬데믹 상황에서 감성분석을 이용한 미국, 중국, 한국 여행자의 온라인 리뷰 비교 분석)

  • Hong, Junwoo;Hong, Taeho
    • Journal of Information Technology Services
    • /
    • v.20 no.5
    • /
    • pp.159-176
    • /
    • 2021
  • In this study, we performed a comparative analysis of the sentiment value for the tourists in USA, China, and Korea on the COVID19 pandemic era to explore and find out the features of the tourists by using online reviews. We collected a total of 243,826 online hotel reviews for metropolitan city and vacation spot in the three countries to compare the features between the business and the vacation trips. We collected the online reviews into the tow groups from Jan. 1, 2019 to Nov. 31, 2019 for before COVID19 pandemic and from Apr. 1, 2020 to Deb 28, 2021 for during COVID19. Online reviews were categorized into 6 dimensions using LDA model. Sentiment analysis were presented for 6 dimensions by utilizing a lexicon base. We proposed an approach to analyzing the importance of each attribute by applying 6-dimensional sentiment values to conjoint analysis. Our empirical analysis showed that the proposed approach could explore and find out the changed features of travelers during the COVID19 pandemic.

Sentiment Analysis on 'HelloTalk' App Reviews Using NRC Emotion Lexicon and GoEmotions Dataset

  • Simay Akar;Yang Sok Kim;Mi Jin Noh
    • Smart Media Journal
    • /
    • v.13 no.6
    • /
    • pp.35-43
    • /
    • 2024
  • During the post-pandemic period, the interest in foreign language learning surged, leading to increased usage of language-learning apps. With the rising demand for these apps, analyzing app reviews becomes essential, as they provide valuable insights into user experiences and suggestions for improvement. This research focuses on extracting insights into users' opinions, sentiments, and overall satisfaction from reviews of HelloTalk, one of the most renowned language-learning apps. We employed topic modeling and emotion analysis approaches to analyze reviews collected from the Google Play Store. Several experiments were conducted to evaluate the performance of sentiment classification models with different settings. In addition, we identified dominant emotions and topics within the app reviews using feature importance analysis. The experimental results show that the Random Forest model with topics and emotions outperforms other approaches in accuracy, recall, and F1 score. The findings reveal that topics emphasizing language learning and community interactions, as well as the use of language learning tools and the learning experience, are prominent. Moreover, the emotions of 'admiration' and 'annoyance' emerge as significant factors across all models. This research highlights that incorporating emotion scores into the model and utilizing a broader range of emotion labels enhances model performance.