• Title/Summary/Keyword: Text sentiment analysis

Search Result 241, Processing Time 0.031 seconds

RESEARCH ON SENTIMENT ANALYSIS METHOD BASED ON WEIBO COMMENTS

  • Li, Zhong-Shi;He, Lin;Guo, Wei-Jie;Jin, Zhe-Zhi
    • East Asian mathematical journal
    • /
    • v.37 no.5
    • /
    • pp.599-612
    • /
    • 2021
  • In China, Weibo is one of the social platforms with more users. It has the characteristics of fast information transmission and wide coverage. People can comment on a certain event on Weibo to express their emotions and attitudes. Judging the emotional tendency of users' comments is not only beneficial to the monitoring of the management department, but also has very high application value for rumor suppression, public opinion guidance, and marketing. This paper proposes a two-input Adaboost model based on TextCNN and BiLSTM. Use the TextCNN model that can perform local feature extraction and the BiLSTM model that can perform global feature extraction to process comment data in parallel. Finally, the classification results of the two models are fused through the improved Adaboost algorithm to improve the accuracy of text classification.

A Convergence Study on the Topic and Sentiment of COVID19 Research in Korea Using Text Analysis (텍스트 분석을 이용한 코로나19 관련 국내 논문의 주제 및 감성에 관한 융합 연구)

  • Heo, Seong-Min;Yang, Ji-Yeon
    • Journal of the Korea Convergence Society
    • /
    • v.12 no.4
    • /
    • pp.31-42
    • /
    • 2021
  • The purpose of this study was to explore research topics and examine the trend in COVID19 related research papers. We identified eight topics using latent Dirichlet allocation and found acceptable validity in comparison with the structural topic model. The subtopics have been extracted using k-means clustering and plotted in PCA space. Additionally, we discovered the topics bearing negative tones and warning signs by sentiment analysis. The results flagged up the issues of the topics, Biomedical Related, International Dynamics and Psychological Impact. The findings could serve as a guideline for researchers who explore new research directions and policymakers who need to make decisions about which research projects to support.

A Study on the Relationship between the Emotions of the MZ Generation Revealed in Online Communities and Public Opinion Surveys (온라인 커뮤니티에 드러난 MZ세대의 감성과 여론조사 간 상관관계에 관한 연구)

  • HanByeol Stella Choi;Sulim Kim;Hee-Dong Yang
    • Journal of Information Technology Services
    • /
    • v.22 no.3
    • /
    • pp.101-118
    • /
    • 2023
  • The 'MZ generation' is accustomed to expressing their thoughts and opinions online. As a result, the role of social media in understanding the opinions and public sentiment of the MZ generation has become increasingly important. In particular, the role of social media in understanding the opinions of young people in political contexts such as policies and elections is becoming more significant. Traditionally, in such political situations, various institutions conduct opinion surveys to grasp the opinions of the people. However, existing opinion surveys have many errors and limitations in understanding the specific opinions of the entire population since they are conducted on arbitrary individuals through survey techniques. Online communities are representative social media that share the opinions of the public on specific issues such as politics, economics, and culture. Therefore, online communities are widely used as a means to supplement the limitations of traditional opinion polls. In particular, the MZ generation is familiar with online platforms, and their political support has significant influence on election results and policy decisions. With this regard, this study analyzed the relationship between the sentiment reflected in online community text data by age group on major candidates and public opinion survey support rates during the Korean presidential election for those in their 20s. The analysis showed that negative sentiments reflected in online communities by the MZ generation have a negative correlation with public opinion survey support rates. This study contributes to theory and practice by revealing a significant association between social media and public opinion polls.

An Analysis of Newspaper Articles on Fine Particle Matter Using Text Mining Techniques (텍스트마이닝을 이용한 미세먼지 관련 신문기사 분석)

  • Yang, Ji-Yeon
    • Journal of Digital Convergence
    • /
    • v.20 no.1
    • /
    • pp.1-13
    • /
    • 2022
  • This study aims to examine the trend and characteristics of newspaper articles concerned with fine particle matter. Newspaper articles since 1995 collected from Bigkinds were analyzed using text mining techniques, sentiment analysis and regression analysis. Air pollution measurement and domestic pollutants appeared frequently previously, but "China" became the keyword in the 2010s along with political action, the effects on the health, AD/PR, and domestic pollutants. Korea JoongAng Daily, Hankyoreh and Kyunghyang Shinmun have had more focused on political regulations whereas most regional daily newspapers on emission sources and reduction measures at the regional level. The results of this study are expected to be used as a reference for understanding the trend of newspaper articles. Future work includes further analysis and discussion of fine particle pollution condition and news reports in the post-COVID era.

Understanding the Sentiment on Gig Economy: Good or Bad?

  • NORAZMI, Fatin Aimi Naemah;MAZLAN, Nur Syazwani;SAID, Rusmawati;OK RAHMAT, Rahmita Wirza
    • The Journal of Asian Finance, Economics and Business
    • /
    • v.9 no.10
    • /
    • pp.189-200
    • /
    • 2022
  • The gig economy offers many advantages, such as flexibility, variety, independence, and lower cost. However, there are also safety concerns, lack of regulations, uncertainty, and unsatisfactory services, causing people to voice their opinion on social media. This paper aims to explore the sentiments of consumers concerning gig economy services (Grab, Foodpanda and Airbnb) through the analysis of social media. First, Vader Lexicon was used to classify the comments into positive, negative, and neutral sentiments. Then, the comments were further classified into three machine learning algorithms: Support Vector Machine, Light Gradient Boosted Machine, and Logistic Regression. Results suggested that gig economy services in Malaysia received more positive sentiments (52%) than negative sentiments (19%) and neutral sentiments (29%). Based on the three algorithms used in this research, LGBM has been the best model with the highest accuracy of 85%, while SVM has 84% and LR 82%. The results of this study proved the power of text mining and sentiment analysis in extracting business value and providing insight to businesses. Additionally, it aids gig managers and service providers in understanding clients' sentiments about their goods and services and making necessary adjustments to optimize satisfaction.

Topic Extraction and Classification Method Based on Comment Sets

  • Tan, Xiaodong
    • Journal of Information Processing Systems
    • /
    • v.16 no.2
    • /
    • pp.329-342
    • /
    • 2020
  • In recent years, emotional text classification is one of the essential research contents in the field of natural language processing. It has been widely used in the sentiment analysis of commodities like hotels, and other commentary corpus. This paper proposes an improved W-LDA (weighted latent Dirichlet allocation) topic model to improve the shortcomings of traditional LDA topic models. In the process of the topic of word sampling and its word distribution expectation calculation of the Gibbs of the W-LDA topic model. An average weighted value is adopted to avoid topic-related words from being submerged by high-frequency words, to improve the distinction of the topic. It further integrates the highest classification of the algorithm of support vector machine based on the extracted high-quality document-topic distribution and topic-word vectors. Finally, an efficient integration method is constructed for the analysis and extraction of emotional words, topic distribution calculations, and sentiment classification. Through tests on real teaching evaluation data and test set of public comment set, the results show that the method proposed in the paper has distinct advantages compared with other two typical algorithms in terms of subject differentiation, classification precision, and F1-measure.

News based Stock Market Sentiment Lexicon Acquisition Using Word2Vec (Word2Vec을 활용한 뉴스 기반 주가지수 방향성 예측용 감성 사전 구축)

  • Kim, Daye;Lee, Youngin
    • The Journal of Bigdata
    • /
    • v.3 no.1
    • /
    • pp.13-20
    • /
    • 2018
  • Stock market prediction has been long dream for researchers as well as the public. Forecasting ever-changing stock market, though, proved a Herculean task. This study proposes a novel stock market sentiment lexicon acquisition system that can predict the growth (or decline) of stock market index, based on economic news. For this purpose, we have collected 3-year's economic news from January 2015 to December 2017 and adopted Word2Vec model to consider the context of words. To evaluate the result, we performed sentiment analysis to collected news data with the automated constructed lexicon and compared with closings of the KOSPI (Korea Composite Stock Price Index), the South Korean stock market index based on economic news.

Conveyed Message in YouTube Product Review Videos: The discrepancy between sponsored and non-sponsored product review videos

  • Kim, Do Hun;Suh, Ji Hae
    • The Journal of Information Systems
    • /
    • v.32 no.4
    • /
    • pp.29-50
    • /
    • 2023
  • Purpose The impact of online reviews is widely acknowledged, with extensive research focused on text-based reviews. However, there's a lack of research regarding reviews in video format. To address this gap, this study aims to explore the connection between company-sponsored product review videos and the extent of directive speech within them. This article analyzed viewer sentiments expressed in video comments based on the level of directive speech used by the presenter. Design/methodology/approach This study involved analyzing speech acts in review videos based on sponsorship and examining consumer reactions through sentiment analysis of comments. We used Speech Act theory to perform the analysis. Findings YouTubers who receive company sponsorship for review videos tend to employ more directive speech. Furthermore, this increased use of directive speech is associated with a higher occurrence of negative consumer comments. This study's outcomes are valuable for the realm of user-generated content and natural language processing, offering practical insights for YouTube marketing strategies.

A Study of 'Emotion Trigger' by Text Mining Techniques (텍스트 마이닝을 이용한 감정 유발 요인 'Emotion Trigger'에 관한 연구)

  • An, Juyoung;Bae, Junghwan;Han, Namgi;Song, Min
    • Journal of Intelligence and Information Systems
    • /
    • v.21 no.2
    • /
    • pp.69-92
    • /
    • 2015
  • The explosion of social media data has led to apply text-mining techniques to analyze big social media data in a more rigorous manner. Even if social media text analysis algorithms were improved, previous approaches to social media text analysis have some limitations. In the field of sentiment analysis of social media written in Korean, there are two typical approaches. One is the linguistic approach using machine learning, which is the most common approach. Some studies have been conducted by adding grammatical factors to feature sets for training classification model. The other approach adopts the semantic analysis method to sentiment analysis, but this approach is mainly applied to English texts. To overcome these limitations, this study applies the Word2Vec algorithm which is an extension of the neural network algorithms to deal with more extensive semantic features that were underestimated in existing sentiment analysis. The result from adopting the Word2Vec algorithm is compared to the result from co-occurrence analysis to identify the difference between two approaches. The results show that the distribution related word extracted by Word2Vec algorithm in that the words represent some emotion about the keyword used are three times more than extracted by co-occurrence analysis. The reason of the difference between two results comes from Word2Vec's semantic features vectorization. Therefore, it is possible to say that Word2Vec algorithm is able to catch the hidden related words which have not been found in traditional analysis. In addition, Part Of Speech (POS) tagging for Korean is used to detect adjective as "emotional word" in Korean. In addition, the emotion words extracted from the text are converted into word vector by the Word2Vec algorithm to find related words. Among these related words, noun words are selected because each word of them would have causal relationship with "emotional word" in the sentence. The process of extracting these trigger factor of emotional word is named "Emotion Trigger" in this study. As a case study, the datasets used in the study are collected by searching using three keywords: professor, prosecutor, and doctor in that these keywords contain rich public emotion and opinion. Advanced data collecting was conducted to select secondary keywords for data gathering. The secondary keywords for each keyword used to gather the data to be used in actual analysis are followed: Professor (sexual assault, misappropriation of research money, recruitment irregularities, polifessor), Doctor (Shin hae-chul sky hospital, drinking and plastic surgery, rebate) Prosecutor (lewd behavior, sponsor). The size of the text data is about to 100,000(Professor: 25720, Doctor: 35110, Prosecutor: 43225) and the data are gathered from news, blog, and twitter to reflect various level of public emotion into text data analysis. As a visualization method, Gephi (http://gephi.github.io) was used and every program used in text processing and analysis are java coding. The contributions of this study are as follows: First, different approaches for sentiment analysis are integrated to overcome the limitations of existing approaches. Secondly, finding Emotion Trigger can detect the hidden connections to public emotion which existing method cannot detect. Finally, the approach used in this study could be generalized regardless of types of text data. The limitation of this study is that it is hard to say the word extracted by Emotion Trigger processing has significantly causal relationship with emotional word in a sentence. The future study will be conducted to clarify the causal relationship between emotional words and the words extracted by Emotion Trigger by comparing with the relationships manually tagged. Furthermore, the text data used in Emotion Trigger are twitter, so the data have a number of distinct features which we did not deal with in this study. These features will be considered in further study.

Korean Voice Phishing Text Classification Performance Analysis Using Machine Learning Techniques (머신러닝 기법을 이용한 한국어 보이스피싱 텍스트 분류 성능 분석)

  • Boussougou, Milandu Keith Moussavou;Jin, Sangyoon;Chang, Daeho;Park, Dong-Joo
    • Annual Conference of KIPS
    • /
    • 2021.11a
    • /
    • pp.297-299
    • /
    • 2021
  • Text classification is one of the popular tasks in Natural Language Processing (NLP) used to classify text or document applications such as sentiment analysis and email filtering. Nowadays, state-of-the-art (SOTA) Machine Learning (ML) and Deep Learning (DL) algorithms are the core engine used to perform these classification tasks with high accuracy, and they show satisfying results. This paper conducts a benchmarking performance's analysis of multiple SOTA algorithms on the first known labeled Korean voice phishing dataset called KorCCVi. Experimental results reveal performed on a test set of 366 samples reveal which algorithm performs the best considering the training time and metrics such as accuracy and F1 score.