• Title/Summary/Keyword: Sentiment mining

Search Result 239, Processing Time 0.04 seconds

Sentiment analysis of online food product review using ensemble technique (앙상블 기법을 활용한 온라인 음식 상품 리뷰 감성 분석)

  • Kim, Han-Min;Park, Kyungbo
    • Journal of Digital Convergence
    • /
    • v.17 no.4
    • /
    • pp.115-122
    • /
    • 2019
  • In the online marketplace, consumers are exposed to various products and freely express opinions. As consumer product reviews have a important effect on the success of online markets and other consumers, online market needs to accurately analyze the consumers' emotions about their products. Text mining, which is one of the data analysis techniques, can analyze the consumer's reviews on the products and efficiently manage the products. Previous studies have analyzed specific domains and less than 20,000 data, despite the different accuracy of the analysis results depending on the data domain and size. Further, there are few studies on additional factors that can improve the accuracy of analysis. This study analyzed 72,530 review data of food product domain that was not mainly covered in previous studies by using ensemble technique. We also examined the influence of summary review on improving accuracy of analysis. As a result of the study, this study found that Boosting ensemble technique has the highest accuracy of analysis. In addition, the summary review contributed to improving accuracy of the analysis.

Research on the change of perception of abandoned dogs through big data analysis

  • Jang, Ji-Yun;Lee, Seok-Won
    • Journal of the Korea Society of Computer and Information
    • /
    • v.26 no.9
    • /
    • pp.115-123
    • /
    • 2021
  • This study aims to analyze the changes in public perception of abandoned dogs through big data analysis. Data from January 2017 to July 2020 were collected to analyze how the quantitative change in social issues with abandoned dogs as a keyword had an effect on public perception of abandoned dogs, and factors that influence positive/negative perceptions. As a result of the study, it was confirmed that the number of stray dogs and the number of documents related to stray dogs had a positive correlation, and specific time series changes were found through various analysis techniques such as text mining, network analysis, and sentiment analysis. This study will have significance as basic data that can be used for policy establishment or other research on abandoned dogs. we hope it will help to solve problems so as to improve awareness of abandoned dogs and develop a sense of responsibility.

Analysis of Urban-to-Rural Migrants' Perceptions of the 'Everyday Landscape' Using Diary-Based Text Mining (일기를 통해 본 귀농·귀촌인 '일상 경관' 인식 - 텍스트 마이닝 적용 -)

  • OH Jungshim
    • Korean Journal of Heritage: History & Science
    • /
    • v.57 no.3
    • /
    • pp.184-199
    • /
    • 2024
  • This study was conducted in response to the global trend of emphasizing the importance of "everyday landscapes", focusing on the perspective of those who have returned to rural life. With a focus on the case of Gokseong-gun in Jeollanam-do, 460 diaries written by these individuals were collected and analyzed using text mining techniques such as "frequency analysis", "topic modeling", and "sentiment analysis". The analysis of noun morphemes was interpreted from a cognitive aspect, while adjective morphemes were interpreted from an emotional aspect. In particular, this study applied semantic network analysis to overcome the limitations of existing sentiment analysis, and extracted a word network list and examined the content of nouns connected to adjectives that express emotions to identify the targets and contents of sentiments. This method represents a differentiated approach that is not commonly found in existing research. One of the intriguing findings is that the urban-to-rural migrants identified everyday landscapes such as "flowers on neighborhood walking paths", "harvest of a garden", "neighborhood events", and "cozy cafe spaces" as important. These elements all contain visual and enjoyable aspects of everyday landscapes. Currently, many rural villages are attempting to add visual elements to their everyday landscapes by unifying roof colors or painting murals on walls. However, such artificial measures do not necessarily leave a lasting impression on people. A critical review of current policies and systems is necessary. This research is significant because it is the first to study everyday landscapes from the perspective of urban-to-rural migration using diaries and text mining. With a lack of domestic research on everyday landscapes, this study hopes to contribute to the activation of related research in Korea.

An Experimental Evaluation of Short Opinion Document Classification Using A Word Pattern Frequency (단어패턴 빈도를 이용한 단문 오피니언 문서 분류기법의 실험적 평가)

  • Chang, Jae-Young;Kim, Ilmin
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.12 no.5
    • /
    • pp.243-253
    • /
    • 2012
  • An opinion mining technique which was developed from document classification in area of data mining now becomes a common interest in domestic as well as international industries. The core of opinion mining is to decide precisely whether an opinion document is a positive or negative one. Although many related approaches have been previously proposed, a classification accuracy was not satisfiable enough to applying them in practical applications. A opinion documents written in Korean are not easy to determine a polarity automatically because they often include various and ungrammatical words in expressing subjective opinions. Proposed in this paper is a new approach of classification of opinion documents, which considers only a frequency of word patterns and excludes the grammatical factors as much as possible. In proposed method, we express a document into a bag of words and then apply a learning algorithm using a frequency of word patterns, and finally decide the polarity of the document using a score function. Additionally, we also present the experiment results for evaluating the accuracy of the proposed method.

Rating Individual Food Items of Restaurant Menu based on Online Customer Reviews using Text Mining Technique (신뢰성있는 온라인 고객 리뷰 텍스트 마이닝 기반 식당 개별 음식 아이템 평가)

  • Syed, Muzamil Hussain;Chung, Sun-Tae
    • Annual Conference of KIPS
    • /
    • 2020.05a
    • /
    • pp.389-392
    • /
    • 2020
  • The growth in social media, blogs and restaurant listing directories have led to increasing customer reviews about restaurants, their quality of food items and services available on the internet. These user reviews offer a massive amount of valuable information that can be used for various decision-making purposes. Currently, most food recommendation sites provide recommendation scores about restaurants rather than food items of the restaurant and the provided recommendation scores may be biased since they are calculated only from user reviews listed only in their sites. Usually, people wants a reliable recommendation about foods, not restaurant. In this paper, we present a reliable Korean food items rating method; we first extract food items by applying NER technique to restaurant reviews collected from many Korean restaurant recommendation web sites, blogs and web data. Then, we apply lexicon-based sentiment analysis on collected user reviews and predict people's opinions as sentiment polarity scores (+1 for positive; -1 for negative; 0 for neutral). Finally, by taking average of all calculated polarity scores about a food item, we obtain a rating to individual menu items of the restaurant. The proposed food item rating is more reliable since it does not depend on reviews of only one site.

Measuring Hotel Service Quality Using Social Media Analytics: The Moderating Effects of Brand of Origin

  • Byounggu Choi;Shin-Hyeok Kang
    • Asia pacific journal of information systems
    • /
    • v.33 no.3
    • /
    • pp.677-701
    • /
    • 2023
  • With the rapid advancement of social media analytics and artificial intelligence, many studies have used online customer reviews as an important source to measure service quality in many industries, including the hotel industry. However, these studies have failed to identify the relative importance of different dimensions of service quality and their role in customer satisfaction. To fill this research gap, this study aims to identify the effects of service quality on hotel customer satisfaction from the multidimensional perspectives using sentiment analysis with self-training on online reviews. Additionally, the moderating role of the brand of origin for each service quality dimension is also investigated. Drawing on the SERVQUAL model and brand of origin concept, this study develops 12 hypotheses and empirically tests them using 30,070 online customer hotel reviews collected from TripAdvisor.com. The results indicated that overall service quality and each dimension of SERVQUAL significantly influenced customer satisfaction of hotels. The results also confirmed the moderating effects of brand of origin on overall service quality. However, the moderating effects of brand of origin for the tangible, reliability, and empathy dimensions of service quality were significant, whereas the effects for responsiveness and assurance were not. This study sheds new light on service quality measurement by analyzing the multidimensional features of service quality and the role of brand of origin in the hotel service context.

An Experimental Study on Opinion Classification Using Supervised Latent Semantic Indexing(LSI) (지도적 잠재의미색인(LSI)기법을 이용한 의견 문서 자동 분류에 관한 실험적 연구)

  • Lee, Ji-Hye;Chung, Young-Mee
    • Journal of the Korean Society for information Management
    • /
    • v.26 no.3
    • /
    • pp.451-462
    • /
    • 2009
  • The aim of this study is to apply latent semantic indexing(LSI) techniques for efficient automatic classification of opinionated documents. For the experiments, we collected 1,000 opinionated documents such as reviews and news, with 500 among them labelled as positive documents and the remaining 500 as negative. In this study, sets of content words and sentiment words were extracted using a POS tagger in order to identify the optimal feature set in opinion classification. Findings addressed that it was more effective to employ LSI techniques than using a term indexing method in sentiment classification. The best performance was achieved by a supervised LSI technique.

Sentiment Prediction using Emotion and Context Information in Unstructured Documents (비정형 문서에서 감정과 상황 정보를 이용한 감성 예측)

  • Kim, Jin-Su
    • Journal of Convergence for Information Technology
    • /
    • v.10 no.10
    • /
    • pp.40-46
    • /
    • 2020
  • With the development of the Internet, users share their experiences and opinions. Since related keywords are used witho0ut considering information such as the general emotion or genre of an unstructured document such as a movie review, the sensitivity accuracy according to the appropriate emotional situation is impaired. Therefore, we propose a system that predicts emotions based on information such as the genre to which the unstructured document created by users belongs or overall emotions. First, representative keyword related to emotion sets such as Joy, Anger, Fear, and Sadness are extracted from the unstructured document, and the normalized weights of the emotional feature words and information of the unstructured document are trained in a system that combines CNN and LSTM as a training set. Finally, by testing the refined words extracted through movie information, morpheme analyzer and n-gram, emoticons, and emojis, it was shown that the accuracy of emotion prediction using emotions and F-measure were improved. The proposed prediction system can predict sentiment appropriately according to the situation by avoiding the error of judging negative due to the use of sad words in sad movies and scary words in horror movies.

A Sentiment Analysis Algorithm for Automatic Product Reviews Classification in On-Line Shopping Mall (온라인 쇼핑몰의 상품평 자동분류를 위한 감성분석 알고리즘)

  • Chang, Jae-Young
    • The Journal of Society for e-Business Studies
    • /
    • v.14 no.4
    • /
    • pp.19-33
    • /
    • 2009
  • With the continuously increasing volume of e-commerce transactions, it is now popular to buy some products and to evaluate them on the World Wide Web. The product reviews are very useful to customers because they can make better decisions based on the indirect experiences obtainable through the reviews. Product Reviews are results expressing customer's sentiments and thus are divided into positive reviews and negative ones. However, as the number of reviews in on-line shopping increases, it is inefficient or sometimes impossible for users to read all the relevant review documents. In this paper, we present a sentiment analysis algorithm for automatically classifying subjective opinions of customer's reviews using opinion mining technology. The proposed algorithm is to focus on product reviews of on-line shopping, and provides summarized results from large product review data by determining whether they are positive or negative. Additionally, this paper introduces an automatic review analysis system implemented based on the proposed algorithm, and also present the experiment results for verifying the efficiency of the algorithm.

  • PDF

User Sentiment Analysis on Amazon Fashion Product Review Using Word Embedding (워드 임베딩을 이용한 아마존 패션 상품 리뷰의 사용자 감성 분석)

  • Lee, Dong-yub;Jo, Jae-Choon;Lim, Heui-Seok
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.4
    • /
    • pp.1-8
    • /
    • 2017
  • In the modern society, the size of the fashion market is continuously increasing both overseas and domestic. When purchasing a product through e-commerce, the evaluation data for the product created by other consumers has an effect on the consumer's decision to purchase the product. By analysing the consumer's evaluation data on the product the company can reflect consumer's opinion which can leads to positive affect of performance to company. In this paper, we propose a method to construct a model to analyze user's sentiment using word embedding space formed by learning review data of amazon fashion products. Experiments were conducted by learning three SVM classifiers according to the number of positive and negative review data using the formed word embedding space which is formed by learning 5.7 million Amazon review data.. Experimental results showed the highest accuracy of 88.0% when learning SVM classifier using 50,000 positive review data and 50,000 negative review data.