• Title/Summary/Keyword: Online mining

Search Result 398, Processing Time 0.03 seconds

The Impact of Coupang Reviews on Product Sales : Based on FCB Grid Model (쿠팡 리뷰가 상품 매출에 미치는 영향 분석 : FCB Grid Model을 기준으로)

  • Ryu, Sung Gwan;Lee, Ji Young;Lee, Sang Woo
    • The Journal of Information Systems
    • /
    • v.31 no.2
    • /
    • pp.159-177
    • /
    • 2022
  • Purpose Online reviews are critical for sales of online shopping platforms because they provide useful information to consumers. As the eCommerce market grows rapidly, the role of online reviews is becoming more important. The purpose of this study is to analyze how online reviews written by domestic consumers affect product sales by classifying the types of products. Design/methodology/approach This study analyzed how the effects of review characteristics(reviewer reputation, reviewer exposure, review length, time, rating, image, and emotional score) on the usefulness of online reviews differ depending on the product types. Subsequently, how the impact of review attributes (review usefulness, number of reviews, ratings, and emotional scores) on product sales differs according to each product type was compared. Based on the FCB Grid model, the product type was classified into high involvement-rational, high involvement-emotional, low involvement -rational, and low involvement-emotional product types. Findings According to the analysis result, the characteristics of reviews useful to consumers were different for each product type, and the review attributes affecting product sales were also different for each product type. This study confirmed that it revealed that product characteristics are major consideration in evaluating the review usefulness and the factors affecting product sales.

Online Social Media Review Mining for Living Items with Probabilistic Approach: A Case Study

  • Li, Shuai;Hao, Fei;Kim, Hee-Cheol
    • Smart Media Journal
    • /
    • v.2 no.2
    • /
    • pp.20-27
    • /
    • 2013
  • The concept of social media is top of the agenda for many business executives and decision makers, as well as consultants try to identify ways where companies can make profitable use of applications such as Netflix, Flixster. The social media is playing an increasingly important role as the information sources for customers making product choices etc. With the flourish of Web 2.0 technology, customer reviews are becoming more and more useful and important information resources for people to save their time and energy on purchasing products that they want. This paper proposes the Bayesian Probabilistic Classification algorithm to mine the social media review, and evaluates it by different splits and cross validation mechanism from the real data set. The explored study experimental results show the robustness and effectiveness of proposed approach for mining the social media review.

  • PDF

Comparing Machine Learning Classifiers for Movie WOM Opinion Mining

  • Kim, Yoosin;Kwon, Do Young;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.8
    • /
    • pp.3169-3181
    • /
    • 2015
  • Nowadays, online word-of-mouth has become a powerful influencer to marketing and sales in business. Opinion mining and sentiment analysis is frequently adopted at market research and business analytics field for analyzing word-of-mouth content. However, there still remain several challengeable areas for 1) sentiment analysis aiming for Korean word-of-mouth content in film market, 2) availability of machine learning models only using linguistic features, 3) effect of the size of the feature set. This study took a sample of 10,000 movie reviews which had posted extremely negative/positive rating in a movie portal site, and conducted sentiment analysis with four machine learning algorithms: naïve Bayesian, decision tree, neural network, and support vector machines. We found neural network and support vector machine produced better accuracy than naïve Bayesian and decision tree on every size of the feature set. Besides, the performance of them was boosting with increasing of the feature set size.

Text Mining and Sentiment Analysis for Predicting Box Office Success

  • Kim, Yoosin;Kang, Mingon;Jeong, Seung Ryul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.8
    • /
    • pp.4090-4102
    • /
    • 2018
  • After emerging online communications, text mining and sentiment analysis has been frequently applied into analyzing electronic word-of-mouth. This study aims to develop a domain-specific lexicon of sentiment analysis to predict box office success in Korea film market and validate the feasibility of the lexicon. Natural language processing, a machine learning algorithm, and a lexicon-based sentiment classification method are employed. To create a movie domain sentiment lexicon, 233,631 reviews of 147 movies with popularity ratings is collected by a XML crawling package in R program. We accomplished 81.69% accuracy in sentiment classification by the Korean sentiment dictionary including 706 negative words and 617 positive words. The result showed a stronger positive relationship with box office success and consumers' sentiment as well as a significant positive effect in the linear regression for the predicting model. In addition, it reveals emotion in the user-generated content can be a more accurate clue to predict business success.

Tourism Destination Recommender System for the Cold Start Problem

  • Zheng, Xiaoyao;Luo, Yonglong;Xu, Zhiyun;Yu, Qingying;Lu, Lin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.7
    • /
    • pp.3192-3212
    • /
    • 2016
  • With the advent and popularity of e-commerce, an increasing number of consumers prefer to order tourism products online. A recommender system can help these users contend with information overload; however, such a system is affected by the cold start problem. Online tourism destination searching is a more difficult task than others on account of its more restrictive factors. In this paper, we therefore propose a tourism destination recommender system that employs opinion-mining technology to refine user preferences and item opinion reputations. These elements are then fused into a hybrid collaborative filtering method by combining user- and item-based collaborative filtering approaches. Meanwhile, we embed an artificial interactive module in our recommender system to alleviate the cold start problem. Compared with several well-known cold start recommendation approaches, our method provides improved recommendation accuracy and quality. A series of experimental evaluations using a publicly available dataset demonstrate that the proposed recommender system outperforms existing recommender systems in addressing the cold start problem.

Design and Implementation of a USN Middleware for Context-Aware and Sensor Stream Mining

  • Jin, Cheng-Hao;Lee, Yang-Koo;Lee, Seong-Ho;Yun, Un-il;Ryu, Keun-Ho
    • Spatial Information Research
    • /
    • v.19 no.1
    • /
    • pp.127-133
    • /
    • 2011
  • Recently, with the advances in sensor techniques and net work computing, Ubiquitous Sensor Network (USN) has been received a lot of attentions from various communities. The sensor nodes distributed in the sensor network tend to continuously generate a large amount of data, which is called stream data. Sensor stream data arrives in an online manner so that it is characterized as high-speed, real-time and unbounded and it requires fast data processing to get the up-to-date results. The data stream has many application domains such as traffic analysis, physical distribution, U-healthcare and so on. Therefore, there is an overwhelming need of a USN middleware for processing such online stream data to provide corresponding services to diverse applications. In this paper, we propose a novel USN middleware which can provide users both context-aware service and meaningful sequential patterns. Our proposed USN middleware is mainly focused on location based applications which use stream location data. We also show the implementation of our proposed USN middleware. By using the proposed USN middleware, we can save the developing cost of providing context aware services and stream sequential patterns mainly in location based applications.

A Sentiment Classification Approach of Sentences Clustering in Webcast Barrages

  • Li, Jun;Huang, Guimin;Zhou, Ya
    • Journal of Information Processing Systems
    • /
    • v.16 no.3
    • /
    • pp.718-732
    • /
    • 2020
  • Conducting sentiment analysis and opinion mining are challenging tasks in natural language processing. Many of the sentiment analysis and opinion mining applications focus on product reviews, social media reviews, forums and microblogs whose reviews are topic-similar and opinion-rich. In this paper, we try to analyze the sentiments of sentences from online webcast reviews that scroll across the screen, which we call live barrages. Contrary to social media comments or product reviews, the topics in live barrages are more fragmented, and there are plenty of invalid comments that we must remove in the preprocessing phase. To extract evaluative sentiment sentences, we proposed a novel approach that clusters the barrages from the same commenter to solve the problem of scattering the information for each barrage. The method developed in this paper contains two subtasks: in the data preprocessing phase, we cluster the sentences from the same commenter and remove unavailable sentences; and we use a semi-supervised machine learning approach, the naïve Bayes algorithm, to analyze the sentiment of the barrage. According to our experimental results, this method shows that it performs well in analyzing the sentiment of online webcast barrages.

Mining Loot Box News : Analysis of Keyword Similarities Using Word2Vec (확률형 아이템 뉴스 마이닝 : Word2Vec 활용한 키워드 유사도 분석)

  • Kim, Taekyung;Son, Wonseok;Jeon, Seongmin
    • Journal of Information Technology Services
    • /
    • v.20 no.2
    • /
    • pp.77-90
    • /
    • 2021
  • Online and mobile games represent digital entertainment. Not only the game grows fast, but also it has been noted for unique business models such as a subscription revenue model and free-to-play with partial payment. But, a recent revenue mechanism, called a loot-box system, has been criticized due to overspending, weak protection to teenagers, and more over gambling-like features. Policy makers and research communities have counted on expert opinions, review boards, and temporal survey studies to build countermeasures to minimize negative effects of online and mobile games. In this process, speed was not seriously considered. In this study, we attempt to use a big data source to find a way of observing a trend for policy makers and researchers. Specifically, we tried to apply the Word2Vec data mining algorithm to news repositories. From the findings, we acknowledged that the suggested design would be effective in lightening issues timely and precisely. This study contributes to digital entertainment service communities by providing a practical method to follow up trends; thus, helping practitioners have concrete grounds for balancing public concerns and business purposes.

Brand Fandom Dynamic Analysis Framework based on Customer Data in Online Communities

  • Yu Cheng;Sangwoo Park;Inseop Lee;Changryong Kim;Sanghun Sul
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.8
    • /
    • pp.2222-2240
    • /
    • 2023
  • Brand fandom refers to a collection of consumers with strong emotions toward a brand. Studying the dynamics of brand fandom can help brands understand which services or strategies influence their consumers to become a part of brand fandom. However, existing literature on fandom in the last three decades has mainly used qualitative methods, and there is still a lack of research on fandom using quantitative methods. Specifically, previous studies lack a framework for locating fandoms from online textual data and analyzing their dynamics. This study proposes a framework for exploring brand fandom dynamics based on online textual data. This framework consists of four phases based on the design thinking model: Preparing Data, Defining Fandom Categories, Generating Fandom Dynamics, and Analyzing Fandom Dynamics. This framework uses techniques such as social network analysis and process mining, combined with brand personality theory. We demonstrate the applicability of this framework using case studies of two Korean home appliance brands. The dataset contains 14,593 posts by consumers in 374 online communities. The results show that the proposed framework can analyze brand fandom dynamics using textual customer data. Our study contributes to the interdisciplinary research at the intersection of data-driven service design and consumer culture quantification.

Core Keywords Extraction forEvaluating Online Consumer Reviews Using a Decision Tree: Focusing on Star Ratings and Helpfulness Votes (의사결정나무를 활용한 온라인 소비자 리뷰 평가에 영향을 주는 핵심 키워드 도출 연구: 별점과 좋아요를 중심으로)

  • Min, Kyeong Su;Yoo, Dong Hee
    • The Journal of Information Systems
    • /
    • v.32 no.3
    • /
    • pp.133-150
    • /
    • 2023
  • Purpose This study aims to develop classification models using a decision tree algorithm to identify core keywords and rules influencing online consumer review evaluations for the robot vacuum cleaner on Amazon.com. The difference from previous studies is that we analyze core keywords that affect the evaluation results by dividing the subjects that evaluate online consumer reviews into self-evaluation (star ratings) and peer evaluation (helpfulness votes). We investigate whether the core keywords influencing star ratings and helpfulness votes vary across different products and whether there is a similarity in the core keywords related to star ratings or helpfulness votes across all products. Design/methodology/approach We used random under-sampling to balance the dataset. We progressively removed independent variables based on decreasing importance through backwards elimination to evaluate the classification model's performance. As a result, we identified classification models that best predict star ratings and helpfulness votes for each product's online consumer reviews. Findings We have identified that the core keywords influencing self-evaluation and peer evaluation vary across different products, and even for the same model or features, the core keywords are not consistent. Therefore, companies' producers and marketing managers need to analyze the core keywords of each product to highlight the advantages and prepare customized strategies that compensate for the shortcomings.