• Title/Summary/Keyword: 온라인 마이닝

Search Result 240, Processing Time 0.029 seconds

The Effects of Cultural Factors in Tourists' Restaurant Satisfaction: Using Text Mining and Online Reviews (문화적 요인이 관광객의 음식점 만족도에 미치는 영향: 텍스트 마이닝과 온라인 리뷰를 활용하여)

  • Jiajia Meng;Gee-Woo Bock;Han-Min Kim
    • Information Systems Review
    • /
    • v.25 no.1
    • /
    • pp.145-164
    • /
    • 2023
  • The proliferation of online reviews on dining experiences has significantly affected consumers' choices of restaurants, especially overseas. Food quality, service, ambiance, and price have been identified as specific attributes for the choice of a restaurant in prior studies. In addition to these four representative attributes, cultural factors, which may also significantly impact the choice of a restaurant for tourists, in particular, have not received much attention in previous studies. This study employs the text mining technique to analyze over 10,000 online reviews of 76 Korean restaurants posted by Chinese tourists on dianping.com to explore the influence of cultural factors on the consumer's choice of restaurants in the overseas travel context. The findings reveal that "Hallyu (Korean Wave)" influences Chinese tourists' dining experiences in Korea and their satisfaction. Moreover, Korean food-related words, such as cold noodle, bibimbap, rice cake, pig trotters, and kimchi stew, appeared across all the review topics. Our findings contribute to the existing tourism and hospitality literature by identifying the critical role of cultural factors on consumers', especially tourists', satisfaction with the choice of a restaurant using text mining. The findings also provide practical guidance to restaurant owners in Korea to attract more Chinese tourists.

Comparison of Online Shopping Mall BEST 100 using Exploratory Data Analysis (탐색적 자료 분석(EDA) 기법을 활용한 국내 11개 대표 온라인 쇼핑몰 BEST 100 비교)

  • Kang, Jicheon;Kang, Juyoung
    • The Journal of Bigdata
    • /
    • v.3 no.1
    • /
    • pp.1-12
    • /
    • 2018
  • Since the beginning of the first online shopping mall, BEST 100 is being provided as the core of all shopping mall websites. BEST 100 is greatly important because consumers can identify popular products at a glance. However, there are only studies using sales outcome indicators, and prior studies using BEST 100 are insignificant. Therefore, this study selected 11 online shopping malls and compared their main characteristics. As a research method, exploratory data analysis technique (EDA) was used by crawling the BEST 100 components of each shopping mall website, such as product name, price, and free shipping check. As a result, the total average price of 11 shopping malls was 72,891.41 won. Sales texts were classified into 8 categories by text mining. The most common category was the fashion part, but it is significant that the setting of the category analyzed the marketing text, not the product attribute. This study has implications for understanding the current online market flow and suggesting future directions by using EDA.

An Analysis of Information Diffusion in the Blog World (블로그 월드에서 정보 파급 분석)

  • Kwon, Yong-Suk;Kim, Sang-Wook;Park, Sun-Ju
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2008.05a
    • /
    • pp.223-226
    • /
    • 2008
  • 인터넷 기술의 발달로 인해 온라인상에서도 사회연결망이 나타나고 있다. 블로그 월드는 대표적인 온라인 사회연결망이다. 블로그 월드의 구성원인 블로거는 정보를 생성할 수도 있고, 정보를 얻기 위하여 다른 블로거와 명시적 관계를 맺을 수도 있으며, 이러한 관계를 통해 온라인 사회연결망인 블로그연결망을 구성한다. 사회 연결망 이론에서는 사회 연결망에서 정보의 파급이 구성원간의 관계를 통하여 이루어진다고 한다. 그러나 블로그 연결망과 실제 블로그 월드에서 발생한 정보 파급 이력을 비교 관찰해 보면, 사회연결망 이론과 달리 관계가 존재하지 않는 구성원 사이에서 정보 파급이 일어난다. 또한, 정보의 파급이 폭발적으로 일어나는 현상도 존재한다. 본 논문에서는 이러한 두 현상이 서로 연관이 있음을 밝히고, 이러한 현상을 일으키는 원인을 규명하는 분석방법을 제안한다. 제안하는 분석방법은 다음과 같다. 우선, 관계가 존재하지 않는 구성원 간에 정보 파급 현상을 유발할 수 있는 후보원인들을 모두 도출한다. 다음으로, 폭발적인 정보 파급 현상을 보이는 정보의 집단을 데이터 마이닝의 클러스터링 기술을 이용하여 도출한다. 도출된 정보의 집단과 후보 원인간의 상관관계를 데이터 마이닝의 특성분석 방법을 이용하여 구한다. 블로그 월드는 구성원과 그 사이의 관계, 정보 파급 이력에 대한 데이터를 모두 저장하고 있다. 본 논문은 실제 블로그 월드의 데이터를 이용하여 블로그 월드에서 정보의 폭발적 파급을 유발하는 원인들을 규명하고 그 원인들이 가지는 특징을 설명하였다.

Sentiment analysis of online food product review using ensemble technique (앙상블 기법을 활용한 온라인 음식 상품 리뷰 감성 분석)

  • Kim, Han-Min;Park, Kyungbo
    • Journal of Digital Convergence
    • /
    • v.17 no.4
    • /
    • pp.115-122
    • /
    • 2019
  • In the online marketplace, consumers are exposed to various products and freely express opinions. As consumer product reviews have a important effect on the success of online markets and other consumers, online market needs to accurately analyze the consumers' emotions about their products. Text mining, which is one of the data analysis techniques, can analyze the consumer's reviews on the products and efficiently manage the products. Previous studies have analyzed specific domains and less than 20,000 data, despite the different accuracy of the analysis results depending on the data domain and size. Further, there are few studies on additional factors that can improve the accuracy of analysis. This study analyzed 72,530 review data of food product domain that was not mainly covered in previous studies by using ensemble technique. We also examined the influence of summary review on improving accuracy of analysis. As a result of the study, this study found that Boosting ensemble technique has the highest accuracy of analysis. In addition, the summary review contributed to improving accuracy of the analysis.

A Study on the Effects of Online Word-of-Mouth on Game Consumers Based on Sentimental Analysis (감성분석 기반의 게임 소비자 온라인 구전효과 연구)

  • Jung, Keun-Woong;Kim, Jong Uk
    • Journal of Digital Convergence
    • /
    • v.16 no.3
    • /
    • pp.145-156
    • /
    • 2018
  • Unlike the past, when distributors distributed games through retail stores, they are now selling digital content, which is based on online distribution channels. This study analyzes the effects of eWOM (electronic Word of Mouth) on sales volume of game sold on Steam, an online digital content distribution channel. Recently, data mining techniques based on Big Data have been studied. In this study, emotion index of eWOM is derived by emotional analysis which is a text mining technique that can analyze the emotion of each review among factors of eWOM. Emotional analysis utilizes Naive Bayes and SVM classifier and calculates the emotion index through the SVM classifier with high accuracy. Regression analysis is performed on the dependent variable, sales variation, using the emotion index, the number of reviews of each game, the size of eWOM, and the user score of each game, which is a rating of eWOM. Regression analysis revealed that the size of the independent variable eWOM and the emotion index of the eWOM were influential on the dependent variable, sales variation. This study suggests the factors of eWOM that affect the sales volume when Korean game companies enter overseas markets based on steam.

Development of Hybrid Recommender System Using Review Data Mining: Kindle Store Data Analysis Case (리뷰 데이터 마이닝을 이용한 하이브리드 추천시스템 개발: Amazon Kindle Store 데이터 분석사례)

  • Yihua Zhang;Qinglong Li;Ilyoung Choi;Jaekyeong Kim
    • Information Systems Review
    • /
    • v.23 no.1
    • /
    • pp.155-172
    • /
    • 2021
  • With the recent increase in online product purchases, a recommender system that recommends products considering users' preferences has still been studied. The recommender system provides personalized product recommendation services to users. Collaborative Filtering (CF) using user ratings on products is one of the most widely used recommendation algorithms. During CF, the item-based method identifies the user's product by using ratings left on the product purchased by the user and obtains the similarity between the purchased product and the unpurchased product. CF takes a lot of time to calculate the similarity between products. In particular, it takes more time when using text-based big data such as review data of Amazon store. This paper suggests a hybrid recommendation system using a 2-phase methodology and text data mining to calculate the similarity between products easily and quickly. To this end, we collected about 980,000 online consumer ratings and review data from the online commerce store, Amazon Kinder Store. As a result of several experiments, it was confirmed that the suggested hybrid recommendation system reflecting the user's rating and review data has resulted in similar recommendation time, but higher accuracy compared to the CF-based benchmark recommender systems. Therefore, the suggested system is expected to increase the user's satisfaction and increase its sales.

Expansion of Opinion Mining based on Entity Association Network Model (개체연관망 모델에 의한 오피니언마이닝의 확장)

  • Kim, Keun-Hyung
    • The KIPS Transactions:PartD
    • /
    • v.18D no.4
    • /
    • pp.237-244
    • /
    • 2011
  • Opinion Mining summarizes with classifying sensitive opinions of customers in huge online customer reviews for the attributes of products or services by positive and negative opinions. Because the customers represent their interests through subjective opinions as well as objective facts, the existing opinion mining techniques, which can analyze just the sensitive opinions, need to be expanded.. In this paper, We propose the novel entity association network model which expands the existing opinion mining techniques. The entity association model can not only represent positive and negative degree of the sensitive opinions, but also can represent the degree of the associations and relative importances between entities. We designed and implemented the customer reviews analysis system based on the entity association network model. We recognized that the system can represent more abundant information than the existing opinion mining techniques.

Analysis and Evaluation of Frequent Pattern Mining Technique based on Landmark Window (랜드마크 윈도우 기반의 빈발 패턴 마이닝 기법의 분석 및 성능평가)

  • Pyun, Gwangbum;Yun, Unil
    • Journal of Internet Computing and Services
    • /
    • v.15 no.3
    • /
    • pp.101-107
    • /
    • 2014
  • With the development of online service, recent forms of databases have been changed from static database structures to dynamic stream database structures. Previous data mining techniques have been used as tools of decision making such as establishment of marketing strategies and DNA analyses. However, the capability to analyze real-time data more quickly is necessary in the recent interesting areas such as sensor network, robotics, and artificial intelligence. Landmark window-based frequent pattern mining, one of the stream mining approaches, performs mining operations with respect to parts of databases or each transaction of them, instead of all the data. In this paper, we analyze and evaluate the techniques of the well-known landmark window-based frequent pattern mining algorithms, called Lossy counting and hMiner. When Lossy counting mines frequent patterns from a set of new transactions, it performs union operations between the previous and current mining results. hMiner, which is a state-of-the-art algorithm based on the landmark window model, conducts mining operations whenever a new transaction occurs. Since hMiner extracts frequent patterns as soon as a new transaction is entered, we can obtain the latest mining results reflecting real-time information. For this reason, such algorithms are also called online mining approaches. We evaluate and compare the performance of the primitive algorithm, Lossy counting and the latest one, hMiner. As the criteria of our performance analysis, we first consider algorithms' total runtime and average processing time per transaction. In addition, to compare the efficiency of storage structures between them, their maximum memory usage is also evaluated. Lastly, we show how stably the two algorithms conduct their mining works with respect to the databases that feature gradually increasing items. With respect to the evaluation results of mining time and transaction processing, hMiner has higher speed than that of Lossy counting. Since hMiner stores candidate frequent patterns in a hash method, it can directly access candidate frequent patterns. Meanwhile, Lossy counting stores them in a lattice manner; thus, it has to search for multiple nodes in order to access the candidate frequent patterns. On the other hand, hMiner shows worse performance than that of Lossy counting in terms of maximum memory usage. hMiner should have all of the information for candidate frequent patterns to store them to hash's buckets, while Lossy counting stores them, reducing their information by using the lattice method. Since the storage of Lossy counting can share items concurrently included in multiple patterns, its memory usage is more efficient than that of hMiner. However, hMiner presents better efficiency than that of Lossy counting with respect to scalability evaluation due to the following reasons. If the number of items is increased, shared items are decreased in contrast; thereby, Lossy counting's memory efficiency is weakened. Furthermore, if the number of transactions becomes higher, its pruning effect becomes worse. From the experimental results, we can determine that the landmark window-based frequent pattern mining algorithms are suitable for real-time systems although they require a significant amount of memory. Hence, we need to improve their data structures more efficiently in order to utilize them additionally in resource-constrained environments such as WSN(Wireless sensor network).

Sensibility by Weather and e-Commerce Purchase Behavior

  • Hyun-Jin Yeo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.4
    • /
    • pp.177-182
    • /
    • 2024
  • A consumer's decisions are made by affection of product. Affection has types: evaluation, mood, emotion and sensibility that means unconscious changes. Previous researches have clarified weather factors affect to sensibility that means weather factors may have causal effects to consumer's decision making. This research utilize weather information from KMA(Korea Meteorological Administration) and SNS geographical information and text to make weather sensibility model, and clarify the model shows significant change to online shop customer's purchase behavior(purchase frequency) by merging customer's address information and geometric information of the model for apply weather model. As a result, a model utilize daily precipitation, sunshine hours, average ground temperature, and average relative humidity makes significant result to e-commerce purchase behavior frequency.

Lexical and Phrasal Analysis of Online Discourse of Type 2 Diabetes Patients based on Text-Mining (텍스트마이닝 기법을 이용한 제 2형 당뇨환자 온라인 담론의 어휘 및 구문구조 분석)

  • Hwang, Moonl-Hyon;Park, Jungsik
    • Journal of Digital Convergence
    • /
    • v.12 no.6
    • /
    • pp.655-667
    • /
    • 2014
  • This paper has identified five major categories of the T2D patients' concerns based on an online forum where the patients voluntarily verbalized their naturally occurring emotional reactions and concerns related to T2D. We have emphasized the fact that the lexical and phrasal analysis brought to the forefront the prevailing negative reactions and desires for clear information, professional advice, and emotional support. This study used lexical and phrasal analysis based on text-mining tools to estimate the potential of using a large sample of patient conversation of a specific disease posted on the internet for clinical features and patients' emotions. As a result, the study showed that quantitative analysis based on text-mining is a viable method of generalizing the psychological concerns and features of T2D patients.