• 제목/요약/키워드: 온라인 마이닝

Search Result 242, Processing Time 0.022 seconds

An Exploratory Study on Key Attributes of Specialty Coffee by Online Big Data Analysis (온라인 빅 데이터 분석을 활용한 스페셜티 커피 속성에 대한 탐색적 연구)

  • Lim, Miri;Wun, Daiyeol;Ryu, Gihwan
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.3
    • /
    • pp.275-282
    • /
    • 2020
  • Social interest on high-quality specialty coffee is increased due to customers' growing experience upon coffee and recent change of coffee culture, which is taking one step further from putting emphasis on not just price and quality but also psychological satisfaction. As a culture of drinking coffee and giving much value on its taste and flavor, a number of customers increasingly demand coffee which is probable to suit one's taste. Likewise, the number of specialty coffee shops is increasing with growing qualities of their coffee. Therefore, the purpose of this study is to analyze the main attributes of specialty coffee and to build a marketing system for specialty coffee shops. The text mining on domestic web portal sites by online big-data analysis is used to extract components of properties of specialty coffee and analyze the degree of how the elements affect the properties. According to the result of the study, words related to coffee taste, coffee beans and baristas were found to play a central role in the properties of specialty coffee.

On-Line Mining using Association Rules and Sequential Patterns in Electronic Commerce (전자상거래에서 연관규칙과 순차패턴을 이용한 온라인 마이닝)

  • 김성학
    • Journal of the Korea Computer Industry Society
    • /
    • v.2 no.7
    • /
    • pp.945-952
    • /
    • 2001
  • In consequence of expansion of internet users, electronic commerce is becoming a new prototype for marketing and sales, arid most of electronic commerce sites or internet shopping malls provide a rich source of information and convenient user interfaces about the organizations customers to maintain their patrons. One of the convenient interfaces for users is service to recommend products. To do this, they must exploit methods to extract and analysis specific patterns from purchasing information, behavior and market basket about customers. The methods are association rules and sequential patterns, which are widely used to extract correlation among products, and in most of on-line electronic commerce sites are executed with users information and purchased history by category-oriented. But these can't represent the diverse correlation among products and also hardly reflect users' buying patterns precisely, since the results are simple set of relations for single purchased pattern. In this paper, we propose an efficient mining technique, which allows for multiple purchased patterns that are category-independent and have relationship among items in the linked structure of single pattern items.

  • PDF

Analysis of Changes in Restaurant Attributes According to the Spread of Infectious Diseases: Application of Text Mining Techniques (감염병 확산에 따른 레스토랑 선택속성 변화 분석: 텍스트마이닝 기법 적용)

  • Joonil Yoo;Eunji Lee;Chulmo Koo
    • Information Systems Review
    • /
    • v.25 no.4
    • /
    • pp.89-112
    • /
    • 2023
  • In March 2020, as it was declared a COVID-19 pandemic, various quarantine measures were taken. Accordingly, many changes have occurred in the tourism and hospitality industries. In particular, quarantine guidelines, such as the introduction of non-face-to-face services and social distancing, were implemented in the restaurant industry. For decades, research on restaurant attributes has emphasized the importance of three attributes: atmosphere, service quality, and food quality. Nevertheless, to the best of our knowledge, research on restaurant attributes considering the COVID-19 situation is insufficient. To respond to this call, this study attempted an exploratory approach to classify new restaurant attributes based on understanding environmental changes. This study considered 31,115 online reviews registered in Naverplace as an analysis unit, with 475 general restaurants located in Euljiro, Seoul. Further, we attempted to classify restaurant attributes by clustering words within online reviews through TF-IDF and LDA topic modeling techniques. As a result of the analysis, the factors of "prevention of infectious diseases" were derived as new attributes of restaurants in the context of COVID-19 situations, along with the atmosphere, service quality, and food quality. This study is of academic significance by expanding the literature of existing restaurant attributes in that it categorized the three attributes presented by existing restaurant attributes and further presented new attributes. Moreover, the analysis results have led to the formulation of practical recommendations, considering both the operational aspects of restaurants and policy implications.

A Clustering Algorithm for Sequence Data Using Rough Set Theory (러프 셋 이론을 이용한 시퀀스 데이터의 클러스터링 알고리즘)

  • Oh, Seung-Joon;Park, Chan-Woong
    • Journal of the Korea Society of Computer and Information
    • /
    • v.13 no.2
    • /
    • pp.113-119
    • /
    • 2008
  • The World Wide Web is a dynamic collection of pages that includes a huge number of hyperlinks and huge volumes of usage informations. The resulting growth in online information combined with the almost unstructured web data necessitates the development of powerful web data mining tools. Recently, a number of approaches have been developed for dealing with specific aspects of web usage mining for the purpose of automatically discovering user profiles. We analyze sequence data, such as web-logs, protein sequences, and retail transactions. In our approach, we propose the clustering algorithm for sequence data using rough set theory. We present a simple example and experimental results using a splice dataset and synthetic datasets.

  • PDF

Technology Mining and Sentiment Analysis on Hydrogen Fuel Cell Using National R&D and Social Data (국가R&D와 소셜 데이터를 활용한 수소연료전지 기술마이닝과 감성분석)

  • Lee, Byeong-Hee;Choi, Jung-Woo;Kim, Tae-Hyun
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.11a
    • /
    • pp.341-343
    • /
    • 2022
  • 온실가스 배출 문제가 세계적인 현안으로 부각되면서 수소를 에너지원으로 사용하는 수소경제가 주목받고 있다. 수소연료전지는 수소경제의 구성요소 중 하나로, 수소를 활용해 열과 전기를 생산하며 에너지 변환 효율이 높이는데 장점이 있다. 본 연구는 세계적인 온라인 커뮤니티인 레딧(Reddit)에서 수집한 수소연료전지와 관련된 소셜 데이터를 텍스트마이닝과 감성분석 기법으로 분석하였다. 분석 결과 9,211건의 댓글을 LDA(Latent Dirichlet Allocation)을 이용해 4개의 토픽 그룹으로 분류할 수 있었다. 이 중 수소연료전지와 관련이 높은 그룹을 선정해 STM(Structural Topic Model) 분석으로 10개 토픽을 추출하였고, 기후 환경, 수소 산업, 수소 차와 관련 있는 토픽 3개를 발견할 수 있었다. 이 연구 결과를 통해 수소연료전지의 세계적으로 실제적인 내용을 빠르고 효과적으로 파악하여 수소연료전지에 대한 예측하고, 우리나라의 수소연료전지 관련 국가R&D의 정책적 방향을 제시하고자 한다.

Keywords Analysis of Clothing Materials in Consumer Reviews Using Big Data Text Mining (빅데이터 텍스트 마이닝을 활용한 소비자 리뷰에서의 의류 소재 키워드 분석)

  • Gaeun Kang;Jiwon Park;Shinjung Yoo
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.48 no.4
    • /
    • pp.729-743
    • /
    • 2024
  • This research explores consumer preferences for materials in different clothing product categories, using web-crawling and text mining techniques. Specifically, the study focuses on the material-related terms found in consumer reviews across three distinct product categories: functional clothing, formal shirts, and knit sweaters. Top-selling products within each category were identified on the Naver Shopping website based on the volume of reviews, and the four most-reviewed products were selected. Six hundred reviews per product were analyzed using the Textom big-data analysis software to determine the frequency of material-related mentions and word associations. The analysis utilized two comparative metrics: product category and usage duration. Our findings reveal notable variations in the material preferences mentioned by consumers across different product categories. The study suggests a need to re-evaluate existing standardized review criteria to better reflect consumer interests specific to each product category. Additionally, an increase in material-related terms in reviews over one month indicates the potential importance of extending the duration of product reviews to enhance the accuracy of information that reflects longer-term consumer experiences with material quality.

Sentiment Analysis and Star Rating Prediction Based on Big Data Analysis of Online Reviews of Foreign Tourists Visiting Korea (방한 관광객의 온라인 리뷰에 대한 빅데이터 분석 기반의 감성분석 및 평점 예측모형)

  • Hong, Taeho
    • Knowledge Management Research
    • /
    • v.23 no.1
    • /
    • pp.187-201
    • /
    • 2022
  • Online reviews written by tourists provide important information for the management and operation of the tourism industry. The star rating of online reviews is a simple quantitative evaluation of a product or service, but it is difficult to reflect the sincere attitude of tourists. There is also an issue; the star rating and review content are not matched. In this study, a star rating prediction model based on online review content was proposed to solve the discrepancy problem. We compared the differences in star ratings and sentiment by continent through sentiment analysis on tourist attractions and hotels written by foreign tourists who visited Korea. Variables were selected through TF-IDF vectorization and sentiment analysis results. Logit, artificial neural network, and SVM(Support Vector Machine) were used for the classification model, and artificial neural network and SVR(Support Vector regression) were applied for the rating prediction model. The online review rating prediction model proposed in this study could solve inconsistency problems and also could be applied even if when there is no star rating.

A Text Mining Analysis of Attributes for Satisfaction and Effect of Consumer Ratings to Korea and China Duty Free Stores - Focusing on Chinese Tourists - (텍스트 마이닝을 통한 한국과 중국 시내면세점 만족 속성과 소비자 평점에 미치는 영향 분석 -중국인 관광객을 중심으로)

  • Yang, DaSom;Kim, Jong Uk
    • Journal of Digital Convergence
    • /
    • v.18 no.8
    • /
    • pp.1-9
    • /
    • 2020
  • This study aims to find new attributes by analyzing Korea and China duty free store online reviews and examine the influence of these attributes on star ratings(satisfaction)of duty free store. For study, we used Dazhong Dianping that largest online review site in China. Using R, we analyzed 5,659 reviews of Korea duty free store and 4,051 reviews of China duty free store. According to the analysis, Sale, Food and Membership attributes had a positive effect on star rating of Korea duty free store. Sale, Product, Airport, Food and Membership had a positive effect on star rating of China duty free store. This study has identified new factors such as food that showed the importance of providing space of restaurants while shopping at duty free store. This study has contributed to the existing literature by finding new attribute such as food. Practically, this finding will help to duty free industry workers better understand the impact of providing space of restaurants on duty free store.

Formulating Strategies from Consumer Opinion Analysis on AI Kids Phone using Text Mining (AI 키즈폰의 소비자리뷰 분석을 통한 제품개선 전략에 대한 연구)

  • Kim, Dohun;Cha, Kyungjin
    • The Journal of Society for e-Business Studies
    • /
    • v.24 no.2
    • /
    • pp.71-89
    • /
    • 2019
  • In order to come up with satisfying product and improvement, firms use traditional marketing research methods to obtain consumers' opinions and further try to reflect them. Recently, gathering data from consumer communication platforms like internet and SNS has become popular methods. Meanwhile, with the development of information technology, mobile companies are launching new digital products for children to protect them from harmful content and provide them with necessary functions and information. Among these digital products, Kids Phone, which is a wearable device with safe functions that enable parents to learn childern's location. Kids phone is relatively cheaper and simpler than smartphone but it is noted that there are several problems such as some useless functions and frequent breakdowns. This study analyzes the reviews of Kids phones from domestic mobile companies, identifies the characteristics, strengths and weaknesses of the products, proposes improvement methods strategies for devices and services through SNS consumer analysis. In order to do that customer review data from online shopping malls was gathered and was further analyzed through text mining methods such as TF/IDF, Sentiment Analysis, and network analysis. Customer review data was gathered through crawling Online shopping Mall and Naver Blog/$Caf\acute{e}$. Data analysis and visualization was done using 'R', 'Textom', and 'Python'. Such analysis allowed us to figure out main issues and recent trends regarding kids phones and to suggest possible service improvement strategies based on sentiment analysis.

A Study on the Perception and Experience of Daejeon Public Library Users Using Text Mining: Focusing on SNS and Online News Articles (텍스트마이닝을 활용한 대전시 공공도서관 이용자의 인식과 경험 연구 - SNS와 온라인 뉴스 기사를 중심으로 -)

  • Jiwon Choi;Seung-Jin Kwak
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.58 no.2
    • /
    • pp.363-384
    • /
    • 2024
  • This study was conducted to examine the user's experiences with the public library in Daejeon using big data analysis, focusing on the text mining technique. To know this, first, the overall evaluation and perception of users about the public library in Daejeon were explored by collecting data on social media. Second, through analysis using online news articles, the pending issues that are being discussed socially were identified. As a result of the analysis, the proportion of users with children was first high. Next, it was found that topics through LDA analysis appeared in four categories: 'cultural event/program', 'data use', 'physical environment and facilities', and 'library service'. Finally, it was confirmed that keywords for the additional construction of libraries and complex cultural spaces and the establishment of a library cooperation system appeared at the core in the news article data. Based on this, it was proposed to build a library in consideration of regional balance and to create a social parenting community network through business agreements with childcare and childcare institutions. This will contribute to identifying the policy and social trends of public libraries in Daejeon and implementing data-based public library operations that reflect local community demands.