• Title/Summary/Keyword: 온라인 마이닝

Search Result 242, Processing Time 0.029 seconds

Lexicon of Semantic-Polarity of Korean Adjectives for the Classification of On-line Opinion Documents (온라인 오피니언 문서 분류를 위한 한국어 형용사 의미 극성 사전)

  • Ahn, Ae-Lim;Shim, Seung-Hye;Nam, Jee-Sun
    • Annual Conference on Human and Language Technology
    • /
    • 2010.10a
    • /
    • pp.166-171
    • /
    • 2010
  • 본 논문은 한국어 온라인 리뷰 문서의 오피니언 분류(Opinion Classification)에 있어 그 핵심 키워드가 형용사 (Adjective) 범주라는 점을 고려하여, 한국어 형용사를 <문맥에 의존하지 않는 절대 극성>과, <문맥에 의존하여 극성이 바뀌는 상대극성>으로 대분류한 뒤 그 각각의 의미 극성을 하위 분류하는 작업을 수행하였다. 기존의 연구에서 특징적인 오피니언 어휘 수십개에 의존하여 자동 분류를 시도하고자 하였던 문제점을 극복하기 위해서는 한국어 형용사 전체 범주에 대한 체계적인 극성 분류가 이루어져야 할 필요가 있으며, 여기서 특히 상세히 주목받지 못했던 상대 극성 어휘에 대한 본격적인 의미 분류가 요구된다. 본 연구에서 제시하는 형용사의 극성 분류는 기존의 이론 언어학적 형용사 의미 분류와 달리 온라인 오피니언 문서에서 도메인에 따라 나타나는 특징적 의미 유형을 결정하고, 이를 기준으로 온라인 오피니언 문서의 극성 판별에 효과적으로 적용할 수 있는 사전을 구축하였다는 점에서 의의를 가진다.

  • PDF

The Dynamics of Online word-of-mouth and Marketing Performance : Exploring Mobile Game Application Reviews (온라인 구전과 마케팅 성과의 다이나믹스 연구 : 모바일 게임 앱 리뷰를 중심으로)

  • Kim, In-kiw;Cha, Seong-Soo
    • The Journal of the Korea Contents Association
    • /
    • v.20 no.12
    • /
    • pp.36-48
    • /
    • 2020
  • App market has continuously been growth since its launch. The market revenues will reach about 1,000 billion US dollars in 2019. App is a core service for smartphone. Currently, there are more than 1.5 million mobile apps in App platform calling out for attention. So, if you are looking at developing a successful app, you need to have a solid marketing and distribution strategy. Online word of mouth(eWOM) is one of the most effective, powerful App marketing method. eWOM affect potential consumers' decision making, and this effect can spread rapidly through online social network. Despite the increasing research on word of mouth, only few studies have focused on content analysis. Most of studies focused on the causes and acceptance of eWOM and eWOM performance measurement. This study aims to content analysis of mobile apps review In 2013, Google researchers announced Word2Vec. This method has overcome the weakness of previous studies. This is faster and more accurate than traditional methods. This study found out the relationship between mobile app reviews and checked for reactions by Word2vec.

An Intelligent Recommendation System by Integrating the Attributes of Product and Customer in the Movie Reviews (영화 리뷰의 상품 속성과 고객 속성을 통합한 지능형 추천시스템)

  • Hong, Taeho;Hong, Junwoo;Kim, Eunmi;Kim, Minsu
    • Journal of Intelligence and Information Systems
    • /
    • v.28 no.2
    • /
    • pp.1-18
    • /
    • 2022
  • As digital technology converges into the e-commerce market across industries, online transactions have activated, and the use of online has increased. With the recent spread of infectious diseases such as COVID-19, this market flow is accelerating, and various product information can be provided to customers online. Providing a variety of information provides customers with various opportunities but causes difficulties in decision-making. The recommendation system can help customers to make a decision more effectively. However, the previous research on recommendation systems is limited to only quantitative data and does not reflect detailed factors of products and customers. In this study, we propose an intelligent recommendation system that quantifies the attributes of products and customers by applying text mining techniques to qualitative data based on online reviews and integrates the existing objective indicators of total star rating, sentiment, and emotion. The proposed integrated recommendation model showed superior performance to the overall rating-oriented recommendation model. It expects the new business value to be created through the recommendation result reflecting detailed factors of products and customers.

A Study on the Method for Extracting the Purpose-Specific Customized Information from Online Product Reviews based on Text Mining (텍스트 마이닝 기반의 온라인 상품 리뷰 추출을 통한 목적별 맞춤화 정보 도출 방법론 연구)

  • Kim, Joo Young;Kim, Dong soo
    • The Journal of Society for e-Business Studies
    • /
    • v.21 no.2
    • /
    • pp.151-161
    • /
    • 2016
  • In the era of the Web 2.0, characterized by the openness, sharing and participation, it is easy for internet users to produce and share the data. The amount of the unstructured data which occupies most of the digital world's data has increased exponentially. One of the kinds of the unstructured data called personal online product reviews is necessary for both the company that produces those products and the potential customers who are interested in those products. In order to extract useful information from lots of scattered review data, the process of collecting data, storing, preprocessing, analyzing, and drawing a conclusion is needed. Therefore we introduce the text-mining methodology for applying the natural language process technology to the text format data like product review in order to carry out extracting structured data by using R programming. Also, we introduce the data-mining to derive the purpose-specific customized information from the structured review information drawn by the text-mining.

Performance Improvement of Data Preprocessing for Intersite Web Usage Mining (사이트간 웹 사용 마이닝을 위한 데이터 전처리의 성능 향상)

  • Hyun, Woo-Seok
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10b
    • /
    • pp.357-361
    • /
    • 2006
  • 매일 새롭게 생기는 웹 페이지 수가 수천만 개, 온라인 문서들의 수가 수십억 개에 이르게 되자, 웹 사이트를 설계함에 있어서 웹 서버 로그 파일에 기록된 사용자의 행동을 분석하는 것이 중요한 부분이 되어가고 있다. 분석가들은 전체 웹 사이트에서 사용자 행동의 완전한 개요를 알기 원하기 때문에 고객이 방문했던 모든 다른 웹 서버를 통하여 사용자의 패스(path)를 다시 수집해야만 한다. 본 연구에서는 모든 로그 파일을 연결해서 방문했던 곳을 재구성하는 향상된 데이터 전처리 방법에 의하여 실험을 하여 로그 파일 크기를 감소시키게 되어 데이터 전처리의 성능이 향상되었음을 보였다.

  • PDF

Music Recommendation Using Data Mining (데이터 마이닝을 이용한 음악 추천)

  • Lee, Hye-In;Yun, So-Young;Youn, Sung-Dae
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2018.10a
    • /
    • pp.372-375
    • /
    • 2018
  • 본 논문은 온라인 음원 서비스 이용자들이 겪는 선택의 어려움을 최소화하고, 낭비되는 시간을 줄이기 위한 음악 추천 기법을 제안하고자 한다. 제안하는 기법은 개인정보의 이용 없이 아이템을 추천할 수 있는 아이템 기반 협업필터링 알고리즘을 사용한다. 더 정확한 추천을 위해 음원의 메타데이터를 이용한다. 실험을 통해 제안하는 기법이 메타데이터를 이용하지 않을 때보다 추천 성능이 향상되는 것을 확인하였다.

An Investigation on Expanding Co-occurrence Criteria in Association Rule Mining (온라인 연관관계 분석의 장바구니 기준에 대한 연구)

  • Kim, Mi-Sung;Kim, Nam-Gyu
    • CRM연구
    • /
    • v.4 no.2
    • /
    • pp.19-29
    • /
    • 2011
  • There is a large difference between purchasing patterns in an online shopping mall and in an offline market. This difference may be caused mainly by the difference in accessibility of online and offline markets. It means that an interval between the initial purchasing decision and its realization appears to be relatively short in an online shopping mall, because a customer can make an order immediately. Because of the short interval between a purchasing decision and its realization, an online shopping mall transaction usually contains fewer items than that of an offline market. In an offline market, customers usually keep some items in mind and buy them all at once a few days after deciding to buy them, instead of buying each item individually and immediately. On the contrary, more than 70% of online shopping mall transactions contain only one item. This statistic implies that traditional data mining techniques cannot be directly applied to online market analysis, because hardly any association rules can survive with an acceptable level of Support because of too many Null Transactions. Most market basket analyses on online shopping mall transactions, therefore, have been performed by expanding the co-occurrence criteria of traditional association rule mining. While the traditional co-occurrence criteria defines items purchased in one transaction as concurrently purchased items, the expanded co-occurrence criteria regards items purchased by a customer during some predefined period (e.g., a day) as concurrently purchased items. In studies using expanded co-occurrence criteria, however, the criteria has been defined arbitrarily by researchers without any theoretical grounds or agreement. The lack of clear grounds of adopting a certain co-occurrence criteria degrades the reliability of the analytical results. Moreover, it is hard to derive new meaningful findings by combining the outcomes of previous individual studies. In this paper, we attempt to compare expanded co-occurrence criteria and propose a guideline for selecting an appropriate one. First of all, we compare the accuracy of association rules discovered according to various co-occurrence criteria. By doing this experiment we expect that we can provide a guideline for selecting appropriate co-occurrence criteria that corresponds to the purpose of the analysis. Additionally, we will perform similar experiments with several groups of customers that are segmented by each customer's average duration between orders. By this experiment, we attempt to discover the relationship between the optimal co-occurrence criteria and the customer's average duration between orders. Finally, by a series of experiments, we expect that we can provide basic guidelines for developing customized recommendation systems.

  • PDF

A Study on Continuous Monitoring Reinforcement for Sales Audit Using Process Mining Under Big Data Environment (빅데이터 환경에서 프로세스 마이닝을 이용한 영업감사 상시 모니터링 강화에 대한 연구)

  • Yoo, Young-Seok;Park, Han-Gyu;Back, Seung-Hoon;Hong, Sung-Chan
    • Journal of Internet Computing and Services
    • /
    • v.17 no.6
    • /
    • pp.123-131
    • /
    • 2016
  • Process mining in big data environment utilize a number of data were generated from the business process. It generates lots of knowledge and insights regarding implementation and improvement of the process through the event log of the company's enterprise resource planning (ERP) system. In recent years, various research activities engaged with the audit work of company organizations are trying actively by using the maximum strength of the mining process. However, domestic studies on applicable sales auditing system for the process mining are insufficient under big data environment. Therefore, we propose process-mining methods that can be optimally applied to online and traditional auditing system. In advance, we propose continuous monitoring information system that can early detect and prevent the risk under the big data environment by monitoring risk factors in the organizations of enterprise. The scope of the research of this paper is to design a pre-verification system for risk factor via practical examples in sales auditing. Furthermore, realizations of preventive audit, continuous monitoring for high risk, reduction of fraud, and timely action for violation of rules are enhanced by proposed sales auditing system. According to the simulation results, avoidance of financial risks, reduction of audit period, and improvement of audit quality are represented.

Current Status of Educational Big Data Research (교육 빅데이터 관련 연구 동향)

  • Lee, Eun-young;Park, Do-oung;Choi, In-ong
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2014.07a
    • /
    • pp.175-176
    • /
    • 2014
  • 본고에서는 교육 빅데이터의 개념, 가치, 처리 기술 및 분석 방법 등을 탐색하였다. '온라인과 오프라인 교수 학습 활동의 투입, 과정, 산출을 통해 생산되는 국가, 지역, 학교, 교사, 학생 수준의 자료'로 정의할 수 있는 교육 빅데이터는 Hadoop으로 대표되는 분산 컴퓨팅 기술을 통해 효율적으로 처리할 수 있다. 대규모 교육 자료에서 의미있고 유용한 결과를 도출하기 위해 주로 사용되는 분석 방법에는 교육 데이터 마이닝, 학습 분석학과 시각 자료 분석학이 있다. 교육 데이터 마이닝은 학생과 교사, 학교의 다양한 수준에서 자료를 폭넓게 분석하는 측면이 강한 반면에 학습 분석학은 학생 수준에서의 자료 분석에 더 초점을 맞추는 경향이 있으며, 시각 자료 분석학은 자료에 대한 분석 자체보다는 분석 결과를 효과적으로 표현하는 방식에 초점이 주어져 있다.

  • PDF

An Analytical Approach Using Topic Mining for Improving the Service Quality of Hotels (호텔 산업의 서비스 품질 향상을 위한 토픽 마이닝 기반 분석 방법)

  • Moon, Hyun Sil;Sung, David;Kim, Jae Kyeong
    • Journal of Intelligence and Information Systems
    • /
    • v.25 no.1
    • /
    • pp.21-41
    • /
    • 2019
  • Thanks to the rapid development of information technologies, the data available on Internet have grown rapidly. In this era of big data, many studies have attempted to offer insights and express the effects of data analysis. In the tourism and hospitality industry, many firms and studies in the era of big data have paid attention to online reviews on social media because of their large influence over customers. As tourism is an information-intensive industry, the effect of these information networks on social media platforms is more remarkable compared to any other types of media. However, there are some limitations to the improvements in service quality that can be made based on opinions on social media platforms. Users on social media platforms represent their opinions as text, images, and so on. Raw data sets from these reviews are unstructured. Moreover, these data sets are too big to extract new information and hidden knowledge by human competences. To use them for business intelligence and analytics applications, proper big data techniques like Natural Language Processing and data mining techniques are needed. This study suggests an analytical approach to directly yield insights from these reviews to improve the service quality of hotels. Our proposed approach consists of topic mining to extract topics contained in the reviews and the decision tree modeling to explain the relationship between topics and ratings. Topic mining refers to a method for finding a group of words from a collection of documents that represents a document. Among several topic mining methods, we adopted the Latent Dirichlet Allocation algorithm, which is considered as the most universal algorithm. However, LDA is not enough to find insights that can improve service quality because it cannot find the relationship between topics and ratings. To overcome this limitation, we also use the Classification and Regression Tree method, which is a kind of decision tree technique. Through the CART method, we can find what topics are related to positive or negative ratings of a hotel and visualize the results. Therefore, this study aims to investigate the representation of an analytical approach for the improvement of hotel service quality from unstructured review data sets. Through experiments for four hotels in Hong Kong, we can find the strengths and weaknesses of services for each hotel and suggest improvements to aid in customer satisfaction. Especially from positive reviews, we find what these hotels should maintain for service quality. For example, compared with the other hotels, a hotel has a good location and room condition which are extracted from positive reviews for it. In contrast, we also find what they should modify in their services from negative reviews. For example, a hotel should improve room condition related to soundproof. These results mean that our approach is useful in finding some insights for the service quality of hotels. That is, from the enormous size of review data, our approach can provide practical suggestions for hotel managers to improve their service quality. In the past, studies for improving service quality relied on surveys or interviews of customers. However, these methods are often costly and time consuming and the results may be biased by biased sampling or untrustworthy answers. The proposed approach directly obtains honest feedback from customers' online reviews and draws some insights through a type of big data analysis. So it will be a more useful tool to overcome the limitations of surveys or interviews. Moreover, our approach easily obtains the service quality information of other hotels or services in the tourism industry because it needs only open online reviews and ratings as input data. Furthermore, the performance of our approach will be better if other structured and unstructured data sources are added.