• Title/Summary/Keyword: 온라인 마이닝

Search Result 240, Processing Time 0.026 seconds

Box Office Hit Prediction Using Data mining and Text mining (데이터마이닝과 텍스트마이닝을 활용한 영화 흥행 예측)

  • Jo, Hyo-jung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2021.05a
    • /
    • pp.316-318
    • /
    • 2021
  • 영화 수익에 있어 영화의 흥행 여부는 중요한 영향을 끼친다. 영화 흥행 요인은 영화 산업의 규모가 커지면서 많은 제작사들 및 투자자들이 고려해야 하는 사항이 되었다. 따라서 영화의 흥행을 예측하기 위한 많은 모델이 연구되었다. 본 연구의 목적은 선행연구에서 흥행에 유의미한 영향을 끼친다고 밝혀진 스크린 수, 감독명, 제작사명 등의 내재적인 속성과 더불어 온라인 구전 변수를 사용하여 영화 흥행 예측 모델을 만드는 것이다. 이때 기사 수, 블로그 수와 같이 온라인 구전의 크기를 나타내는 변수들을 사용하는 대신 개봉 후 첫 주간의 관람객 리뷰를 텍스트마이닝을 이용하여 전체 리뷰 중 긍정 리뷰의 비율에 따라 점수를 매긴 후 독립변수로 사용한다. 그 후, 데이터 마이닝 기법을 활용하여 만든 모델에 앞서 언급한 독립변수를 입력 값으로 사용하여 영화의 흥행을 예측한다. 최종적으로 의사결정트리와 로지스틱회귀를 수행한 결과 영화 흥행에 영향을 주는 독립변수를 찾고 모델의 성능을 평가하였다. 로지스틱회귀의 결과 관객 수, 평점이 영화의 흥행에 특히 유의한 영향을 끼치는 변수로 선정되었고 리뷰 역시 유의한 변수로 선정되었다. 이때 만들어진 모델은 약 90%의 높은 수준의 정확도를 보여주었다. 의사결정트리의 결과 관객 수가 가장 중요한 변수로 선정되었다.

Improvement of recommendation system using attribute-based opinion mining of online customer reviews

  • Misun Lee;Hyunchul Ahn
    • Journal of the Korea Society of Computer and Information
    • /
    • v.28 no.12
    • /
    • pp.259-266
    • /
    • 2023
  • In this paper, we propose an algorithm that can improve the accuracy performance of collaborative filtering using attribute-based opinion mining (ABOM). For the experiment, a total of 1,227 online consumer review data about smartphone apps from domestic smartphone users were used for analysis. After morpheme analysis using the KKMA (Kkokkoma) analyzer and emotional word analysis using KOSAC, attribute extraction is performed using LDA topic modeling, and the topic modeling results for each weighted review are used to add up the ratings of collaborative filtering and the sentiment score. MAE, MAPE, and RMSE, which are statistical model performance evaluations that calculate the average accuracy error, were used. Through experiments, we predicted the accuracy of online customers' app ratings (APP_Score) by combining traditional collaborative filtering among the recommendation algorithms and the attribute-based opinion mining (ABOM) technique, which combines LDA attribute extraction and sentiment analysis. As a result of the analysis, it was found that the prediction accuracy of ratings using attribute-based opinion mining CF was better than that of ratings implementing traditional collaborative filtering.

Design and Implementation of Opinion Mining System based on Association Model (연관성 모델에 기반한 오피년마이닝 시스템의 설계 및 구현)

  • Kim, Keun-Hyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.15 no.1
    • /
    • pp.133-140
    • /
    • 2011
  • For both customers and companies, it is very important to analyze online customer reviews, which consist of small documents that include opinions or experiences about products or services, because the customers can get good informations and the companies can establish good marketing strategies. In this paper, we propose the association model for the opinion mining which can analyze customer opinions posted on web. The association model is to modify the association rules mining model in data mining in order to apply efficiently and effectively the association mining techniques to the opinion mining. We designed and implemented the opinion mining systems based on the modified association model and the grouping idea which would enable it to generate significant rules more.

A Design of Satisfaction Analysis System For Content Using Opinion Mining of Online Review Data (온라인 리뷰 데이터의 오피니언마이닝을 통한 콘텐츠 만족도 분석 시스템 설계)

  • Kim, MoonJi;Song, EunJeong;Kim, YoonHee
    • Journal of Internet Computing and Services
    • /
    • v.17 no.3
    • /
    • pp.107-113
    • /
    • 2016
  • Following the recent advancement in the use of social networks, a vast amount of different online reviews is created. These variable online reviews which provide feedback data of contents' are being used as sources of valuable information to both contents' users and providers. With the increasing importance of online reviews, studies on opinion mining which analyzes online reviews to extract opinions or evaluations, attitudes and emotions of the writer have been on the increase. However, previous sentiment analysis techniques of opinion-mining focus only on the classification of reviews into positive or negative classes but does not include detailed information analysis of the user's satisfaction or sentiment grounds. Also, previous designs of the sentiment analysis technique only applied to one content domain that is, either product or movie, and could not be applied to other contents from a different domain. This paper suggests a sentiment analysis technique that can analyze detailed satisfaction of online reviews and extract detailed information of the satisfaction level. The proposed technique can analyze not only one domain of contents but also a variety of contents that are not from the same domain. In addition, we design a system based on Hadoop to process vast amounts of data quickly and efficiently. Through our proposed system, both users and contents' providers will be able to receive feedback information more clearly and in detail. Consequently, potential users who will use the content can make effective decisions and contents' providers can quickly apply the users' responses when developing marketing strategy as opposed to the old methods of using surveys. Moreover, the system is expected to be used practically in various fields that require user comments.

Web Mining for successful e-Business based on Artificial Intelligence Techniques (성공적인 e-Business를 위한 인공지능 기법 기반 웹 마이닝)

  • 이장희;유성진;박상찬
    • Journal of Intelligence and Information Systems
    • /
    • v.8 no.2
    • /
    • pp.159-175
    • /
    • 2002
  • Web mining is an emerging science of applying modem data mining technologies to the problem of extracting valid, comprehensible, and actionable information from large databases of web in e-Business environment and of using it to make crucial e-Business decisions. In this paper, we present the noble framework of data visualization system based on web mining for analyzing the characteristics of on-line customers in e-Business. We also propose the framework of forecasting system for providing the forecasting information of sales/purchase through the use of web mining based on artificial intelligence techniques such as back-propagation network, memory-based reasoning, and self-organizing map.

  • PDF

A Study on Consumer perception changes of online education before and after COVID-19 using text mining (텍스트 마이닝을 활용한 온라인 교육에 대한 소비자 인식 변화 분석: COVID-19 전후를 중심으로)

  • Sohn, Minsung;Im, Meeja;Park, Kyunghwan
    • Journal of Digital Convergence
    • /
    • v.19 no.1
    • /
    • pp.29-43
    • /
    • 2021
  • Coinciding with the advent of COVID-19, online education is on the rise both domestically and globally, and has become an absolutely necessary and irreplaceable form of education. It is a very curious question what the perception of people about the suddenly growing form of education is, and how it has changed. This study investigated changes in consumers' perception of online education using big data. To this end, we divided the time into four stages: before COVID-19 (November to December 2019), after the triggering of COVID-19 (January to February 2020), right after the online classes started (March to April 2020), after experiencing some online education (May to June 2020). Then we conducted text mining, namely, keyword frequency analysis, network analysis, word cloud analysis, and sentiment analysis were performed. The implications derived as a result of the analysis can help education policy makers and educators working in the field to improve online education quality and establish its future directions.

Study on Participants' Perceptions of Sharing Economy Policies: A Text Ming Approach to Online Community Posts (공유경제 참여자의 비즈니스 등록정책에 대한 인식과 심적기재: 온라인 발화에 대한 텍스트마이닝)

  • Park, Soo Kyung
    • Journal of Digital Convergence
    • /
    • v.20 no.2
    • /
    • pp.47-56
    • /
    • 2022
  • With the advent of online platforms, individuals have been able to trade small resources, such as a room, in the market. However, as there is no clear regulation on these economic activities, various side effects have emerged. Accordingly, the government reestablished related policies to resolve the unintended consequences of these economic activities. However, the policy has not been implemented yet, and many participants do not comply with the policy. Therefore, this study intends to examine their perceptions in detail. For this purpose, a text mining technique was applied. Posts and comments from major online communities were collected. By applying the topic modeling technique, 5 topics were derived. Compliance with the government's policy is a voluntary decision. Therefore, it is necessary to carry out an in-depth understanding of the policy target. Therefore, based on this study, it is expected that in the future, methods to induce them to conform to policy can be discussed in detail.

A Comparison of Text Mining Algorithms for Product Review Analysis (상품 리뷰 분석을 위한 텍스트 마이닝 기법의 비교)

  • Lee, Ji-Woong;Jin, Young-Taek
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2019.10a
    • /
    • pp.882-884
    • /
    • 2019
  • 오늘날 정보화 시대에서는 온라인 쇼핑의 상품리뷰 등 대용량의 텍스트 문서가 존재하며 제품에 대한 정서적인 의견뿐만 아니라 제품 선호도 및 상품 비교와 같은 유용한 정보를 제공한다. 본 논문에서는 사용자가 작성한 상품 리뷰로부터 제품의 특성을 비교하는 비교의견을 추출하기 위해 적용한 다양한 텍스트 마이닝 기법의 비교 결과를 제시한다.

A Study on the Purchasing Factors of Color Cosmetics Using Big Data: Focusing on Topic Modeling and Concor Analysis (빅데이터를 활용한 색조화장품의 구매 요인에 관한 연구: 토픽모델링과 Concor 분석을 중심으로)

  • Eun-Hee Lee;Seung- Hee Bae
    • Journal of the Korean Applied Science and Technology
    • /
    • v.40 no.4
    • /
    • pp.724-732
    • /
    • 2023
  • In this study, we tried to analyze the characteristics of color cosmetics information search and the major information of interest in the color cosmetics market after COVID-19 shown in the text mining analysis results by collecting data on online interest information of consumers in the color cosmetics market after COVID-19. In the empirical analysis, text mining was performed on all documents such as news, blogs, cafes, and web pages, including the word "color cosmetics". As a result of the analysis, online information searches for color cosmetics after COVID-19 were mainly focused on purchase information, information on skin and mask-related makeup methods, and major topics such as interest brands and event information. As a result, post-COVID-19 color cosmetics buyers will become more sensitive to purchase information such as product value, safety, price benefits, and store information through active online information search, so a response strategy is required.

Study on Designing and Implementing Online Customer Analysis System based on Relational and Multi-dimensional Model (관계형 다차원모델에 기반한 온라인 고객리뷰 분석시스템의 설계 및 구현)

  • Kim, Keun-Hyung;Song, Wang-Chul
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.4
    • /
    • pp.76-85
    • /
    • 2012
  • Through opinion mining, we can analyze the degree of positive or negative sentiments that customers feel about important entities or attributes in online customer reviews. But, the limit of the opinion mining techniques is to provide only simple functions in analyzing the reviews. In this paper, we proposed novel techniques that can analyze the online customer reviews multi-dimensionally. The novel technique is to modify the existing OLAP techniques so that they can be applied to text data. The novel technique, that is, multi-dimensional analytic model consists of noun, adjective and document axes which are converted into four relational tables in relational database. The multi-dimensional analysis model would be new framework which can converge the existing opinion mining, information summarization and clustering algorithms. In this paper, we implemented the multi-dimensional analysis model and algorithms. we recognized that the system would enable us to analyze the online customer reviews more complexly.