• Title/Summary/Keyword: Review filtering

Improving Accuracy of Noise Review Filtering for Places with Insufficient Training Data

  • Hyeon Gyu Kim
    • Journal of the Korea Society of Computer and Information / v.28 no.7 / pp.19-27 / 2023
  • When social reviews are collected, the search results can include many noise reviews that are irrelevant to the given search keyword. Machine learning can be used to filter out such reviews. However, if the target place has too few reviews, filtering accuracy can degrade because of the lack of training data. To resolve this issue, we propose a supervised learning method that improves the accuracy of noise review filtering for places with insufficient reviews. In the proposed method, training is performed not per individual place but per group of places with similar characteristics. The resulting classifier can be used to filter noise reviews for any place belonging to the group, which resolves the problem of insufficient training data. To verify the proposed method, a noise review filtering model was implemented using LSTM and BERT, and its filtering accuracy was measured in experiments on real data collected online. The experimental results show that the proposed method achieved 92.4% accuracy on average, and 87.5% accuracy for places with fewer than 100 reviews.
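The group-level training idea in this abstract can be illustrated with a minimal sketch. The abstract's model uses LSTM and BERT; the sketch below swaps in a tiny bag-of-words naive Bayes classifier purely to keep the example self-contained. The place names, labels, and reviews are hypothetical, and the grouping of places is assumed to be given.

```python
import math
from collections import Counter, defaultdict

def train_group_classifier(labeled_reviews):
    """Train on (text, label) pairs pooled from every place in one group."""
    word_counts = defaultdict(Counter)   # label -> word frequencies
    label_counts = Counter()             # label -> number of reviews
    for text, label in labeled_reviews:
        label_counts[label] += 1
        word_counts[label].update(text.lower().split())
    vocab = {w for counts in word_counts.values() for w in counts}
    return word_counts, label_counts, vocab

def classify(model, text):
    """Return the most likely label using add-one smoothed naive Bayes."""
    word_counts, label_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label, n in label_counts.items():
        score = math.log(n / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for word in text.lower().split():
            score += math.log((word_counts[label][word] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical group of similar places; cafe_b alone has too few reviews.
reviews_by_place = {
    "cafe_a": [
        ("great coffee and cozy seats", "relevant"),
        ("latte was smooth", "relevant"),
        ("selling used phone cheap", "noise"),
        ("click link for loan offer", "noise"),
    ],
    "cafe_b": [("nice espresso", "relevant")],
}

# Pool the whole group instead of training one model per place.
pooled = [pair for reviews in reviews_by_place.values() for pair in reviews]
group_model = train_group_classifier(pooled)

# The group model can now filter reviews for cafe_b as well.
label_for_spam = classify(group_model, "cheap loan offer")
label_for_review = classify(group_model, "smooth espresso coffee")
```

The design point is only the pooling step: a place with a single labeled review still gets a usable classifier because it shares one with the rest of its group.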

A Study on the Point-Mass Filter for Nonlinear State-Space Models (비선형 상태공간 모델을 위한 Point-Mass Filter 연구)

  • Yeongkwon Choe
    • Journal of Industrial Technology / v.43 no.1 / pp.57-62 / 2023
  • In this review, we introduce the nonparametric Bayesian filtering algorithm known as the point-mass filter (PMF) and discuss recent related studies. PMF realizes Bayesian filtering by placing a deterministic grid on the state space and calculating the probability density at each grid point. Owing to its uniform sampling, PMF is known for its robustness and high accuracy compared with other nonparametric Bayesian filtering algorithms. However, a drawback of PMF is its inherently high computational complexity in the prediction phase. This review explains the principles of the PMF algorithm and the reasons for its high computational complexity, and summarizes recent research efforts to overcome this challenge. We hope that it encourages consideration of PMF for a wider range of systems.
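A minimal one-dimensional sketch may make the grid idea concrete. It assumes a random-walk state model with Gaussian process noise (variance `q`) and a direct Gaussian measurement (variance `r`); the model, the grid bounds, and the measurement values are illustrative assumptions, not taken from the review. The prediction step sums over every pair of grid points, which is where the O(N²) cost mentioned in the abstract comes from.

```python
import math

def gaussian(x, mean, var):
    """Gaussian probability density evaluated at x."""
    return math.exp(-0.5 * (x - mean) ** 2 / var) / math.sqrt(2.0 * math.pi * var)

def make_grid(lo, hi, n):
    """Deterministic, uniformly spaced grid over the state space."""
    step = (hi - lo) / (n - 1)
    return [lo + i * step for i in range(n)], step

def normalize(density, step):
    """Rescale so the density integrates to 1 over the grid."""
    total = sum(density) * step
    return [d / total for d in density]

def predict(grid, density, step, q):
    """Prediction step: convolve the density with the transition kernel.

    Every grid point is combined with every other grid point, so the
    cost grows as O(N^2) in the number of grid points.
    """
    predicted = [
        sum(gaussian(xi, xj, q) * pj for xj, pj in zip(grid, density)) * step
        for xi in grid
    ]
    return normalize(predicted, step)

def update(grid, density, step, z, r):
    """Measurement step: weight each grid point by the likelihood of z."""
    return normalize([gaussian(z, x, r) * p for x, p in zip(grid, density)], step)

def posterior_mean(grid, density, step):
    return sum(x * p for x, p in zip(grid, density)) * step

grid, step = make_grid(-10.0, 10.0, 201)
density = normalize([1.0] * len(grid), step)   # flat prior

for z in [1.2, 0.9, 1.1, 1.0]:                 # illustrative measurements
    density = predict(grid, density, step, q=0.1)
    density = update(grid, density, step, z, r=0.5)

estimate = posterior_mean(grid, density, step)
```

With measurements clustered around 1.0, the estimate settles near 1.0. In higher dimensions the grid size N grows quickly, which makes the quadratic prediction step the dominant cost.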

Improvement of recommendation system using attribute-based opinion mining of online customer reviews

  • Misun Lee;Hyunchul Ahn
    • Journal of the Korea Society of Computer and Information / v.28 no.12 / pp.259-266 / 2023
  • In this paper, we propose an algorithm that improves the accuracy of collaborative filtering (CF) using attribute-based opinion mining (ABOM). The experiment used a total of 1,227 online consumer reviews of smartphone apps written by domestic smartphone users. After morpheme analysis with the KKMA (Kkokkoma) analyzer and sentiment word analysis with KOSAC, attributes are extracted using LDA topic modeling, and the topic modeling results for each weighted review are used to add together the collaborative filtering ratings and the sentiment score. Prediction error was evaluated with MAE, MAPE, and RMSE. In the experiments, we predicted online customers' app ratings (APP_Score) by combining traditional collaborative filtering with the ABOM technique, which itself combines LDA attribute extraction and sentiment analysis. The analysis showed that ratings predicted by CF with attribute-based opinion mining were more accurate than those predicted by traditional collaborative filtering.
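For reference, the three error measures named in this abstract can be sketched as follows, using their standard definitions. The rating values are made up for illustration and are not data from the paper.

```python
import math

def mae(actual, predicted):
    """Mean absolute error."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)

def mape(actual, predicted):
    """Mean absolute percentage error (actual values must be non-zero)."""
    return 100.0 * sum(abs((a - p) / a) for a, p in zip(actual, predicted)) / len(actual)

def rmse(actual, predicted):
    """Root mean square error; penalizes large errors more than MAE."""
    return math.sqrt(sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual))

# Illustrative app ratings on a 1-5 scale (hypothetical values).
actual = [4.0, 2.0, 5.0, 1.0]
predicted = [3.0, 2.0, 4.0, 2.0]

print(mae(actual, predicted))    # 0.75
print(mape(actual, predicted))   # 36.25 (up to floating-point rounding)
print(rmse(actual, predicted))   # about 0.866
```

Lower values are better for all three, so a model "improves" on a baseline when its MAE, MAPE, and RMSE are smaller.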

Social Big Data Analysis for Franchise Stores

  • Kim, Hyeon Gyu
    • Journal of the Korea Society of Computer and Information / v.26 no.8 / pp.39-46 / 2021
  • When social big data analysis is conducted for franchise stores, reviews of multiple branches of a franchise can be collected together, which can significantly distort the analysis results. To improve accuracy, reviews of branches that are not subject to the analysis must be filtered out properly. This paper presents a method for social big data analysis that reflects the characteristics of franchise stores. The proposed method consists of search key configuration and review filtering. For the former, open data provided by the Small Business Promotion Agency is used to extract region names so that reviews can be collected more accurately. For the latter, open search APIs provided by Naver or Kakao are used to obtain franchise branch information, which is then used to filter out reviews of branches that are not subject to the analysis. To verify the performance of the proposed method, experiments were conducted on real social reviews collected online; the results showed that the accuracy of the proposed review filtering was 93.6% on average.

Usage of Filtering-facepiece Masks for Healthcare Workers and Importance of Fit Testing (보건의료종사자의 안면부여과식 마스크의 사용과 밀착도검사의 중요성)

  • Han, Don-Hee
    • Journal of Korean Society of Occupational and Environmental Hygiene / v.25 no.3 / pp.245-253 / 2015
  • Objectives: One aim of this study is to compare filtering facepiece masks for healthcare workers in Korea and other countries. The other is to emphasize the importance of fit testing for these masks through an analysis of previous research. Materials: An extensive literature review was performed by searching a number of websites and existing studies. Results: KF94 and KF99 masks certified by the Korean CDC are suitable as filtering facepiece masks for healthcare workers. The standards for these respirators are similar to FFP2 and FFP3 of EN 143 and 149. Performance, such as filtering efficiency, is almost the same between KF94 and N95. Fit testing of respirators for healthcare workers was found to be important for reducing infection risk. Conclusions: KF94 rather than N95 should be emphasized as the filtering facepiece mask for healthcare workers. Even though Korea has no fit testing regulations, implementing fit testing in healthcare settings is strongly recommended to decrease infection risk.

Are Particulate Filtering Respirators Available in Korea Efficient for Nanoparticles? (<종설>국내 시판 방진마스크는 나노입자에 적합한가?)

  • Han, Don-Hee
    • Journal of Korean Society of Occupational and Environmental Hygiene / v.21 no.1 / pp.62-71 / 2011
  • There is widespread concern about whether particulate filtering respirators (PFRs) available in Korea are efficient for nanoparticles. The purpose of this review was to analyse the research literature and recommend PFRs suitable for protection against nanoparticles. In all studies, respirators containing electret filter media (N95, P100, FFP2, and FFP3) consistently had a most-penetrating particle size (MPPS) below 100 nm; particle penetration at the MPPS varied widely, but the respirators complied with NIOSH or EN certification criteria. After the electric charges were removed, the MPPS of electret filtering facepiece respirators (FFRs) shifted from the 30-60 nm range to the 200-300 nm range, and the FFRs exceeded the penetration limits of the criteria. Korean special class and first class FFRs (equivalent to FFP3 and FFP2, respectively) would be efficient for nanoparticles unless their electric charges are removed. It is difficult to evaluate whether mechanical PFRs are efficient for nanoparticles because of the lack of related material.

Optimal Fingerprint Data Filtering Model for Location Based Services (위치기반 서비스 강화를 위한 최적 데이터 필터링 기법 및 측위 시스템 적용 모델)

  • Jung, Jun;Kim, Jae-Hoon
    • Korean Management Science Review / v.29 no.2 / pp.79-90 / 2012
  • With the rapid market penetration of smartphones, the importance of location-based services (LBS) has increased drastically. However, the traditional GPS method has a critical weakness: limited availability, for example in indoor environments. WPS is attracting attention as a widely applicable positioning method. In WPS, RSSI (Received Signal Strength Indication) data of all Wi-Fi access points (APs) are measured and stored in a large database. The stored RSSI data form a single radio fingerprint map, from which the actual position of a target point can be estimated. The essential factor of a radio fingerprint database is the integrity of its RSSI data. Because there are millions of APs in urban areas, RSSI measurement data are seriously contaminated. We therefore present a unified filtering method for RSSI measurement data and show its effectiveness when applied to the practical positioning system of a mobile operator.
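Fingerprint-based positioning with filtered RSSI data can be sketched roughly as below. The abstract does not describe the paper's unified filtering method, so a per-AP median is used here as a simple stand-in for outlier suppression. The AP names, coordinates, RSSI samples, and the `missing` default of -100 dBm are all hypothetical.

```python
import math
from statistics import median

def clean_rssi(samples):
    """Collapse repeated RSSI samples (dBm) into one value.

    The median is a stand-in filter that ignores isolated outliers.
    """
    return median(samples)

def distance(fp_a, fp_b, missing=-100.0):
    """Euclidean distance between two fingerprints in RSSI space.

    An AP absent from one fingerprint is treated as a very weak signal.
    """
    aps = set(fp_a) | set(fp_b)
    return math.sqrt(
        sum((fp_a.get(ap, missing) - fp_b.get(ap, missing)) ** 2 for ap in aps)
    )

def locate(fingerprint_map, observed):
    """Return the surveyed position whose fingerprint is closest."""
    return min(
        fingerprint_map,
        key=lambda pos: distance(fingerprint_map[pos], observed),
    )

# Raw survey data: position -> AP -> repeated RSSI samples, with outliers.
raw_survey = {
    (0, 0): {"ap1": [-40, -42, -41, -90], "ap2": [-70, -71, -69]},
    (10, 0): {"ap1": [-70, -69, -72], "ap2": [-45, -44, -46, -95]},
}

# Filter the samples to build the radio fingerprint map.
fingerprint_map = {
    pos: {ap: clean_rssi(samples) for ap, samples in aps.items()}
    for pos, aps in raw_survey.items()
}

observed = {"ap1": -43.0, "ap2": -68.0}
position = locate(fingerprint_map, observed)   # (0, 0) for this data
```

Without the filtering step, the -90 and -95 dBm outliers would pull a mean-based fingerprint away from the true signal level, which is the kind of contamination the abstract refers to.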

How to improve the accuracy of recommendation systems: Combining ratings and review texts sentiment scores (평점과 리뷰 텍스트 감성분석을 결합한 추천시스템 향상 방안 연구)

  • Hyun, Jiyeon;Ryu, Sangyi;Lee, Sang-Yong Tom
    • Journal of Intelligence and Information Systems / v.25 no.1 / pp.219-239 / 2019
  • As providing customized services to individuals becomes more important, research on personalized recommendation systems is constantly being carried out. Collaborative filtering is one of the most popular approaches in academia and industry. However, it has a limitation in that recommendations have mostly been based on quantitative information such as users' ratings, which lowers accuracy. To solve this problem, many studies have attempted to improve the performance of recommendation systems by using information other than quantitative ratings. A good example is the use of sentiment analysis on customer review text. Nevertheless, existing research has not directly combined the results of sentiment analysis with quantitative rating scores in the recommendation system. This study therefore aims to reflect the sentiments expressed in reviews in the rating scores. In other words, we propose a new algorithm that converts a user's own review into quantitative information and reflects it directly in the recommendation system. To do this, we needed to quantify users' reviews, which are originally qualitative information. In this study, the sentiment score was calculated through sentiment analysis, a text mining technique. The data consisted of movie reviews. Based on the data, a domain-specific sentiment dictionary was constructed for movie reviews. Regression analysis was used to construct the sentiment dictionary: positive and negative dictionaries were each constructed using Lasso regression, Ridge regression, and ElasticNet. The accuracy of each constructed dictionary was verified with a confusion matrix. The accuracy of the Lasso-based dictionary was 70%, that of the Ridge-based dictionary was 79%, and that of the ElasticNet ($\alpha = 0.3$) dictionary was 83%.
Therefore, in this study, the sentiment score of each review is calculated based on the ElasticNet dictionary. It was combined with the rating to create a new rating. In this paper, we show that collaborative filtering that reflects the sentiment scores of user reviews is superior to the traditional method that considers only the existing rating. To show this, the proposed algorithm was applied to memory-based user-based collaborative filtering (UBCF), item-based collaborative filtering (IBCF), and model-based matrix factorization (SVD and SVD++). For each algorithm, the mean absolute error (MAE) and the root mean square error (RMSE) were calculated to compare a recommendation system that combines sentiment scores with ratings against a system that considers ratings only. When the evaluation index was MAE, the improvement was 0.059 for UBCF, 0.0862 for IBCF, 0.1012 for SVD, and 0.188 for SVD++. When the evaluation index was RMSE, the improvement was 0.0431 for UBCF, 0.0882 for IBCF, 0.1103 for SVD, and 0.1756 for SVD++. These results show that ratings reflecting the proposed sentiment score yield better prediction performance than the conventional ratings. In other words, collaborative filtering that reflects the sentiment score of user reviews is more accurate than conventional collaborative filtering that considers only the quantitative score. A paired t-test was then performed for validation, and it confirmed that the proposed model is the better approach. To overcome the limitation of previous research that judged a user's sentiment only by the quantitative rating score, this study quantified the review text so that the user's opinion is reflected in the recommendation system in a more refined way, improving accuracy.
The findings of this study have managerial implications for recommendation system developers, who need to consider both quantitative and qualitative information. The method of constructing the combined system in this paper could be used directly by developers.

Development of Hybrid Recommender System Using Review Data Mining: Kindle Store Data Analysis Case (리뷰 데이터 마이닝을 이용한 하이브리드 추천시스템 개발: Amazon Kindle Store 데이터 분석사례)

  • Yihua Zhang;Qinglong Li;Ilyoung Choi;Jaekyeong Kim
    • Information Systems Review / v.23 no.1 / pp.155-172 / 2021
  • With the recent increase in online product purchases, recommender systems that recommend products based on users' preferences continue to be studied. A recommender system provides personalized product recommendation services to users. Collaborative filtering (CF) using user ratings on products is one of the most widely used recommendation algorithms. In CF, the item-based method uses the ratings a user left on purchased products to obtain the similarity between purchased and unpurchased products. CF takes a lot of time to calculate the similarity between products, and even more when using text-based big data such as Amazon store review data. This paper suggests a hybrid recommendation system that uses a 2-phase methodology and text data mining to calculate the similarity between products easily and quickly. To this end, we collected about 980,000 online consumer ratings and reviews from the online commerce store Amazon Kindle Store. Several experiments confirmed that the suggested hybrid recommendation system, which reflects users' ratings and review data, achieved a similar recommendation time but higher accuracy compared with CF-based benchmark recommender systems. Therefore, the suggested system is expected to increase user satisfaction and sales.
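The item-based CF baseline described in this abstract can be sketched as follows. This is a generic textbook formulation with cosine similarity, not the paper's 2-phase hybrid method; the item names, user IDs, and ratings are hypothetical.

```python
import math

def cosine(item_a, item_b):
    """Cosine similarity between two items' {user: rating} vectors."""
    common = set(item_a) & set(item_b)
    if not common:
        return 0.0
    dot = sum(item_a[u] * item_b[u] for u in common)
    norm_a = math.sqrt(sum(v * v for v in item_a.values()))
    norm_b = math.sqrt(sum(v * v for v in item_b.values()))
    return dot / (norm_a * norm_b)

def predict_rating(ratings, user, target):
    """Predict a user's rating for an unpurchased item.

    The prediction is the similarity-weighted average of the ratings the
    user gave to items they already purchased.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for item, by_user in ratings.items():
        if item == target or user not in by_user:
            continue
        sim = cosine(ratings[target], by_user)
        weighted_sum += sim * by_user[user]
        weight_total += abs(sim)
    return weighted_sum / weight_total if weight_total else None

# Hypothetical ratings: item -> {user: rating on a 1-5 scale}.
ratings = {
    "book_a": {"u1": 5, "u2": 4, "u3": 1},
    "book_b": {"u1": 4, "u2": 5, "u3": 1, "u4": 5},
    "book_c": {"u1": 1, "u2": 1, "u3": 5, "u4": 1},
}

# u4 has not rated book_a; book_b is more similar to it than book_c,
# so u4's high rating for book_b dominates the prediction.
predicted = predict_rating(ratings, "u4", "book_a")
```

Each prediction compares the target item with every item the user has rated, and each comparison scans the users who rated both. That pairwise cost is the similarity-computation time the abstract identifies as the bottleneck on large review datasets.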

Personalized Movie Recommendation System Using Context-Aware Collaborative Filtering Technique (상황기반과 협업 필터링 기법을 이용한 개인화 영화 추천 시스템)

  • Kim, Min Jeong;Park, Doo-Soon;Hong, Min;Lee, HwaMin
    • KIPS Transactions on Computer and Communication Systems / v.4 no.9 / pp.289-296 / 2015
  • The explosive growth of information has made it difficult for users to get appropriate information in time, and various new services have been provided to solve this problem. As customized services grow in prominence, personalized recommendation systems have become an important issue. Collaborative filtering is widely used in recommendation systems and is the most successful technique among them. However, because the recommendation is based on customers' profiles, sparsity and cold-start problems can occur. In this paper, we propose a personalized movie recommendation system that uses collaborative filtering techniques and context-based techniques. The context-based technique is a recommendation method that considers the user's environment in terms of time, emotion, and location, and it can reflect user preferences that depend on these environments. To utilize the context-based technique, this paper uses human emotion, drawing on movie reviews, which are an effective way to identify subjective individual information. The proposed method outperforms existing collaborative filtering methods.