DOI QR코드

DOI QR Code

Sentiment Analysis of Movie Review Using Integrated CNN-LSTM Mode

CNN-LSTM 조합모델을 이용한 영화리뷰 감성분석

  • Park, Ho-yeon (Department of Management Information Systems, Graduate School, Dongguk University_Seoul) ;
  • Kim, Kyoung-jae (Business School, Dongguk University_Seoul)
  • 박호연 (동국대학교_서울 일반대학원 경영정보학과) ;
  • 김경재 (동국대학교_서울 경영대학 경영정보학과)
  • Received : 2019.12.05
  • Accepted : 2019.12.16
  • Published : 2019.12.31

Abstract

Rapid growth of internet technology and social media is progressing. Data mining technology has evolved to enable unstructured document representations in a variety of applications. Sentiment analysis is an important technology that can distinguish poor or high-quality content through text data of products, and it has proliferated during text mining. Sentiment analysis mainly analyzes people's opinions in text data by assigning predefined data categories as positive and negative. This has been studied in various directions in terms of accuracy from simple rule-based to dictionary-based approaches using predefined labels. In fact, sentiment analysis is one of the most active researches in natural language processing and is widely studied in text mining. When real online reviews aren't available for others, it's not only easy to openly collect information, but it also affects your business. In marketing, real-world information from customers is gathered on websites, not surveys. Depending on whether the website's posts are positive or negative, the customer response is reflected in the sales and tries to identify the information. However, many reviews on a website are not always good, and difficult to identify. The earlier studies in this research area used the reviews data of the Amazon.com shopping mal, but the research data used in the recent studies uses the data for stock market trends, blogs, news articles, weather forecasts, IMDB, and facebook etc. However, the lack of accuracy is recognized because sentiment calculations are changed according to the subject, paragraph, sentiment lexicon direction, and sentence strength. This study aims to classify the polarity analysis of sentiment analysis into positive and negative categories and increase the prediction accuracy of the polarity analysis using the pretrained IMDB review data set. First, the text classification algorithm related to sentiment analysis adopts the popular machine learning algorithms such as NB (naive bayes), SVM (support vector machines), XGboost, RF (random forests), and Gradient Boost as comparative models. Second, deep learning has demonstrated discriminative features that can extract complex features of data. Representative algorithms are CNN (convolution neural networks), RNN (recurrent neural networks), LSTM (long-short term memory). CNN can be used similarly to BoW when processing a sentence in vector format, but does not consider sequential data attributes. RNN can handle well in order because it takes into account the time information of the data, but there is a long-term dependency on memory. To solve the problem of long-term dependence, LSTM is used. For the comparison, CNN and LSTM were chosen as simple deep learning models. In addition to classical machine learning algorithms, CNN, LSTM, and the integrated models were analyzed. Although there are many parameters for the algorithms, we examined the relationship between numerical value and precision to find the optimal combination. And, we tried to figure out how the models work well for sentiment analysis and how these models work. This study proposes integrated CNN and LSTM algorithms to extract the positive and negative features of text analysis. The reasons for mixing these two algorithms are as follows. CNN can extract features for the classification automatically by applying convolution layer and massively parallel processing. LSTM is not capable of highly parallel processing. Like faucets, the LSTM has input, output, and forget gates that can be moved and controlled at a desired time. These gates have the advantage of placing memory blocks on hidden nodes. The memory block of the LSTM may not store all the data, but it can solve the CNN's long-term dependency problem. Furthermore, when LSTM is used in CNN's pooling layer, it has an end-to-end structure, so that spatial and temporal features can be designed simultaneously. In combination with CNN-LSTM, 90.33% accuracy was measured. This is slower than CNN, but faster than LSTM. The presented model was more accurate than other models. In addition, each word embedding layer can be improved when training the kernel step by step. CNN-LSTM can improve the weakness of each model, and there is an advantage of improving the learning by layer using the end-to-end structure of LSTM. Based on these reasons, this study tries to enhance the classification accuracy of movie reviews using the integrated CNN-LSTM model.

인터넷 기술과 소셜 미디어의 빠른 성장으로 인하여, 구조화되지 않은 문서 표현도 다양한 응용 프로그램에 사용할 수 있게 마이닝 기술이 발전되었다. 그 중 감성분석은 제품이나 서비스에 내재된 사용자의 감성을 탐지할 수 있는 분석방법이기 때문에 지난 몇 년 동안 많은 관심을 받아왔다. 감성분석에서는 주로 텍스트 데이터를 이용하여 사람들의 감성을 사전 정의된 긍정 및 부정의 범주를 할당하여 분석하며, 이때 사전 정의된 레이블을 이용하기 때문에 다양한 방향으로 연구가 진행되고 있다. 초기의 감성분석 연구에서는 쇼핑몰 상품의 리뷰 중심으로 진행되었지만, 최근에는 블로그, 뉴스기사, 날씨 예보, 영화 리뷰, SNS, 주식시장의 동향 등 다양한 분야에 적용되고 있다. 많은 선행연구들이 진행되어 왔으나 대부분 전통적인 단일 기계학습기법에 의존한 감성분류를 시도하였기에 분류 정확도 면에서 한계점이 있었다. 본 연구에서는 전통적인 기계학습기법 대신 대용량 데이터의 처리에 우수한 성능을 보이는 딥러닝 기법과 딥러닝 중 CNN과 LSTM의 조합모델을 이용하여 감성분석의 분류 정확도를 개선하고자 한다. 본 연구에서는 대표적인 영화 리뷰 데이터셋인 IMDB의 리뷰 데이터 셋을 이용하여, 감성분석의 극성분석을 긍정 및 부정으로 범주를 분류하고, 딥러닝과 제안하는 조합모델을 활용하여 극성분석의 예측 정확도를 개선하는 것을 목적으로 한다. 이 과정에서 여러 매개 변수가 존재하기 때문에 그 수치와 정밀도의 관계에 대해 고찰하여 최적의 조합을 찾아 정확도 등 감성분석의 성능 개선을 시도한다. 연구 결과, 딥러닝 기반의 분류 모형이 좋은 분류성과를 보였으며, 특히 본 연구에서 제안하는 CNN-LSTM 조합모델의 성과가 가장 우수한 것으로 나타났다.

Keywords

References

  1. Abid, F., M. Alam, M. Yasir, and C. Li, "Sentiment analysis through recurrent variants latterly on convolutional neural network of Twitter," Future Generation Computer Systems, Vol.95(2019), 292-308. https://doi.org/10.1016/j.future.2018.12.018
  2. Appel, O., F. Chiclana and J. Carter, "Main concepts, state of the art and future research questions in sentiment analysis," Acta Polytechnica Hungarica, Vol.12, No.3(2015), 87-108.
  3. Baydogan, M. G., G. Runger, and E. Tuv, "A bag-of-features framework to classify time series," IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol.35, No.11(2013), 2796-2802. https://doi.org/10.1109/TPAMI.2013.72
  4. Bengio, Y., P. Simard, and P. Frasconi, "Learning long-term dependencies with gradient descent is difficult," IEEE Transactions on Neural Networks, Vol.5, No.2(1994), 157-166. https://doi.org/10.1109/72.279181
  5. Ferguson, P., N. O'Hare, M. Davy, A. Bermingham, P. Sheridan, C. Gurrin and A. F. Smeaton, "Exploring the use of paragraph-level annotations for sentiment analysis of financial blogs," Proceedings of WOMAS 2009-Workshop on Opinion Mining and Sentiment Analysis, (2009).
  6. Graves, A., "Long short-term memory," Supervised Sequence Labelling with Recurrent Neural Networks, Springer, Berlin, Heidelberg, 2012
  7. Hyun, J., S. Ryu, and S.-Y. Lee, "How to improve the accuracy of recommendation systems: Combining ratings and review texts sentiment scores," Journal of Intelligence and Information Systems, Vol.25, No.1(2019), 219-239. https://doi.org/10.13088/JIIS.2019.25.1.219
  8. Kim, Y., and M. Song, "A study on analyzing sentiments on movie reviews by multi-level sentiment classifier," Journal of Intelligence and Information Systems, Vol.22, No.3(2016), 71-89. https://doi.org/10.13088/jiis.2016.22.3.071
  9. Lee, S.-H., J. Cui, and J.-W. Kim, "Sentiment analysis on movie review through building modified sentiment dictionary by movie genre," Journal of Intelligence and Information Systems, Vol.22, No.2(2016), 97-113. https://doi.org/10.13088/jiis.2016.22.2.097
  10. Liu, P., X. Qiu, and X. Huang, "Recurrent neural network for text classification with multi-task learning," arXiv 1605.05101, 2016.
  11. Liu, S., and J.-H. Chen, "A multi-label classification based approach for sentiment classification," Expert Systems with Applications, Vol.42, No.3(2015), 1083-1093. https://doi.org/10.1016/j.eswa.2014.08.036
  12. Lochter, J. V., R. F.Zanetti, D. Reller, and T. A. Almeida, "Short text opinion detection using ensemble of classifiers and semantic indexing," Expert Systems with Applications, Vol.62, (2016), 243-249. https://doi.org/10.1016/j.eswa.2016.06.025
  13. Nguyen, H., and N. M. Le, "An ensemble method with sentiment features and clustering support," Neurocomputing, Vol.370(2019), 155-165. https://doi.org/10.1016/j.neucom.2019.08.071
  14. Pak, A. and P. Paroubek, "Twitter as a corpus for sentiment analysis and opinion mining," Proceedings of the Seventh International Conference on Language Resources and Evaluation, (2010), 1320-1326.
  15. Pang, B., L. Lee, and S. Vaithyanathan, "Thumbs up?: sentiment classification using machine learning techniques," Proceedings of the ACL-02 conference on Empirical methods in Natural Language Processing, 2002.
  16. Rao, G., W. Huang, Z. Feng, and Q. Cong, "LSTM with sentence representations for document level sentiment classification," Neurocomputing, Vol.308(2018), 49-57. https://doi.org/10.1016/j.neucom.2018.04.045
  17. Sailunaz, K., and R. Alhajj, "Emotion and sentiment analysis from Twitter text," Journal of Computational Science, Vol.36(2019), 101003. https://doi.org/10.1016/j.jocs.2019.05.009
  18. Shuang K., Z. Zhang, H. Guo, and J. Loo, "A sentiment information Collector-Extractor architecture based neural network for sentiment analysis," Information Sciences, Vol.467 (2018), 549-558. https://doi.org/10.1016/j.ins.2018.08.026
  19. Tang, F., L. Fu, B. Yao, and W. Xu, "Aspect based fine-grained sentiment analysis for online reviews," Information Sciences, Vol.488(2019), 190-204. https://doi.org/10.1016/j.ins.2019.02.064
  20. Xu, J., D. Chen, X. Qiu, and X. Huang, "Cached long short-term memory neural networks for document-level sentiment classification," arXiv preprint arXiv:1610.04989, 2016.

Cited by

  1. Keyword extraction for film reviews based on social network analysis and natural language technology vol.189, 2020, https://doi.org/10.1051/e3sconf/202018903019