구문분석과 기계학습 기반 하이브리드 텍스트 논조 자동분석

Hybrid Approach to Sentiment Analysis based on Syntactic Analysis and Machine Learning

  • 홍문표 (성균관대학교 독어독문학과) ;
  • 신미영 (경북대학교 IT 대학 전자공학부) ;
  • 박신혜 (성균관대학교 독어독문학과) ;
  • 이형민 (경북대학교 IT 대학 전자공학부)
  • 투고 : 2010.11.07
  • 심사 : 2010.12.03
  • 발행 : 2010.12.31

초록

This paper presents a hybrid approach to the sentiment analysis of online texts. The sentiment of a text refers to the feelings that the author of a text has towards a certain topic. Many existing approaches employ either a pattern-based approach or a machine learning based approach. The former shows relatively high precision in classifying the sentiments, but suffers from the data sparseness problem, i.e. the lack of patterns. The latter approach shows relatively lower precision, but 100% recall. The approach presented in the current work adopts the merits of both approaches. It combines the pattern-based approach with the machine learning based approach, so that the relatively high precision and high recall can be maintained. Our experiment shows that the hybrid approach improves the F-measure score for more than 50% in comparison with the pattern-based approach and for around 1% comparing with the machine learning based approach. The numerical improvement from the machine learning based approach might not seem to be quite encouraging, but the fact that in the current approach not only the sentiment or the polarity information of sentences but also the additional information such as target of sentiments can be classified makes the current approach promising.

키워드

과제정보

연구 과제번호 : Development of Machine Translation Technology for Korean/Chniese/English Spoken Language and Business Documents