DOI QR코드

DOI QR Code

Optimization of Domain-Independent Classification Framework for Mood Classification

  • Choi, Sung-Pil (School of Engineering, Information and Communications University) ;
  • Jung, Yu-Chul (School of Engineering, Information and Communications University) ;
  • Myaeng, Sung-Hyon (School of Engineering, Information and Communications University)
  • 발행 : 2007.12.31

초록

In this paper, we introduce a domain-independent classification framework based on both k-nearest neighbor and Naive Bayesian classification algorithms. The architecture of our system is simple and modularized in that each sub-module of the system could be changed or improved efficiently. Moreover, it provides various feature selection mechanisms to be applied to optimize the general-purpose classifiers for a specific domain. As for the enhanced classification performance, our system provides conditional probability boosting (CPB) mechanism which could be used in various domains. In the mood classification domain, our optimized framework using the CPB algorithm showed 1% of improvement in precision and 2% in recall compared with the baseline.

키워드

참고문헌

  1. Fabrizio Sebastiani, "Machine Learning in Automated Text Categorization," ACM Computing Surveys, 34(1):1-47, 2002 https://doi.org/10.1145/505282.505283
  2. Andrew McCallum and Kamal Nigam, "A Comparison of Event Models for Naive Bayes Text Classification," AAAI/ICML-98 Workshop on Learning for Text Categorization, pp.41-48
  3. Yiming Yang and J. Pedersen, "A Comparative Study on Feature Selection in Text Categorization," Proc. of the 14th International Conference on Machine Learning, ICML '97, pp.412-420
  4. David Lewis, "Evaluating and Optimizing Autonomous Text Classification Systems," Proc. of the 18th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, 1995
  5. Margaret M. Bradley and Peter J. Lang, "Affective Norms for English Words (ANEW): Instruction Manual and Affective Ratings," Technical Report C-1, The Center for Research in Psychophysiology, University of Florida
  6. Hugo Liu, Henry Lieberman, Ted Selker, "A model of textual sensing using real-world knowledge,"Proceedings of the 8th international conference on Intelligent user interfaces, PP.125-132, 2003
  7. G. Mishne, "Experiments with Mood Classification in Blog Posts," Style 2005 at SIGIR 2005, 2005
  8. Owsley, S., Sood, S., and Hammond, K. Domain Specific Affective Classification of Documents. AAAI 2006 Spring Symposium on Computational Approaches to Analysing Weblogs, 2006
  9. Ruberry, M., Owsley, S., and Shamma D.A. Affective Behaviors for Theatrical Agents. Proc .of Intelligent User Interfaces Workshop on Affective Interactions: the Computer in the Affective Loop, 2005
  10. S. T. Dumais, J. Platt, D. Heckerman and M. Sahami, "Inductive learning algorithms and representations for text categorization," Proc. of CIKM '98, pp.148-155
  11. Y. B. Lee and S. H. Myaeng, "Text Genre Classification with Genre-Revealing and Subject-Revealing Features," Proc. of the 25th ACM SIGIR Conference, Tempere, Finland, 2002
  12. Isabelle Guyon, Andr´e Elisseeff, "An Introduction to Variable and Feature Selection," Journal of Machine Learning Research, 3, pp. 1157-1182, 2003 https://doi.org/10.1162/153244303322753616
  13. Rosso, Paolo and Edgardo Ferretti and Daniel Jimenez and Vicente Vidal, "Text Categorization and Information Retrieval Using WordNet Senses,"Proceedings of the Second Global WordNet Conference , pp. 299-304, Brno, Czech Republic, January 20-23, 2004
  14. Thorsten Joachims, "Text Categorization with Support Vector Machines: Learning with Many Relevant Features," Proceedings of ECML-98, 10th European Conference on Machine Learning, 1998