DOI QR코드

DOI QR Code

A Novel Posterior Probability Estimation Method for Multi-label Naive Bayes Classification

  • Kim, Hae-Cheon (School of Computer Science and Engineering, Chang-Ang University) ;
  • Lee, Jaesung (School of Computer Science and Engineering, Chung-Ang University)
  • Received : 2018.03.20
  • Accepted : 2018.05.28
  • Published : 2018.06.29

Abstract

A multi-label classification is to find multiple labels associated with the input pattern. Multi-label classification can be achieved by extending conventional single-label classification. Common extension techniques are known as Binary relevance, Label powerset, and Classifier chains. However, most of the extended multi-label naive bayes classifier has not been able to accurately estimate posterior probabilities because it does not reflect the label dependency. And the remaining extended multi-label naive bayes classifier has a problem that it is unstable to estimate posterior probability according to the label selection order. To estimate posterior probability well, we propose a new posterior probability estimation method that reflects the probability between all labels and labels efficiently. The proposed method reflects the correlation between labels. And we have confirmed through experiments that the extended multi-label naive bayes classifier using the proposed method has higher accuracy then the existing multi-label naive bayes classifiers.

Keywords

References

  1. Lee, Jaedong, et al., "An approach for multi-label classification by directed acyclic graph with label correlation maximization," Information Sciences, Vol. 351, pp. 101-114, March 2016. https://doi.org/10.1016/j.ins.2016.02.037
  2. Sucar, L. Enrique, et al., "Multi-label classification with Bayesian network-based chain classifiers," Pattern Recognition Letters, Vol. 41, pp. 14-22, November 2014. https://doi.org/10.1016/j.patrec.2013.11.007
  3. Zhang, Min-Ling, and Zhi-Hua Zhou, "ML-KNN: A lazy learning approach to multi-label learning," Pattern recognition, Vol. 40, No. 7, pp. 2038-2048, 2007. https://doi.org/10.1016/j.patcog.2006.12.019
  4. Zhang, Min-Ling, Jose M. Pena, and Victor Robles, "Feature selection for multi-label naive Bayes classification," Information Sciences, Vol. 179, No. 19, pp. 3218-3229, June 2009. https://doi.org/10.1016/j.ins.2009.06.010
  5. Godbole, Shantanu, and Sunita Sarawagi, "Discriminative methods for multi-labeled classification," Pacific-Asia conference on knowledge discovery and data mining, pp. 22-30, Berlin, Heidelberg, Germany, 2004.
  6. Read, Jesse, et al., "Classifier chains for multi-label classification," Machine learning, Vol. 85, No. 3, pp. 333, 2011. https://doi.org/10.1007/s10994-011-5256-5
  7. Ueda, Naonori, and Kazumi Saito, "Parametric mixture models for multi-labeled text," Advances in neural information processing systems, pp. 737-744, 2003.
  8. I. Katakis et al., "Multilabel Text Classification for Automated Tag Suggestion," 2008 Discovery Challenge, Antwerp, Belgium, September, 2008.
  9. D. Turnbull et al., "Semantic Annotation and Retrieval of Music and Sound Effects," IEEE Trans. Audio Speech Lang. Process, Vol. 16, No. 2, pp. 467-476, 2008. https://doi.org/10.1109/TASL.2007.913750
  10. Trohidis, Konstantinos, et al., "Multi-Label Classification of Music into Emotions," ISMIR, Vol. 8, pp. 325-330, 2008.
  11. Elisseeff, Andre, and Jason Weston, "A kernel method for multi-labelled classification," Advances in neural information processing systems, pp. 681-687, 2002.
  12. A. Cano et al., "LAIM discretization for multi-label data," Information Science, Vol. 330, pp. 370-384, 2016. https://doi.org/10.1016/j.ins.2015.10.032
  13. Zhang, Min-Ling, and Zhi-Hua Zhou, "A review on multi-label learning algorithms," IEEE transactions on knowledge and data engineering, Vol. 26, No. 8, pp. 1819-1837, 2014. https://doi.org/10.1109/TKDE.2013.39
  14. Klimt, Bryan, and Yiming Yang, "The enron corpus: A new dataset for email classification research," European Conference on Machine Learning, pp. 217-226, Springer, Berlin, Heidelberg, 2004.
  15. Boutell, Matthew R., et al., "Learning multi-label scene classification," Pattern recognition, Vol. 37, No. 9, pp. 1757-1771, 2004. https://doi.org/10.1016/j.patcog.2004.03.009
  16. Barutcuoglu, Zafer, Robert E. Schapire, and Olga G. Troyanskaya, "Hierarchical multi-label prediction of gene function," Bioinformatics, Vol. 22, No. 7, pp. 830-836, 2006. https://doi.org/10.1093/bioinformatics/btk048
  17. Joachims, Thorsten, "Text categorization with support vector machines: Learning with many relevant features," European conference on machine learning, pp. 137-142 Springer, Berlin, Heidelberg, 1998.
  18. Read, Jesse, et al., "Classifier chains for multi-label classification," Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 254-269, Springer, Berlin, Heidelberg, 2009.
  19. Demsar, Janez, "Statistical comparisons of classifiers over multiple data sets," Journal of Machine learning research, pp. 1-30. January 2006.
  20. M. Zhang and L. Wu, "LIFT: Multi-label Learning with Label-Specific Features," IEEE Trans. Pattern Anal. Mach. Intell., Vol. 37, No.1, pp. 107-120, 2015. https://doi.org/10.1109/TPAMI.2014.2339815
  21. S. Diplaris et al., "Protein Classification with Multiple Algorithms," Proc. 10th Panhellenic Conf. Inf., pp. 448-456, Volos, Greece, Nov 2005.
  22. J. Pestian et al., "A shared task involving multi-label classification of clinical free text," Proc. Work. BioNLP 2007, pp. 97-104, Prague, Czech, June 2007.
  23. A. Srivastava and B. Zane-Ulman, "Discovering recurring anomalies in text reports regarding complex space systems," Proc. 2005 IEEE Aerospace Conf., pp. 3853-3862, Big Sky, USA, Mar 2005.
  24. Baek, Yeong Tae, "Detection of Character Emotional Type Based on Classification of Emotional Words at Story," Journal of the Korea Society of Computer and Information, Vol. 18, No. 9, pp. 131-138, 2013. https://doi.org/10.9708/jksci.2013.18.9.131
  25. Wang, Tinghuai, Jean-Yves Guillemaut, and John Collomosse, "Multi-label propagation for coherent video segmentation and artistic stylization," Image Processing (ICIP), 2010 17th IEEE International Conference on, pp. 3005-3008, 2010.
  26. Persing, Isaac, and Vincent Ng, "Modeling thesis clarity in student essays," Proceedings of the 51st Annual Meeting of the Association for Computational Linguistics, Vol. 1, pp. 260-269, 2013.
  27. Park, Jong-Beom, "Development of the Poker Game Achievement Engine for Artificial Intelligence," Journal of the Korea Society of Computer and Information, Vol. 14, No. 11, pp. 41-52, 2009.