A Novel Posterior Probability Estimation Method for Multi-label Naive Bayes Classification

Kim, Hae-Cheon;Lee, Jaesung;

doi:10.9708/jksci.2018.23.06.001

Journal of the Korea Society of Computer and Information

Vol. 23, No. 6 / pp. 1-7 / 2018 / pISSN 1598-849X / eISSN 2383-9945

Korean Society of Computer Information


Kim, Hae-Cheon (School of Computer Science and Engineering, Chung-Ang University);
Lee, Jaesung (School of Computer Science and Engineering, Chung-Ang University)

Received: 2018.03.20
Reviewed: 2018.05.28
Published: 2018.06.29

https://doi.org/10.9708/jksci.2018.23.06.001

Abstract

Multi-label classification is the task of finding the multiple labels associated with an input pattern. It can be achieved by extending conventional single-label classification; common extension techniques are binary relevance, label powerset, and classifier chains. However, most extended multi-label naive Bayes classifiers cannot estimate posterior probabilities accurately because they do not reflect label dependency, and the remaining ones give unstable posterior probability estimates that vary with the label selection order. To estimate posterior probabilities well, we propose a new estimation method that efficiently reflects the probabilities among all labels, and therefore the correlation between labels. Experiments confirm that the extended multi-label naive Bayes classifier using the proposed method achieves higher accuracy than existing multi-label naive Bayes classifiers.
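To make the baseline concrete, the following is a minimal sketch of binary relevance with a Bernoulli naive Bayes model per label. It is not the method proposed in the paper; it only illustrates the limitation the abstract describes, namely that each label's posterior is estimated independently and label dependency is ignored. The function names, the Laplace smoothing parameter `alpha`, the 0.5 threshold, and the toy data are all assumptions made for illustration.

```python
import math

def train_bernoulli_nb(X, y, alpha=1.0):
    """Fit one Bernoulli naive Bayes model for a single binary label."""
    n_features = len(X[0])
    model = {}
    for c in (0, 1):
        rows = [x for x, t in zip(X, y) if t == c]
        # Laplace-smoothed class prior and per-feature P(x_j = 1 | class c)
        prior = (len(rows) + alpha) / (len(X) + 2 * alpha)
        theta = [(sum(r[j] for r in rows) + alpha) / (len(rows) + 2 * alpha)
                 for j in range(n_features)]
        model[c] = (prior, theta)
    return model

def posterior(model, x):
    """Return P(label = 1 | x) under the naive feature-independence assumption."""
    log_joint = {}
    for c, (prior, theta) in model.items():
        s = math.log(prior)
        for xj, t in zip(x, theta):
            s += math.log(t if xj else 1.0 - t)
        log_joint[c] = s
    # Normalise in log space for numerical stability
    m = max(log_joint.values())
    z = sum(math.exp(v - m) for v in log_joint.values())
    return math.exp(log_joint[1] - m) / z

def binary_relevance_fit(X, Y):
    """Train one independent model per label column of Y."""
    n_labels = len(Y[0])
    return [train_bernoulli_nb(X, [row[k] for row in Y]) for k in range(n_labels)]

def binary_relevance_predict(models, x, threshold=0.5):
    """Score each label separately; no information flows between labels."""
    probs = [posterior(m, x) for m in models]
    return probs, [int(p >= threshold) for p in probs]

# Toy data (made up for illustration): 3 binary features, 2 labels
X = [[1, 0, 1], [1, 1, 0], [0, 0, 1], [0, 1, 0], [1, 0, 0], [0, 1, 1]]
Y = [[1, 0], [1, 1], [0, 0], [0, 1], [1, 0], [0, 1]]

models = binary_relevance_fit(X, Y)
probs, labels = binary_relevance_predict(models, [1, 1, 0])
print(probs, labels)
```

Because every label has its own model, a co-occurrence pattern between two labels in `Y` never influences either posterior. A classifier chain would instead append earlier label predictions to the features of later models, which is where the dependence on label selection order mentioned in the abstract comes from.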


