DOI QR코드

DOI QR Code

Finding a plan to improve recognition rate using classification analysis

  • Kim, SeungJae (Department of Convergence Honam University) ;
  • Kim, SungHwan (College of IT convergence engineering and SW Center University Business unit, Chosun University)
  • Received : 2020.11.27
  • Accepted : 2020.12.05
  • Published : 2020.12.31

Abstract

With the emergence of the 4th Industrial Revolution, core technologies that will lead the 4th Industrial Revolution such as AI (artificial intelligence), big data, and Internet of Things (IOT) are also at the center of the topic of the general public. In particular, there is a growing trend of attempts to present future visions by discovering new models by using them for big data analysis based on data collected in a specific field, and inferring and predicting new values with the models. In order to obtain the reliability and sophistication of statistics as a result of big data analysis, it is necessary to analyze the meaning of each variable, the correlation between the variables, and multicollinearity. If the data is classified differently from the hypothesis test from the beginning, even if the analysis is performed well, unreliable results will be obtained. In other words, prior to big data analysis, it is necessary to ensure that data is well classified according to the purpose of analysis. Therefore, in this study, data is classified using a decision tree technique and a random forest technique among classification analysis, which is a machine learning technique that implements AI technology. And by evaluating the degree of classification of the data, we try to find a way to improve the classification and analysis rate of the data.

Keywords

References

  1. Tom M. Mitchell, "The discipline of machine learning(Vol. 9)," Carnegie Mellon University, Shcool of Computer Science, MachineLearning Department, 2006. DOI: https://www.researchgate.net/publication/268201693_The_Discipline_of_Machine_Learning
  2. Young Jin Kim, Joung Woo Ryu, Won Moon Song and Myung Won Kim, "Fire Probability Prediction Based on Weather Information Using Decision Tree," Journal of KIISE, JOK: software and application", Vol.40, No.11, 2013.11. DOI: http://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE02283313
  3. Yoo, Se Hee, Park, Il Su and Kim, Yoomi, "A Decision-Tree Analysis of Influential Factors and Reasons for Unmet Dental Care in Korean Adults," Health and Social Welfare Review, Vol.34, No.4, pp.294-335, 2017 DOI: https://dx.doi.org/10.15709/hswr.2017.37.4.294.
  4. Yoon, Tae-Tok and Lee, Jee-Hyong, "Design of Heuristic Decision Tree (HDT) Using Human Knowledge," Korean Institute of Intelligent Systems, Vol.19, No.1, pp.161-164, 2009. 4 DOI: https://doi.org/10.5391/JKIS.2009.19.4.525.
  5. Donghoon Lee, Songhwai Oh, "Head Pose Estimation Using Random Forests," The Korean Institute of Information Scientists and Engineers, Vol.40, No.8, pp.435-441, 2013.8. DOI: http://kiise.or.kr/e_journal/2013/8/SA/pdf/02.pdf
  6. Jun Heon Lee and Jun Geol Baek, "Real-time control chart using a random forest-based multi-category classifier," Korean Institute Of Industrial Engineers, pp. 673-682, 2017.11 DOI http://www.dbpia.co.kr/journal/articleDetail?nodeId=NODE07262460
  7. Kim, Pan Jun, "An Analytical Study on Automatic Classification of Domestic Journal articles Using Random Forest," Journal of the Korean Society for information Management , vol.36. no.2, pp.57-77, 2018 DOI: https://doi.org/10.3743/KOSIM.2018.35.2.037.
  8. Um, Nam-Kyoung , Woo, Sung-Hee and Lee, Sang-Ho, "The Hybrid Model using SVM and Decision Tree for Intrusion Detection," KIPS Transactions on Computer and Communication Systems, Vol.14, No.1, pp.1-6, 2007. DOI: 10.3745/KIPSTC.2007.14.1.1, Full Text
  9. Ju, Sang-Lim, Kim, Nam-Il and Kim, Kyung-Seok, "Deep Learning-based Antenna Selection Scheme for Millimeter-wave Systems in Urban Micro Cell Scenario," The Journal of The Institute of Internet, Broadcasting and Communication (IIBC), Vol. 20, No. 5, pp.57-62, Oct. 31, 2020. DOI: https://doi.org/10.7236/JIIBC.2020.20.5.57.
  10. Lee, Gye-Sung and Kim, In-Kook, "A Study on Simplification of Machine Learning Model," The Journal of The Institute of Internet, Broadcasting and Communication (IIBC), Vol.16, No.4, pp.147-152. Aug, 31, 2016. DOI: http://dx.doi.org/10.7236/JIIBC.201616.4.147