DOI QR코드

DOI QR Code

Application and Comparison of Data Mining Technique to Prevent Metal-Bush Omission

메탈부쉬 누락예방을 위한 데이터마이닝 기법의 적용 및 비교

  • Received : 2023.07.25
  • Accepted : 2023.08.07
  • Published : 2023.09.30

Abstract

The metal bush assembling process is a process of inserting and compressing a metal bush that serves to reduce the occurrence of noise and stable compression in the rotating section. In the metal bush assembly process, the head diameter defect and placement defect of the metal bush occur due to metal bush omission, non-pressing, and poor press-fitting. Among these causes of defects, it is intended to prevent defects due to omission of the metal bush by using signals from sensors attached to the facility. In particular, a metal bush omission is predicted through various data mining techniques using left load cell value, right load cell value, current, and voltage as independent variables. In the case of metal bush omission defect, it is difficult to get defect data, resulting in data imbalance. Data imbalance refers to a case where there is a large difference in the number of data belonging to each class, which can be a problem when performing classification prediction. In order to solve the problem caused by data imbalance, oversampling and composite sampling techniques were applied in this study. In addition, simulated annealing was applied for optimization of parameters related to sampling and hyper-parameters of data mining techniques used for bush omission prediction. In this study, the metal bush omission was predicted using the actual data of M manufacturing company, and the classification performance was examined. All applied techniques showed excellent results, and in particular, the proposed methods, the method of mixing Random Forest and SA, and the method of mixing MLP and SA, showed better results.

Keywords

References

  1. Chawla, N.V., Bowyer, K.W., Hall, L.O., and Kegelmeyer, W.P., SMOTE: Synthetic Minority Over-sampling Technique, Journal of Artificial Intelligence Research, 2002, Vol. 16, pp. 321-357. https://doi.org/10.1613/jair.953
  2. Han, H., Wang, W.-Y., and Mao, B-H, BorderlineSMOTE: A New Over-Sampling Method in Imbalanced Data Sets Learning, Proceedings of ICIC 2005: Advances in Intelligent Computing, 2005, pp. 878-887.
  3. Han, Y.J. and Joe, I.W., Imbalanced Data Improvement Techniques Based on SMOTE and Light GBM, KIPS Trans. Comp. and Comm. Sys., 2022, Vol. 11 No.12, pp. 445-452
  4. He, H., Bai, Y., Garcia, E., and Li, S., ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning, International Joint Conference on Neural Networks, 2008, Vol. 3, pp. 1322-1328.
  5. Kirkpatrick, S., Gelatt, C.D., and Vecchi, M.P., Optimization by Simulated Annealing, Science, 1983, Vol. 220, No. 4598, pp. 671-680. https://doi.org/10.1126/science.220.4598.671
  6. Lee, D.J., Simulated Annealing for Overcoming Data Imbalance in Mold Injection Process, J of KSIE, 2022, Vol. 45, No. 4, pp. 233-239. https://doi.org/10.11627/jksie.2022.45.4.233
  7. Lee, J.H., Machine Learning Applications to Households Insolvency with Imbalanced Data, J of Consumer Studies, 2019, Vol. 30, No. 6, pp. 97-118. https://doi.org/10.35736/JCS.30.6.5
  8. Moon, A.K. and Kim, H.S., Microclimate-Based Frost Prediction Model Resolving the Class Imbalance, J of Korea Ins. of Comm. And Inf. Sci., 2022, Vol. 47, No.10, pp. 1704-1715. https://doi.org/10.7840/kics.2022.47.10.1704
  9. Oh, S.M. and Lee, J.H., Virtual Data Generation Techniques for Imbalance problem of credit prediction data, Proceedings of KICS, Yongpyung, 2023.02.08-10, pp. 874-875.
  10. Park, S.C., Kim, D.Y., Seo, K.B., and Lee, W.J., The Development of Biodegradable Fiber Tensile Tenacity and Elongation Prediction Model Considering Data Imbalance and Measurement Error, KIPS Trans. Softw. and Data Eng., 2022, Vol. 11, No. 12, pp. 489-498.
  11. Shin, H.J., Lee, S.B., and Lee, K.C., Improving the Quality of Generating Imbalance Data in GANs through an Exhaustive Contrastive Learning, J of KIISE, 2023, Vol. 50, No.4, pp. 295-305. 2023. https://doi.org/10.5626/JOK.2023.50.4.295