DOI QR코드

DOI QR Code

XAI 기반 발전설비 고장 기록 데이터 품질 향상 시스템 개발

Development of System for Enhancing the Quality of Power Generation Facilities Failure History Data Based on Explainable AI (XAI)

  • 김유림 (인하대학교 산업경영공학과) ;
  • 박정인 (인하대학교 산업경영공학과) ;
  • 박동현 (인하대학교 산업경영공학과) ;
  • 강성우 (인하대학교 산업경영공학과)
  • Kim Yu Rim (Department of Industrial Engineering, Inha University) ;
  • Park Jeong In (Department of Industrial Engineering, Inha University) ;
  • Park Dong Hyun (Department of Industrial Engineering, Inha University) ;
  • Kang Sung Woo (Department of Industrial Engineering, Inha University)
  • 투고 : 2024.06.12
  • 심사 : 2024.07.23
  • 발행 : 2024.09.30

초록

Purpose: The deterioration in the quality of failure history data due to differences in interpretation of failures among workers at power plants and the lack of consistency in the way failures are recorded negatively impacts the efficient operation of power plants. The purpose of this study is to propose a system that classifies power generation facilities failures consistently based on the failure history text data created by the workers. Methods: This study utilizes data collected from three coal unloaders operated by Korea Midland Power Co., LTD, from 2012 to 2023. It classifies failures based on the results of Soft Voting, which incorporates the prediction probabilities derived from applying the predict_proba technique to four machine learning models: Random Forest, Logistic Regression, XGBoost, and SVM, along with scores obtained by constructing word dictionaries for each type of failure using LIME, one of the XAI (Explainable Artificial Intelligence) methods. Through this, failure classification system is proposed to improve the quality of power generation facilities failure history data. Results: The results of this study are as follows. When the power generation facilities failure classification system was applied to the failure history data of Continuous Ship Unloader, XGBoost showed the best performance with a Macro_F1 Score of 93%. When the system proposed in this study was applied, there was an increase of up to 0.17 in the Macro_F1 Score for Logistic Regression compared to when the model was applied alone. All four models used in this study, when the system was applied, showed equal or higher values in Accuracy and Macro_F1 Score than the single model alone. Conclusion: This study propose a failure classification system for power generation facilities to improve the quality of failure history data. This will contribute to cost reduction and stability of power generation facilities, as well as further improvement of power plant operation efficiency and stability.

키워드

과제정보

이 논문은 정부(과학기술정보통신부)의 재원으로 정보통신기획평가원의 지원을 받아 수행된 지역지능화혁신인재양성사업임(IITP-2024-RS-2023-00259678).

참고문헌

  1. Batelic J, Griparic K, and Matika D. 2021. Impact of Remediation-Based Maintenance on the Reliability of a Coal-Fired Power Plant Using Generalized Stochastic Petri Nets. Energies 14(18):5682. 
  2. Blanco, M. A., Marti-Puig, P., Gibert, K., Cusido, J., and Sole-Casals, J. 2019. A text-mining approach to assess the failure condition of wind turbines using maintenance service history. Energies 12(10):1982. 
  3. Chawla, N. V., Bowyer, K. W., Hall, L. O., and Kegelmeyer, W. P. 2002. SMOTE: synthetic minority over-sampling technique. Journal of Artificial Intelligence Research 16:321-357. 
  4. Chen, M., Liu, Q., Chen, S., Liu, Y., Zhang, C. H., and Liu, R. 2019. XGBoost-based algorithm interpretation and application on post-fault transient stability status prediction of power system. IEEE Access 7:13149-13158. 
  5. Han, M., Kim, Y., and Kim, N. 2022. The Effect of Domain Specificity on the Performance of Domain-Specific Pre-Trained Language Models. Journal of Intelligence and Information Systems 28(4):251-273. 
  6. He, H., and Garcia, E. A. 2009. Learning from imbalanced data. IEEE Transactions on Knowledge and Data Engineering 21(9):1263-1284. 
  7. Hong, J., Jung, Y., and Kang, S. 2024. Defect Prediction and Variable Impact Analysis in CNC Machining Process. Journal of the Korean society for Quality Management 52(2):185-199. 
  8. Hong, S., Kim, J., and Yang, E. 2022. Automated text classification of maintenance data of higher education buildings using text mining and machine learning techniques. Journal of Architectural Engineering 28(1):04021045. 
  9. Hwang, S., Kim, Y., and Kang, S. 2023. Study of Reliability Analysis Based Power Generation Facilities Maintenance System - Focused on Continuous Ship Unloader -. Journal of the Korean society for Quality Management 51(2):315-327. 
  10. Jardine, A. K., Lin, D., and Banjevic, D. 2006. A review on machinery diagnostics and prognostics implementing condition-based maintenance. Mechanical Systems and Signal Processing 20(7):1483-1510. 
  11. Kim, K., Park, J., Lee, H., Jeong, H., Kim, H., and Kim, H. 2017. A Study on Fault Diagnosis of Boiler Tube Leakage based on Neural Network using Data Mining Technique in the Thermal Power Plant. Transactions of The Korean Institute of Electrical Engineers 66(10):1445-1453. 
  12. Kim, M., Jung, S., Kim, J., Kim, S. 2021. A Study on Fault Detection Method Using Weight-based Local Outlier Factor, Journal of Korean Institute of Intelligent Systems 31(4):273-278 
  13. Korea Energy Economics Institute. KEEI Yearbook of Energy Statistics 2022; Korea Energy Economics Institute: Ulsan, Korea, 2023. 
  14. Lee, H., Hong, Y., and Kang, S. 2021. Identifying process capability index for electricity distribution system through thermal image analysis. Journal of Korean Society for Quality Management 49(3):327-340. 
  15. Lee, S., and Hwang P. 2015. Industrial and Technological Trends of Personal Mobility in Korea and Abroad. Journal of the Korean Society of Automotive Engineers 37(1):36-45. 
  16. Lee, S., and Kim, Y. 2022. A Pre-processing Process Using TadGAN-based Time-series Anomaly Detection. Journal of the Korean society for Quality Management 50(3):459-471. 
  17. Lee, Y., Oh, J., and Kim, G. 2020. Interpretation of load forecasting using explainable artificial intelligence techniques. The Transactions of The Korean Institute of Electrical Engineers 69(3):480-485. 
  18. Lim, C., and Han, S. 2018. A study on development of power grid fault prediction system based on big data and preceding activities to calculate optimal investment cost. Journal of the Korean Data And Information Science Society 29(3):779-794. 
  19. Liu, J., Ma, H., Xie, X., and Cheng, J. 2022. Short text classification for faults information of secondary equipment based on convolutional neural networks. Energies 15(7):2400. 
  20. Machlev, R., Heistrene, L., Perl, M., Levy, K. Y., Belikov, J., Mannor, S., and Levron, Y. 2022. Explainable Artificial Intelligence (XAI) techniques for energy and power systems: Review, challenges and opportunities. Energy and AI 9:100169. 
  21. Meng, Q., Song, Y., Mu, J., Lv, Y., Yang, J., Xu, L., et al. 2023. Electric power audit text classification with multi-grained pre-trained language model. IEEE Access 11:13510-13518. 
  22. Park, J., Jo, H., Lee, S., Oh, S., and Na, M. 2022. A reliable intelligent diagnostic assistant for nuclear power plants using explainable artificial intelligence of GRU-AE, LightGBM and SHAP. Nuclear Engineering and Technology 54(4):1271-1287. 
  23. Rahman, U., and Mahbub, M. 2023. Application of classification models on maintenance records through text mining approach in industrial environment. Journal of Quality in Maintenance Engineering 29(1):203-219. 
  24. Ribeiro, M. T., Singh, S., and Guestrin, C. 2016. "Why should I trust you?" Explaining the predictions of any classifier. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135-1144. 
  25. Sairam, S., Srinivasan, S., Marafioti, G., Subathra, B., Mathisen, G., and Bekiroglu, K. 2020. Explainable incipient fault detection systems for photovoltaic panels. arXiv preprint arXiv:2011.09843. 
  26. Sarker, I. H. 2021. Deep learning: a comprehensive overview on techniques, taxonomy, applications and research directions. SN Computer Science 2(6):420. 
  27. Tian, X., Li, C., and Zhao, B. 2021. A novel classification model SA-MPCNN for power equipment defect text. Transactions on Asian and Low-Resource Language Information Processing 20(6):1-21. 
  28. Vong, C. M., and Du, J. 2020. Accurate and efficient sequential ensemble learning for highly imbalanced multi-class data. Neural Networks 128:268-278. 
  29. Wang, H., Liu, Z., Xu, Y., Wei, X., and Wang, L. 2020. Short text mining framework with specific design for operation and maintenance of power equipment. CSEE Journal of Power and Energy Systems 7(6):1267-1277. 
  30. Zednik, C. 2021. Solving the black box problem: A normative framework for explainable artificial intelligence. Philosophy & Technology 34(2):265-288.