Acknowledgement
This work was supported by the R&D Program of the Ministry of Trade, Industry, and Energy (MOTIE) and Korea Evaluation Institute of Industrial Technology (KEIT). (20023805, RS-2022-00155731, RS-2023-00232192)
References
- ISO/IEC AWI TR 42106, "Information Technology - Artificial Intelligence - Overview of Differentiated Benchmarking of AI System Quality Characteristics," https://www.iso.org/standard/86903.html
- ISO/IEC 25059:2023, "Software Engineering - Systems and Software Quality Requirements and Evaluation (SQuaRE) - Quality Model for AI Systems," https://www.iso.org/standard/80655.html
- ISO/IEC TR 29119-11:2020, "Software and Systems Engineering - Software Testing - Part 11: Guidelines on the Testing of AI-based Systems," https://www.iso.org/standard/79016.html
- DIN SPEC 92001-2, "Artificial Intelligence - Life Cycle Processes and Quality Requirements - Part 2: Robustness," https://www.din.de/en/wdc-beuth:din21:330011015
- IEEE 2937-2022, "IEEE Standard for Performance Benchmarking for Artificial Intelligence Server Systems," https://standards.ieee.org/ieee/2937/10376
- ISO/IEC TR 24029-1:2021, "Artificial Intelligence (AI) - Assessment of the Robustness of Neural Networks - Part 1: Overview," https://www.iso.org/standard/77609.html
- J. Ha, J. Seo, and S. Lee, "Living Lab and Confusion Matrix for Performance Improvement and Evaluation of Artificial Intelligence System in Life Environment," j.inst.Korean. electr.electron.eng., vol.24, no.4, pp.1180-1183, 2020. DOI: 10.7471/ikeee.2020.24.4.1180