DOI QR코드

DOI QR Code

A Case Study of Rapid AI Service Deployment - Iris Classification System

  • Yonghee LEE (Department of AI Software, Shingu College)
  • Received : 2023.10.24
  • Accepted : 2023.11.09
  • Published : 2023.12.30

Abstract

The flow from developing a machine learning model to deploying it in a production environment suffers challenges. Efficient and reliable deployment is critical for realizing the true value of machine learning models. Bridging this gap between development and publication has become a pivotal concern in the machine learning community. FastAPI, a modern and fast web framework for building APIs with Python, has gained substantial popularity for its speed, ease of use, and asynchronous capabilities. This paper focused on leveraging FastAPI for deploying machine learning models, addressing the potentials associated with integration, scalability, and performance in a production setting. In this work, we explored the seamless integration of machine learning models into FastAPI applications, enabling real-time predictions and showing a possibility of scaling up for a more diverse range of use cases. We discussed the intricacies of integrating popular machine learning frameworks with FastAPI, ensuring smooth interactions between data processing, model inference, and API responses. This study focused on elucidating the integration of machine learning models into production environments using FastAPI, exploring its capabilities, features, and best practices. We delved into the potential of FastAPI in providing a robust and efficient solution for deploying machine learning systems, handling real-time predictions, managing input/output data, and ensuring optimal performance and reliability.

Keywords

Acknowledgement

This paper was supported by Shingu College.

References

  1. Bansal, P. and Ouda, A., (2022). Study on Integration of FastAPI and Machine Learning for Continuous Authentication of Behavioral Biometrics, 2022 International Symposium on Networks, Computers and Communications (ISNCC), Shenzhen, China, 2022, pp. 1-6, doi: 10.1109/ISNCC55209.2022.9851790.
  2. Dani, H., Bhople, P., Waghmare, H., Munginwar, K., Patil, A., (2022). Review on Frameworks Used for Deployment of Machine Learning Model, International Journal for Research in Applied Science & Engineering Technology (IJRASET): Vol. 10 Issue II Feb 2022, doi: https://doi.org/10.22214/ijraset.2022.40222
  3. Jain, S. and Kumar, S., (2023). Cost Effective Generic Machine Learning Operation: A Case Study, 2023 International Conference on Data Science and Network Security (ICDSNS), Tiptur, India, 2023, pp. 1-6, doi: 10.1109/ICDSNS58469.2023.10245408.
  4. Kang, S., Choi, J., Kang, M., (2022). Classification Model and Crime Occurrence City Forecasting Based on Random Forest Algorithm, Korean Journal of Artificial Intelligence, 10(1), (2022), 21-25. doi: http://dx.doi.org/10.24225/kjai.2022.10.1.21
  5. Kumari, S. and Rath, S. K., (2015). "Performance comparison of SOAP and REST based Web Services for Enterprise Application Integration," 2015 International Conference on Advances in Computing, Communications and Informatics (ICACCI), Kochi, India, 2015, pp. 1656-1660, doi: 10.1109/ICACCI.2015.7275851.
  6. Lathkar, M., (2023). Getting Started with FastAPI. In: High-Performance Web Apps with FastAPI. Apress, Berkeley, CA. https://doi.org/10.1007/978-1-4842-9178-8_2
  7. Snodgrass, J. E., Milkov, A., (2020). Web-based machine learning tool that determines the origin of natural gases, Computers & Geosciences, 145, December 2020, doi: https://doi.org/10.1016/j.cageo.2020.104595. (https://www.sciencedirect.com/science/article/pii/S0098300420305793)
  8. Song, J., Cai, J., Li, R., and Li, Y., (2023). "Design and Implementation of Scientific Research Achievement Transformation System," 2023 IEEE/ACIS 21st International Conference on Software Engineering Research, Management and Applications (SERA), Orlando, FL, USA, 2023, pp. 407-412, doi: 10.1109/SERA57763.2023.10197696.
  9. Song J, Kook J., (2022) Mapping Server Collaboration Architecture Design with OpenVSLAM for Mobile Devices. Applied Sciences. 2022; 12(7):3653. https://doi.org/10.3390/app12073653
  10. Turing, Python FastAPI vs Flask: A Detailed Comparison, Retrieved from https://www.turing.com/kb/fastapi-vs-flask-a-detailed-comparison
  11. Voron, F., (2022). Building Data Science Applications with FastAPI: Develop, manage, and deploy efficient machine learning applications with Python, Packt Publishing, 2022