DOI QR코드

DOI QR Code

Hazelcast Vs. Ignite: Opportunities for Java Programmers

  • Maxim, Bartkov (RooX Solutions Java Team Lead) ;
  • Tetiana, Katkova (University of Customs and Finance, Department of Innovative Technologies, Department of Cyber security) ;
  • S., Kruglyk Vladyslav (Bogdan Khmelnitsky Melitopol State Pedagogical University, Faculty of Informatics, Mathematics and Economics, Department of Informatics and Cybernetics) ;
  • G., Murtaziev Ernest (Bogdan Khmelnitsky Melitopol State Pedagogical University, Faculty of Informatics, Mathematics and Economics, Department of Mathematics and Physics) ;
  • V., Kotova Olha (Kherson State University, Faculty of Computer Science, Physics and Mathematics, Department of Algebra, Geometry and Mathematical Analysis)
  • 투고 : 2022.02.05
  • 발행 : 2022.02.28

Abstract

Storing large amounts of data has always been a big problem from the beginning of computing history. Big Data has made huge advancements in improving business processes by finding the customers' needs using prediction models based on web and social media search. The main purpose of big data stream processing frameworks is to allow programmers to directly query the continuous stream without dealing with the lower-level mechanisms. In other words, programmers write the code to process streams using these runtime libraries (also called Stream Processing Engines). This is achieved by taking large volumes of data and analyzing them using Big Data frameworks. Streaming platforms are an emerging technology that deals with continuous streams of data. There are several streaming platforms of Big Data freely available on the Internet. However, selecting the most appropriate one is not easy for programmers. In this paper, we present a detailed description of two of the state-of-the-art and most popular streaming frameworks: Apache Ignite and Hazelcast. In addition, the performance of these frameworks is compared using selected attributes. Different types of databases are used in common to store the data. To process the data in real-time continuously, data streaming technologies are developed. With the development of today's large-scale distributed applications handling tons of data, these databases are not viable. Consequently, Big Data is introduced to store, process, and analyze data at a fast speed and also to deal with big users and data growth day by day.

Keywords

References

  1. Zaki, A. K.: NoSQL databases: new millennium database for big data, big users, cloud computing and its security challenges. International Journal of Research in Engineering and Technology (IJRET), vol. 3, no. 15, pp. 403-409 (2014) https://doi.org/10.15623/ijret.2014.0315080
  2. Madden, S.: From databases to big data. IEEE Internet Computing, vol. 16, no. 3, pp. 4-6 (2012). https://doi.org/10.1109/MIC.2012.50
  3. Sagiroglu, S., & Sinanc, D.: Big data: A review. In: 2013 International Conference on Collaboration Technologies and Systems (CTS). IEEE. (2013, May).
  4. Machine learning and big data. In: Edureka, Big Data and Hadoop. Retrieved October 22, 2021, from Edureka.co website: https://www.edureka.co/blog/machine-learning-and-big-data/(2019, September 24).
  5. Ishwarappa, & Anuradha, J.: A brief introduction on big data 5Vs characteristics and Hadoop technology. Procedia Computer Science, vol. 48, pp. 319-324 (2015) https://doi.org/10.1016/j.procs.2015.04.188
  6. Zhang, Y., Geng, P., Sivaparthipan, C. B., & Muthu, B. A.: Big data and artificial intelligence based early risk warning system of fire hazard for smart cities. Sustainable Energy Technologies and Assessments, vol. 45, no. 100986, p. 100986 (2021) https://doi.org/10.1016/j.seta.2020.100986
  7. Li, V. O., Lam, J. C., Han, Y., & Chow, K. A big data and artificial intelligence framework for smart and personalized air pollution monitoring and health management in Hong Kong. Environmental Science & Policy, vol. 124, pp. 441-450 (2021). https://doi.org/10.1016/j.envsci.2021.06.011
  8. Khan, Z. F., & Alotaibi, S. R.: Applications of artificial intelligence and big data analytics in m-health: A healthcare system perspective. Journal of Healthcare Engineering, vol. 2020, p. 8894694 (2020)
  9. Beam, A. L., & Kohane, I. S.: Big data and machine learning in health care. JAMA: The Journal of the American Medical Association, vol. 319, no. 13, pp. 1317- 1318, (2018) https://doi.org/10.1001/jama.2017.18391
  10. Asharf, J., Moustafa, N., Khurshid, H., Debie, E., Haider, W., & Wahab, A.: A review of intrusion detection systems using machine and deep learning in Internet of Things: Challenges, solutions and future directions. Electronics, vol. 9, no. 7, p. 1177 (2020) https://doi.org/10.3390/electronics9071177
  11. Puentes, F., Perez-Godoy, M. D., Gonzalez, P., & Del Jesus, M. J.: An analysis of technological frameworks for data streams. Progress in Artificial Intelligence, vol. 9, no. 3, pp. 239-261 (2020) https://doi.org/10.1007/s13748-020-00210-6
  12. Garcia-Gil, D., Ramirez-Gallego, S., Garcia, S., & Herrera, F.: A comparison on scalability for batch big data processing on apache spark and apache flink. Big Data Analaytics, vol. 2, no. 1 (2017)
  13. Samosir, J., Indrawan-Santiago, M., & Haghighi, P. D.: An evaluation of data stream processing systems for data driven applications. Procedia Computer Science, no. 80, pp. 439-449 (2016)
  14. Fernandes, E., Salgado, A., & Bernardino, J. : Big data streaming platforms to support real-time analytics. In: Proceedings of the 15th International Conference on Software Technologies. SCITEPRESS - Science and Technology Publications (2020)
  15. In-memory database: Retrieved October 22, 2021, from Hazelcast.com website: https://hazelcast.com/glossary/in-memory-database/ (2019, July 17).
  16. Sojoodi, A. H., Salimi Beni, M., & Khunjush, F.: Ignite-GPU: a GPU-enabled in-memory computing architecture on clusters. The Journal of Supercomputing, vol. 77(3), pp. 3165-3192 (2021) https://doi.org/10.1007/s11227-020-03390-z
  17. In-memory data grid with apache ignite. The Apache Software Foundation. Retrieved October 22, 2021, from Apache.org website: https://ignite.apache.org/use-cases/inmemory-data-grid.html (2021)
  18. Ivanov, N.: In-memory data grids vs. in-memory databases. Retrieved October 22, 2021, from Infoworld.com website: https://www.infoworld.com/article/3300747/in-memorydata-grids-vs-in-memory-databases.html (2018, August 29).
  19. Bryant, D.: John DesJardins on in-memory data grids, stream processing, and app modernization. Retrieved October 22, 2021, from InfoQ website: https://www.infoq.com/podcasts/imdg-streamprocessing/(2020, September 14).
  20. The Apache Software Foundation. Data loading and streaming. Retrieved October 22, 2021, from Apache.org website: https://ignite.apache.org/features/streaming.html (2021)
  21. Gencer, C., Topolnik, M., Durina, V., Demirci, E., Kahveci, E. B., Lukas, A. G. O., … Katsifodimos, A.: Hazelcast Jet: Low-latency stream processing at the 99.99th percentile. arXiv [cs.DC]. Retrieved from http://arxiv.org/abs/2103.10169 (2021)
  22. In-memory data grid: A complete overview - hazelcast. Retrieved October 22, 2021, from Hazelcast.com website: https://hazelcast.com/glossary/in-memory-data-grid/ (2019, November 9)
  23. Guroob, A. H., & Manjaiah, D. H.: Big Data-based In-Memory Data Grid (IMDG) Technologies: challenges of implementation by analytics tools. International Journal of Emerging Research in Management &Technology, vol. 6(5), pp. 829-834 (2017)
  24. Kosandiak, I.: Spring Boot with Hazelcast. Retrieved October 22, 2021, from Medium website: https://ihorkosandiak.medium.com/spring-boot-with-hazelcast-b04d13927745 (2018, September 5)
  25. Why IMDG. Retrieved October 22, 2021, from Hazelcast.org website: https://hazelcast.org/imdg/why/ (2019, August 5)
  26. Frankel, N. (2021, April 16). Hazelcast, from embedded to client-server. Retrieved October 22, 2021, from Dev.to website: https://dev.to/hazelcast/hazelcast-from-embedded-to-client-server-3j7j
  27. Fernandes, E., Salgado, A., & Bernardino, J.: Big data streaming platforms to support real-time analytics. Proceedings of the 15th International Conference on Software Technologies. SCITEPRESS - Science and Technology Publications (2020)
  28. Puentes, F., Perez-Godoy, M. D., Gonzalez, P., & Del Jesus, M. J.: An analysis of technological frameworks for data streams. Progress in Artificial Intelligence, vol. 9(3), pp. 239-261 (2020) https://doi.org/10.1007/s13748-020-00210-6
  29. Gupta, H. K., & Rafat Parveen, D.: Comparative study of big data frameworks. 2019 International Conference on Issues and Challenges in Intelligent Computing Techniques (ICICT). IEEE (2019)
  30. Alkatheri, S., Abbas, S., & Siddiqui, M.: A comparative study of big data frameworks. International Journal of Computer Science and Information Security (IJCSIS), vol.17(1), pp. 66-73 (2019)
  31. Singh, D., & Reddy, C. K.: A survey on platforms for big data analytics. Journal of Big Data, vol. 2(1), pp. 8 (2015) https://doi.org/10.1186/s40537-014-0008-6