DOI QR코드

DOI QR Code

Evaluating Computational Efficiency of Spatial Analysis in Cloud Computing Platforms

클라우드 컴퓨팅 기반 공간분석의 연산 효율성 분석

  • 최창락 (경희대학교 이과대학 지리학과) ;
  • 김예린 (경희대학교 이과대학 지리학과) ;
  • 홍성연 (경희대학교 이과대학 지리학과)
  • Received : 2018.10.25
  • Accepted : 2018.12.13
  • Published : 2018.12.31

Abstract

The increase of high-resolution spatial data and methodological developments in recent years has enabled a detailed analysis of individual experiences in space and over time. However, despite the increasing availability of data and technological advances, such individual-level analysis is not always possible in practice because of its computing requirements. To overcome this limitation, there has been a considerable amount of research on the use of high-performance, public cloud computing platforms for spatial analysis and simulation. The purpose of this paper is to empirically evaluate the efficiency and effectiveness of spatial analysis in cloud computing platforms. We compare the computing speed for calculating the measure of spatial autocorrelation and performing geographically weighted regression analysis between a local machine and spot instances on clouds. The results indicate that there could be significant improvements in terms of computing time when the analysis is performed parallel on clouds.

휴대용 기기와 다양한 위치 기반 서비스의 확산으로 공간데이터의 양적 팽창이 가속화됨에 따라 대용량의 공간데이터를 효율적으로 다룰 수 있는 기술의 중요성이 점차 커지고 있다. 클라우드 컴퓨팅은 인터넷을 통해 스토리지, 메모리, 애플리케이션 등 다양한 전산 자원을 공유할 수 있는 서비스 환경으로, 최근 이를 활용해 대용량의 공간데이터를 처리, 분석하는 방법과 그 필요성에 관한 연구가 활발히 수행되어 왔다. 그러나 아직까지 대용량 공간데이터의 분석에 클라우드 컴퓨팅 플랫폼을 활용했을 때 어느 정도의 성능 향상을 기대할 수 있는지에 대한 실증적 연구는 비교적 많이 이루어지지 않았으며, 본 연구의 목표는 이러한 논의의 공백을 채우는 것이다. 이를 위해 연구에서는 클라우드 컴퓨팅 플랫폼에서 병렬 연산을 사용했을 때 모란지수와 지리가중회귀분석의 연산 속도가 어느 정도 향상되는지 살펴보았으며, 그 결과를 통해 클라우드 컴퓨팅을 활용한 공간분석의 효율성을 평가하였다. 실험 결과, 중앙처리장치의 클록 수가 더 높은 로컬 컴퓨터에 비해 병렬 연산에 적합한 환경을 갖춘 공용 클라우드 컴퓨팅 플랫폼에서 좀 더 효율적인 연산이 가능했으며, 데이터의 규모가 클수록 격차가 더욱 크게 나타났다.

Keywords

References

  1. Armbrust, M., A. Fox, R. Griffith, A.D. Joseph, R.H. Katz, A. Konwinski, G. Lee, D.A. Patterson, A. Rabkin, I. Stoica and M. Zaharia. 2009. Above the clouds: a berkeley view of cloud computing. Technical Report No. UCB/ EECS-2009 -28.
  2. Azad, A. and A. Buluc. 2017. A workefficient parallel sparse matrix-sparse vector multiplication algorithm. Proceedings of the 2017 IEEE International Parallel and Distributed Processing Symposium. Florida, FL, USA, 31 May 2017. pp.688 -697.
  3. Cho, Y.I. 2013. Understanding big data and its major issues. Journal of Korean Associastion for Regional Information Society 16(3):43-65.
  4. Choi, J.G. and B.N. Noh. 2011. Security technology research in cloud computing environment. Journal of Security Engineering 8(3):371-384.
  5. Cleveland, W.S. 1979. Robust locally weighted regression and smoothing scatterplots. Journal of the American Statistical Association 74(368):829-836. https://doi.org/10.1080/01621459.1979.10481038
  6. Cleveland, W.S. and S.J. Devlin. 1988. Locally weighted regression: an approach to regression analysis by local fitting. Journal of the American Statistical Association 83(403):596-610. https://doi.org/10.1080/01621459.1988.10478639
  7. Fang, C., C.J. Yang, Z. Chen, X.J. Yao and H.T. Guo. 2011. Parallel algorithm for viewshed analysis on a modern GPU. International Journal of Digital Earth 4(6):471-486. https://doi.org/10.1080/17538947.2011.555565
  8. Garfinkel, S. 2007. An evaluation of Amazon's grid computing services: EC2, S3, and SQS. Harvard Computer Science Group Technical Report TR-08 -07.
  9. Goodchild, M.F. 2007. Citizens as sensors: the world of volunteered geography. GeoJournal 69(4):211-221. https://doi.org/10.1007/s10708-007-9111-y
  10. Guan, Q. and K.C. Clarke. 2010. A general -purpose parallel raster processing programming library test application using a geographic cellular automata model. International Journal of Geographical Information Science 24(5):695-722. https://doi.org/10.1080/13658810902984228
  11. Haut, J.M., M. Paoletti, J. Plaza and A. Plaza. 2017. Cloud implementation of the K-means algorithm for hyperspectral image analysis. The Journal of Supercomputing. 73(1):514-529. https://doi.org/10.1007/s11227-016-1896-3
  12. Healey, R., S. Dowers, B. Gittings and M. J. Mineter. 1997. Parallel processing algorithms for GIS. CRC Press. Florida, FL, USA. 460pp.
  13. Jang, E.Y. and C.S. Park. 2011. A study of modeling and simulation for the availability optimization of cloud computing service. Journal of the Korea Society for Simulation 20(1):1-8. https://doi.org/10.9709/JKSS.2011.20.1.001
  14. Kim, T., I. Kim, C. Min and Y.I. Eom. 2012. Trends in cloud computing security technology. Communications of the Korean Institute of Information Scientists and Engineers 30(1):30-38.
  15. Kitchin, R. 2013. Big data and human geography: opportunities, challenges and risks. Dialogues in human geography 3(3):262-267. https://doi.org/10.1177/2043820613513388
  16. Leavitt, N. 2009. Is cloud computing really ready for prime time?. Computer 42(1): 15-20. https://doi.org/10.1109/MC.2009.20
  17. Lee, K.H., H. Choi and Y.D. Chung. 2011. Massive data processing and management in cloud computing: a survey. Journal of KISE 38(2):104-125.
  18. Li, Z., A.S. Fotheringham, W. Li and T. Oshan. 2018. Fast Geographically Weighted Regression (FastGWR): a scalable algorithm to investigate spatial process heterogeneity in millions of observations. International Journal of Geographical Information Science. 33(1):155-175.
  19. Moran, P.A. 1950. Notes on continuous stochastic phenomena. Biometrika 37 (1/2):17-23. https://doi.org/10.1093/biomet/37.1-2.17
  20. Park, J.M., M.H. Lee, D.B. Shin and J.W. Ahn. 2015. Deduction of the policy issues for activating the geo-spatial big data services. Journal of Korea Spatial Information Society 23(6):19-29.
  21. Quinn, M.J. 1987. Designing efficient algorithms for parallel computers. McGraw -Hill, Inc. New York, NY, USA. 288pp.
  22. Tang, W. and W. Feng. 2017. Parallel map projection of vector-based big spatial data: coupling cloud computing with graphics processing units. Computers, Environment and Urban Systems 61:187-197. https://doi.org/10.1016/j.compenvurbsys.2014.01.001
  23. Turton, I. and S. Openshaw. 1998. Highperformance computing and geography: Developments, issues, and case studies. Environment and Planning A 30(10): 1839-1856. https://doi.org/10.1068/a301839
  24. Wang, Y., S. Wang and D. Zhou. 2009. Retrieving and indexing spatial data in the cloud computing environment. In: Jaatun, M.G., G. Zhao and C. Rong(ed.). Cloud Computing. Springer Berlin Heidelberg. Berlin, pp.322-331.
  25. Xiaoqiang, Y. and D. Yuejin. 2010. Exploration of cloud computing technologies for geographic information services. Proceedings of the 18th International Conference on Geoinformatics. Beijing, China, 18-20 June 2010. pp.1-5.
  26. Yang, C., M. Goodchild, Q. Huang, D. Nebert, R. Raskin, Y. Xu, M. Bambacus and D. Fay. 2011. Spatial cloud computing: how can the geospatial sciences use and help shape cloud computing?. International Journal of Digital Earth 4(4):305-329. https://doi.org/10.1080/17538947.2011.587547
  27. Yang, C., Q. Huang, Z. Li, K. Liu and F. Hu. 2017. Big Data and cloud computing: innovation opportunities and challenges. International Journal of Digital Earth 10(1):13-53. https://doi.org/10.1080/17538947.2016.1239771
  28. Yang, C., Y. Xu and D. Nebert. 2013. Redefining the possibility of digital Earth and geosciences with spatial cloud computing. International Journal of Digital Earth 6(4):297-312. https://doi.org/10.1080/17538947.2013.769783
  29. Yue, P., H. Zhou, J. Gong and L. Hu. 2013. Geoprocessing in cloud computing platforms -a comparative analysis. International Journal of Digital Earth 6(4):404-425. https://doi.org/10.1080/17538947.2012.748847
  30. Zhang, J., S. You and L. Gruenwald. 2016. High-performance polyline intersection based spatial join on GPU-accelerated clusters. Proceedings of 2016 ACM SIGSPATIAL International Workshop on Analytics for Big Geospatial Data. San Francisco, CA, USA, 31 October 2010. pp.1-8.
  31. Zhang, Y., X. Zheng, Z. Wang, G. Ai and Q. Huang. 2018. Implementation of a Parallel GPU-Based Space-Time Kriging Framework. ISPRS International Journal of Geo-Information. 7(5):193-205 https://doi.org/10.3390/ijgi7050193
  32. Zhao, Y.L., A. Padmanabhan and S.W. Wang. 2013. A parallel computing approach to viewshed analysis of large terrain data using graphics processing units. International Journal of Geographical Information Science 27(2):363-384. https://doi.org/10.1080/13658816.2012.692372
  33. Zhou, X., C. Xu and B. Kimmons. 2015. Detecting tourism destinations using scalable geospatial analysis based on cloud computing platform. Computers, Environment and Urban Systems 54:144 -153. https://doi.org/10.1016/j.compenvurbsys.2015.07.006