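Funding Information
This work was supported by the National Research Foundation of Korea (NRF), funded by the Korea government (Ministry of Science and ICT) (No. RS-2023-00211606), and by the Innovative Human Resource Development for Local Intellectualization program of the Institute of Information & Communications Technology Planning & Evaluation (IITP) (No. IITP-2023-RS-2022-00156389).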