DOI QR코드

DOI QR Code

Application Characteristic-based Divided Scheduling for Multicore Systems

  • Park, Jung Kyu (Dept. of Digital Media Design and Applications, Seoul Women's University) ;
  • Kim, Jaeho (School of Electrical and Computer Engineering, UNIST)
  • Received : 2017.05.12
  • Accepted : 2017.06.19
  • Published : 2017.06.30

Abstract

In this paper, we proposed a novel user-level scheduling scheme that monitors applications characteristics on-line using PMU and allocates applications into cpu cores. We utilize PMU (Performance Monitoring Unit) to analyze which shared resource has the strongest relation with the influence. Using the proposed scheduling method, it is possible to reduce the contention of shared resources. The key idea of this scheme is separating high-influential applications into different processors. The evaluation results have shown that the proposed scheduling scheme can enhance the performance up to 12% for a 8 core system and up to 25% for a 28 core system, respectively.

Keywords

References

  1. J. Rao, K. Wang, X. Zhou, and C. Xu, "Optimizing virtual machine scheduling in NUMA multicore systems," in Proc. of the 19th International Symposium on High Performance Computer Architecture, Feb. 2013.
  2. D. Levinthal, "Performance Analysis Guide for Intel Core i7 Processor and Intel Xeon 5500 Processor," https://software.intel.com/, 2009.
  3. INTEL., "Intel 64 and IA-32 Architectures Optimization Reference Manual," https://www.intel.com.
  4. S. Blagodurov, S. Zhuravlev, M. Dashti, and A. Fedorova, "A case for NUMA-aware contention management on multicore systems," in Proc. of the USENIX Annual Technical Conference, June, 2011.
  5. S. Zhuravlev, S. Blagodurov, and A. Fedorava, "Addressing shared resource contention in multicore processors via scheduling," in Proc. of the 15th International Conference on Architectural Support for Programming Languages and Operating Systems, Mar. 2010.
  6. R. Lachaize, B. Lepers, and V. Quema, "MemProf: a memory profiler for NUMA multicore systems," in Proc. of the USENIX Annual Technical Conference, June, 2012.
  7. M. Dasthi, A. Fedorova, J. Funston, F. Gaud, R. Lachize, B. Lepers, V. Quema, and M. Roth, M, "Traffic management: A holistic approach to memory placement on NUMA systems," in Proc. of the 18th international Conference on Architectural Support for Programming Languages and Operating Systems, Mar. 2013.
  8. B. Lepers, V. Quema, and A. Fedorova, "Thread and memory placement on NUMA systems: Asymmetry matters," in Porc. of the USENIX Annual Technical Conference, June. 2015.
  9. A. Baumann, P. Barham, P. Dagand, T. Harris, R. Isaacs, S. Peter, T. Rroscoe, A. Schpbach, and A. Singhania, "The multikernel: A new OS architecture for scalable multicore systems," in Proc. of the 22th ACM Symposium on Operating Systems Principles, Oct. 2009.
  10. M. Liu, and T. Li, "Optimizing virtual machine consolidation performance on NUMA server architecture for cloud workloads," in Proc. of the 41th International Symposium on Computer Architecture, June. 2014.
  11. Z. Majo, and T. R. Gross, "Memory system performance in a NUMA multicore multiprocessor," in Proc. of the 4th Annual International Systems and Storage Conference, May, 2011.
  12. S. Boyd-Wickizer, A. T. Clements, Y. Mao, A. Pesperrev, F. Kaashoek, R. Morris, and N. Zeldovich, "An analysis of Linux scalability to many cores," in Proc. of the 9th Symposium on Operating Systems Design and Implementation Oct, 2010.
  13. M. V. Weaver, "Linux perf event features and overhead," in the 2nd International Workshop on Performance Analysis of Workload Optimized Systems, Apr. 2013.
  14. D. Eklov, N. Nikoleris, D. Black-Schaffer, D, and Hagersten, "Bandwidth bandit: Quantitative characterization of memory contention," in Proc. of the 11th Annual IEEE/ACM International Symposium on Code Generation and Optimization, Feb. 2013.
  15. G. W. Lee, "Energy-Efficient Fault-Tolerant Scheduling based on Duplicated Executions for Real-Time Tasks on Multicore Processors ," KSCI, vol. 19, no 5, May, 2014.