Search | Korea Science

S3M2: Scalable Scheduling for Shared Memory Multiprocessors (공유 메모리 다중 프로세서 시스템을 위한 가변 스케줄링)

Kang, Oh-Han
- The Transactions of the Korea Information Processing Society
- /
- v.7 no.10
- /
- pp.3055-3063
- /
- 2000
In this paper, a task duplication based heuristic scheduling algorithm is proposed to solve the problem of task scheduling on Shared Memory Multiporcessors (SMM). The proposed algorithm pre-allocates network resources so as to avoid potential communication conlhct, and the algorithm uses heuristies to select duplication tasks so as to recuce of a multiprocessors, and generates scheduling accorting to the available number of processors ina system. The proposed algorithm has been applied to some practical task graphs in the simulation, and the results show that the proposed algorithm achieves considerable performance improvement, in respect of schedule length.
PDF

Banked Register File for ARM Thumb to Secure More Registers (다수의 레지스터를 확보하기 위한 ARM Thumb 레지스터 뱅크의 제안)

Lee Je-Hyung;Park Jinpyo;Moon Soo-Mook
- Proceedings of the Korean Information Science Society Conference
- /
- 2005.07a
- /
- pp.781-783
- /
- 2005
ARM 프로세서는 내장형 시스템에서 가장 널리 사용되는 32비트 마이크로 프로세서 중 하나이며, Thumb 명령어 세트는 보다 작은 코드 크기를 위해 제공하는 16비트 확장 명령어 세트이다. Thumb의 약점중의 하나는 줄어든 명령어 길이 때문에 이용할 수 있는 레지스터의 개수가 반으로 줄어든다는 것인데 결과적으로 가용 레지스터의 부족으로 인해 spill 코드가 빈번하게 발생할 수 있다. 우리는 약간의 하드웨어 및 명령어 수정을 통해 뱅크(bank)로 이루어진 레지스터 파일을 제공하고자 한다. 이로 인해 컴파일러는 보다 여유 있는 레지스터를 확보하게 되어 spill 코드가 줄어들게 되므로 보다 작은 크기의 코드를 얻어낼 수 있다. 이 변화된 형태의 레지스터 파일을 운용하기 위한 효율적인 레지스터 할당기법이 요구되며, 제안하는 영역기반 레지스터 할당기법을 통해 이이 최적화된 Thumb 코드 대비 약 $5.1\%$의 코드 크기 감소효과를 볼 수 있었다.
PDF

Enhancing the Performance of Multiple Parallel Applications using Heterogeneous Memory on the Intel's Next-Generation Many-core Processor (인텔 차세대 매니코어 프로세서에서의 다중 병렬 프로그램 성능 향상기법 연구)

Rho, Seungwoo;Kim, Seoyoung;Nam, Dukyun;Park, Geunchul;Kim, Jik-Soo
- Journal of KIISE
- /
- v.44 no.9
- /
- pp.878-886
- /
- 2017
This paper discusses performance bottlenecks that may occur when executing high-performance computing MPI applications in the Intel's next generation many-core processor called Knights Landing(KNL), as well as effective resource allocation techniques to solve this problem. KNL is composed of a host processor to enable self-booting in addition to an existing accelerator consisting of a many-core processor, and it was released with a new type of on-package memory with improved bandwidth on top of existing DDR4 based memory. We empirically verified an improvement of the execution performance of multiple MPI applications and the overall system utilization ratio by studying a resource allocation method optimized for such new many-core processor architectures.
https://doi.org/10.5626/JOK.2017.44.9.878 인용 KSCI

An Effective Task Allocation Algorithm in Two-Dimensional Mesh Interconnection Networks (이차원 메쉬 상호 연결 망에 대한 효율적인 태스크할당 알고리즘)

Chu, Hyeon-Seung;Park, Gyeong-Rin;Yu, Seong-Mu
- The Transactions of the Korea Information Processing Society
- /
- v.6 no.6
- /
- pp.1667-1677
- /
- 1999
An effective allocation of requested number of processors to newly incoming tasks in two-dimensional (2D) mesh interconnection networks is very important for achieving the desired high performance and resource utilization. It also needs to guarantee the complete recognition of the free submeshes based on contiguous and available processors with minimum overhead. An efficient task allocation algorithm for 2D meshes is presented in this paper. By employing a new approach for searching the one-dimensional array, the proposed algorithm can find the available submesh without the scanning of the entire 2D array unlike earlier designs. As a result, the new algorithm can significantly reduce the task allocation time. Comprehensive computer simulation shows that the average allocation time and waiting delay are much smaller than earlier irrespective of the size of meshes. The hardware overhead is comparable to other algorithms.
PDF

A Study on Task Allocation of Parallel Spatial Joins using Fixed Grids (고정 그리드를 이용한 병렬 공간 조인의 태스크 할당에 관한 연구)

Kim, Jin-Deok;Seo, Yeong-Deok;Hong, Bong-Hui
- The KIPS Transactions:PartD
- /
- v.8D no.4
- /
- pp.347-360
- /
- 2001
The most expensive spatial operation in spatial databases is a spatial join which computes a combined table of which tuple consists of two tuples of the two tables satisfying a spatial predicate. Although the execution time of sequential processing of a spatial join has been so far considerably improved, the response time is not tolerable because of not meeting the requirements of interactive users. It is usually appropriate to use parallel processing to improve the performance of spatial join processing. However, as the number of processors increases, the efficiency of each processor decreases rapidly because of the disk bottleneck and the overhead of message passing. This paper proposes the method of task allocation to soften the disk bottleneck caused by accessing the shared disk at the same time, and to minimize message passing among processors. In order to evaluate the performance of the proposed method in terms of the number of disk accesses and message passing, we conduct experiments on the two kinds of parallel spatial join algorithms. The experimental tests on the MIMD parallel machine with shared disks show that the proposed semi-dynamic task allocation method outperforms the static and dynamic task allocation methods.
PDF

A Study on Parallel Processing by Multi-Microprocessors (마이크로프로세서복합에 의한 병렬처리에 관한 연구)

Chung, Yon-Tack;Song, Young-Jae
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.17 no.5
- /
- pp.36-42
- /
- 1980
In this study, multi-microprocessors system in which slave microprocessor is conrlected with master microprocessor bus through the DMA controller is designed by the use of four 8085 CPU. A high degree of processing efficiency could be obtained by making this system work parallel processing. The result of measuring relat ions bet ween working microproressor and system throughput was 70-80 percents lower than ideal value Master microprocessor takes charge of resource allocation and scheduling, common memory assigns communication between microprocessors and a store of common data. The met hod of detecting Pa rallelism from source Program composed by series is also suggested.
PDF

Performance Evaluation of a New Scheduling Algorithm for the Simultaneous MultiThreading Microprocessor (동시 다중 쓰레딩 마이크로프로세서를 위한 스케줄링 알고리즘의 성능 평가)

Lee Jung-Hoon;Kim Jin Suk
- The KIPS Transactions:PartA
- /
- v.12A no.2 s.92
- /
- pp.145-150
- /
- 2005
Recently, many Processor manufacturers have implemented simultaneous multi treading technology, which can simultaneously execute independent threads in one processor cycle, as a way of increasing processor efficiency, ana one particular example is Hyper Threading. Hyper Threading technology, which enables many logical processors to reside a physical processor, differs from the current multiprocessing environment which has many independent processors, and calls for a particular work assignment method optimized for Hyper Threading environment Thus, in this paper, We have proposed a scheduling algorithm compatible with Hyper Threading technology and analyzed its performance using various methods. As a result, we shall expect its efficient performance by properly understanding and managing Hyper Threading system.
https://doi.org/10.3745/KIPSTA.2005.12A.2.145 인용 PDF KSCI

An Extended Real-Time Synchronization Protocols for Shared Memory Multiprocessors (공유메모리 다중 프로세서 실시간 시스템에서의 동기화 프로토콜)

Kang, Seung-Yup;Ha, Rhan
- Proceedings of the Korean Information Science Society Conference
- /
- 1998.10a
- /
- pp.136-138
- /
- 1998
작업들이 자원을 공유하는 경우 예측하기 어려운 지연시간이 발생한다. 다중 프로세서 시스템에서의 자원공유로 인한 지연시간은 더욱 예측하기 어렵다. 실기간 시스템의 스케줄 가능성 검사를 위해서는 이러한 지연시간을 정확히 예측해야한다. 선점가능한 우선순위 구동 CPU 스케줄링 알고리즘에 의해서 다른 우선순위의 작업과의 동기화는 우선순위 역전 문제를 야기한다. 본 논문에서는 다중 프로세서에서의 동기화 프로토콜을 제안하고 작업의 지연시간을 분석한다. 다른 프로세서에 할당된 작업들이 수행중인 자원을 요구할 때, 자원을 수행하는 작업의 우선순위를 높여줌으로써 자원수행을 빠르게 종료하게 한다. 이로 인해 자원에 의한 지연을 최소화한다. 특히, 높은 우선순위 작업의 경우 더욱 작은 지연시간을 갖게한다. 시뮬레이션을 통한 Shared Memory Protocol [5]과의 비교, 분석 결과 성능의 향상을 보임을 알 수 있다. 다양한 작업집합에 대한 지연시간을 분석하였다.
PDF

Theoretical Performance Bounds and Parallelization of a Two-Dimensional Packing Algorithm (이차원 팩킹 알고리즘의 이론적 성능 분석과 병렬화)

Hwang, In-Jae;Hong, Dong-Kweon
- The KIPS Transactions:PartA
- /
- v.10A no.1
- /
- pp.43-48
- /
- 2003
Two-dimensional packing algorithm can be used for allocating submeshes in mesh multiprocessor systems. Previously, we developed an efficient packing algorithm called TP heuristic, and showed how the results of the packing could be used for allocating submeshes. In this paper, we present theoretical performance bounds for TP heuristic. We also present a parallel version of the algorithm that consumes reduced time when it is executed by multiple processors in mesh multiprocessors.
https://doi.org/10.3745/KIPSTA.2003.10A.1.043 인용 PDF KSCI

Rate-based Processor Reservation Technique for Multimedia Applications (멀티미디어 응용을 위한 요구비율 기반 프로세서 예약 기법)

Park, Young-Il;Ha, Rhan
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.04a
- /
- pp.86-88
- /
- 2000
멀티미디어 태스크는 기존 범용 운영 체제의 시분할 스케줄러에서 만족시킬 수 없는 시간적 요구사항을 가진다. 이런 태스크를 기존의 시분할 태스크와 함께 서비스하기 위해서는 새로운 스케줄링 프레임워크가 필요하다. FQ(Fair Queueing)은 태스크의 공유비율에 비례하여 자원을 할당하는 방법으로 이질적인 태스크(멀티미디어 태스크, 일반 시분할 태스크)가 공존하는 개방적인 환경에서의 스케줄링 정책으로 적합하다는 특징이 있다. 본 논문에서는 FQ의 종류인 WFQ(Weighted Fair Queueing)를 개선하여 하나의 스케줄러에서 다른 두 부류의 태스크를 모두 처리하는 요구비율 기반의 프로세서 예약 기법을 제안한다. 실시간 태스크와 시분할 태스크를 처리하기 위해서 실시간 부류의 태스크를 우선적으로 배치하고, 실시간 부류 태스크의 실행 사이에 시분할 태스크를 스케줄하여 실시간 태스크에 대해서 보장된 서비스를, 시분할 태스크에 대해서는 이 태스크에 할당된 예약만큼의 프로세서 시간을 제공한다. 모의 실험에서는 제안한 프로세서 예약 방식이 실시간 태스크와 시분할 태스크를 효율적으로 처리하며 기존의 WFQ보다 더 안정적임을 보인다.
PDF

Search Result 141, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)