Search | Korea Science

An Adaptive Sequential Prefetching using Traffic Information in Shared-Memory Multiprocessors (공유메모리 다중처리기에서 상호연결망의 통신량을 고려하는 선인출 기법)

박정우;손영철;정한조;맹승렬
- Proceedings of the Korean Information Science Society Conference
- /
- 2000.04a
- /
- pp.633-635
- /
- 2000
상호연결망을 기반으로 하는 공유메모리 다중처리기의 성능은 공유메모리 접근 속도에 많은 영향을 받는다. 선인출 기법은 프로세서의 계산과 데이터의 접근을 중첩시켜 메모리의 접근 속도를 줄인다. 기존의 선인출 기법들은 캐쉬미스 양을 줄이는 것만을 생각하여 상호연결망의 상황을 고려하지 않은 문제점이 있다. 본 논문에서는 응답이 늦은 선인출 이용하여 선인출 양을 조절함으로써 상호연결망의 경쟁을 줄이는 새로운 선인출 기법을 제안하고 프로그램 구동 모의실험을 통해 기존의 선인출 기법[1]에 비해 더 좋은 성능을 나타냄을 보인다.
PDF

Task Duplication Based Clustering and Scheduling on Symmetric Multiprocessor Systems (대칭형 다중프로세서 시스템에서 태스크 중복기반의 클러스터링과 스케줄링)

강오한;조경미;김기남;김시관
- Proceedings of the Korean Information Science Society Conference
- /
- 2003.04a
- /
- pp.97-99
- /
- 2003
대칭형 다중프로세서 (SMP: Symmetric Multiprocessors) 시스템은 고성능의 병렬 연산을 위한 중요하고 효과적인 기반환경을 제공하고 있다. SMP에서 태스크 클러스터링과 스케줄링 기법은 시스템의 성능에 큰 영향을 미친다. 본 논문에서는 버스 기반의 SMP에서 사용할 수 있는 태스크 중복 기반의 클러스터링과 스케줄링 기법을 소개한다. 본 논문에서 제안한 클러스터링 기법에서는 휴리스틱을 사용하여 중복할 태스크를 선택한 후 프로세서에 할당하고, 스케줄링 기법에서는 잠재하는 통신 충돌을 방지하기 위하여 네트워크 통신 자원을 사전에 할당한다. 새로운 클러스터링과 스케줄링 기법의 성능을 확인하기 위하여 시뮬레이션에서는 통신비용의 변화에 대한 병렬연산시간을 비교하였다.
PDF

A Guaranteed Real-time Scheduling Algorithm for (m,k)-firm Deadlines Constrained Tasks on Multiprocessors (멀티프로세서에서 (m, k)-firm Deadline 을 가지는 태스크를 위한 실시간 스케줄링 알고리즘)

Kong, Yeonhwa;Cho, Hyeonjoong
- Proceedings of the Korea Information Processing Society Conference
- /
- 2010.11a
- /
- pp.1529-1532
- /
- 2010
본 논문은 동종의 멀티코어에서 (m,k)-firm Deadline 을 가지는 태스크를 위한 실시간 시스템의 스케줄링 기법을 제안한다. 본 논문에서 제안된 알고리즘의 목적은 (m, k)-firm Deadline 을 만족시키는 확률을 증가시켜 최대의 Quality of Service 를 제공하는 것이다. 본 논문에서는 제안된 알고리즘이 QoS 를 보장함을 분석적으로 보이고 실험을 통해 알고리즘의 효율성을 검증한다.
https://doi.org/10.3745/PKIPS.y2010m11a.1529 인용 PDF

A Design Pattern-Oriented Real-Time Scheduling Simulator for Multiprocessors (디자인 패턴 지향 다중 프로세서를 위한 실시간 스케줄링 시뮬레이터)

Lee, Chong-Hyeon;Cho, Hyeonjoong
- Proceedings of the Korea Information Processing Society Conference
- /
- 2009.11a
- /
- pp.107-108
- /
- 2009
본 논문은 다중 프로세서에서의 다양한 실시간 스케줄링 기법을 지원하기 위해 객체 지향적으로 설계된 시뮬레이터를 제안한다. 다중 프로세서 기반 실시간 스케줄링의 특징을 반영한 설계 원칙에 따라 디자인 패턴을 적용하여 시뮬레이터의 재사용성과 확장성을 높였다.
https://doi.org/10.3745/PKIPS.y2009m11a.107 인용 PDF

Data Communication Prediction Model in Multiprocessors based on Robust Estimation (로버스트 추정을 이용한 다중 프로세서에서의 데이터 통신 예측 모델)

Jun Janghwan;Lee Kangwoo
- The KIPS Transactions:PartA
- /
- v.12A no.3 s.93
- /
- pp.243-252
- /
- 2005
This paper introduces a noble modeling technique to build data communication prediction models in multiprocessors, using Least-Squares and Robust Estimation methods. A set of sample communication rates are collected by using a few small input data sets into workload programs. By applying estimation methods to these samples, we can build analytic models that precisely estimate communication rates for huge input data sets. The primary advantage is that, since the models depend only on data set size not on the specifications of target systems or workloads, they can be utilized to various systems and applications. In addition, the fact that the algorithmic behavioral characteristics of workloads are reflected into the models entitles them to model diverse other performance metrics. In this paper, we built models for cache miss rates which are the main causes of data communication in shared memory multiprocessor systems. The results present excellent prediction error rates; below $1\%$ for five cases out of 12, and about $3\%$ for the rest cases.
https://doi.org/10.3745/KIPSTA.2005.12A.3.243 인용 PDF KSCI

Directory Cache Coherence Scheme using the Number-Balanced Binary Tree (수 평형 이진트리를 이용한 디렉토리 캐쉬 일관성 유지 기법)

Seo, Dae-Wha
- The Transactions of the Korea Information Processing Society
- /
- v.4 no.3
- /
- pp.821-830
- /
- 1997
The directory-based cache coherence scheme is an attractive approach to solve the caceh coherence problem in a large-scale shared-memory multiprocessor.However, the exsting directory-based schemes have some problens such as the enormous storage overhead for a directory, the long invalidation latency, the heavy network condes-tion, and the low scalability.For resolving these problems, we propose a new directroy- based caceh coherence scheme which is suitable for building scalable, shred-memory multiprocessors.In this scheme, each directory en-try ofr a given memory block is a number-balanced binaty tree(NBBT) stucture.The NBBT has several proper-ties to effciently maintain the directory for the cache consistency such that the shape is unique, the maximum depth is [log$_2$n], and the tree has the minimum number of leaf nodes among the binarry tree with n nodes.Therefore, this scheme can reduce the storage overhead, the network traffic, and the inbalidation latency and can ensutr the high- scalability the large-scale shared-memory multiprocessors.
PDF

Fast QR Factorization Algorithms of Toeplitz Matrices based on Stabilized / Hyperbolic Householder Transformations (하우스홀더 변환법을 이용한 토플리즈 행렬의 빠른 QR 인수분해 알고리즘)

Choi, Jae-Young
- The Transactions of the Korea Information Processing Society
- /
- v.5 no.4
- /
- pp.959-966
- /
- 1998
We present fast QR factorization algorithms $m{\times}n\;(m{\geq}n)$ Toeplitz matrix. These QR factorization algortihms are determined from the shift-invariance properties of underlying matrices. The major transformation tool is a stabilized/hyperbolic Householder transformation. The algortihms require O(mn) operations, and can be easily implemented on distributed-memory multiprocessors.
PDF

An Extended Real-Time Synchronization Protocols for Shared Memory Multiprocessors (공유메모리 다중 프로세서 실시간 시스템에서의 동기화 프로토콜)

Kang, Seung-Yup;Ha, Rhan
- Proceedings of the Korean Information Science Society Conference
- /
- 1998.10a
- /
- pp.136-138
- /
- 1998
작업들이 자원을 공유하는 경우 예측하기 어려운 지연시간이 발생한다. 다중 프로세서 시스템에서의 자원공유로 인한 지연시간은 더욱 예측하기 어렵다. 실기간 시스템의 스케줄 가능성 검사를 위해서는 이러한 지연시간을 정확히 예측해야한다. 선점가능한 우선순위 구동 CPU 스케줄링 알고리즘에 의해서 다른 우선순위의 작업과의 동기화는 우선순위 역전 문제를 야기한다. 본 논문에서는 다중 프로세서에서의 동기화 프로토콜을 제안하고 작업의 지연시간을 분석한다. 다른 프로세서에 할당된 작업들이 수행중인 자원을 요구할 때, 자원을 수행하는 작업의 우선순위를 높여줌으로써 자원수행을 빠르게 종료하게 한다. 이로 인해 자원에 의한 지연을 최소화한다. 특히, 높은 우선순위 작업의 경우 더욱 작은 지연시간을 갖게한다. 시뮬레이션을 통한 Shared Memory Protocol [5]과의 비교, 분석 결과 성능의 향상을 보임을 알 수 있다. 다양한 작업집합에 대한 지연시간을 분석하였다.
PDF

Simulation-based Design Verification for High-performance Computing System

Jeong Taikyeong T.
- Journal of Korea Multimedia Society
- /
- v.8 no.12
- /
- pp.1605-1612
- /
- 2005
This paper presents the knowledge and experience we obtained by employing multiprocessor systems as a computer simulation design verification to study high-performance computing system. This paper also describes a case study of symmetric multiprocessors (SMP) kernel on a 32 CPUs CC-NUMA architecture using an actual architecture. A small group of CPUs of CC-NUMA, high-performance computer system, is clustered into a processing node or cluster. By simulating the system design verification tools; we discussed SMP OS kernel on a CC-NUMA multiprocessor architecture performance which is $32\%$ of the total execution time and remote memory access latency is occupied $43\%$ of the OS time. In this paper, we demonstrated our simulation results for multiprocessor, high-performance computing system performance, using simulation-based design verification.
PDF

Study on parallel algorithmfor falult simulation (고장시뮬레이션의 병렬화 알고리듬에 관한 연구)

송오영
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.21 no.11
- /
- pp.2966-2977
- /
- 1996
As design of very large circuits is made possible by rapid development of VLSI technologies, efficient fault simulation is needed. Ingeneral, fault simulation requires many computer resources. As general-purpose multiprocessors become more common and affordable, these seem to be an attractive and effective alternative for fault simulation. Efficient fault simulation of synchronous sequential circuits has been reported to be attainably by using a linear iterative array model for such a circuit, and combining parallel fault simulation with russogate fault simulation. Such fault simulation algorithm is parallelized on a general-purpose multiprocessor with shard memory for acceleration of fault simulation. Through the experimenal study, the effect of the number of processors on speed-up of simulation, processor utilization, and the effect of multiprocessor hardware on simulation performance are studied. Some results for experiments with benchmark circuits are shown.
PDF

Search Result 71, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)