Search | Korea Science

A synchronous/asynchronous hybrid parallel method for some eigenvalue problems on distributed systems

박필성
- Proceedings of the Korean Society of Computational and Applied Mathematics Conference
- /
- 2003.09a
- /
- pp.11-11
- /
- 2003
오늘날 단일 슈퍼컴퓨터로는 처리가 불가능한 거대한 문제들의 해법이 시도되고 있는데, 이들은 지리적으로 분산된 슈퍼컴퓨터, 데이터베이스, 과학장비 및 디스플레이 장치 등을 초고속 통신망으로 연결한 GRID 환경에서 효과적으로 실행시킬 수 있다. GRID는 1990년대 중반 과학 및 공학용 분산 컴퓨팅의 연구 과정에서 등장한 것으로, 점차 응용분야가 넓어지고 있다. 그러나 GRID 같은 분산 환경은 기존의 단일 병렬 시스템과는 많은 점에서 다르며 이전의 기술들을 그대로 적용하기에는 무리가 있다. 기존 병렬 시스템에서는 주로 동기 알고리즘(synchronous algorithm)이 사용되는데, 직렬 연산과 같은 결과를 얻기 위해 동기화(synchronization)가 필요하며, 부하 균형이 필수적이다. 그러나 부하 균형은 이질 클러스터(heterogeneous cluster)처럼 프로세서들의 성능이 서로 다르거나, 지리적으로 분산된 계산자원을 사용하는 GRID 환경에서는 이기종의 문제뿐 아니라 네트워크를 통한 메시지의 전송 지연 등으로 유휴시간이 길어질 수밖에 없다. 이처럼 동기화의 필요성에 의한 연산의 지연을 해결하는 하나의 방안으로 비동기 반복법(asynchronous iteration)이 나왔으며, 지금도 활발히 연구되고 있다. 이는 알고리즘의 동기점을 가능한 한 제거함으로써 빠른 프로세서의 유휴 시간을 줄이는 것이 목적이다. 즉 비동기 알고리즘에서는, 각 프로세서는 다른 프로세서로부터 갱신된 데이터가 올 때까지 기다리지 않고 계속 다음 작업을 수행해 나간다. 따라서 동시에 갱신된 데이터를 교환한 후 다음 단계로 진행하는 동기 알고리즘에 비해, 미처 갱신되지 않은 데이터를 사용하는 경우가 많으므로 전체적으로는 연산량 대비의 수렴 속도는 느릴 수 있다 그러나 각 프로세서는 거의 유휴 시간이 없이 연산을 수행하므로 wall clock time은 동기 알고리즘보다 적게 걸리며, 때로는 50%까지 빠른 결과도 보고되고 있다 그러나 현재까지의 연구는 모두 어떤 수렴조건을 만족하는 선형 시스템의 해법에 국한되어 있으며 비교적 구현하기 쉬운 공유 메모리 시스템에서의 연구만 보고되어 있다. 본 연구에서는 행렬의 주요 고유쌍을 구하는 데 있어 비동기 반복법의 적용 가능성을 타진하기 위해 우선 이론적으로 단순한 멱승법을 사용하여 실험하였고 그 결과 순수한 비동기 반복법은 수렴하기 어렵다는 결론을 얻었다 그리하여 동기 알고리즘에 비동기적 요소를 추가한 혼합 병렬 알고리즘을 제안하고, MPI(Message Passing Interface)를 사용하여 수원대학교의 Hydra cluster에서 구현하였다. 그 결과 특정 노드의 성능이 다른 것에 비해 현저하게 떨어질 때 전체적인 알고리즘의 수렴 속도가 떨어지는 것을 상당히 완화할 수 있음이 밝혀졌다.
PDF

Transient Coordinator : a Collision Resolution Algorithm for Asynchronous MAC Protocols in Wireless Sensor Networks (센서 네트워크 환경에서 비동기식 MAC 프로토콜을 위한 충돌해결 알고리즘)

Lee, Sang-Hoon;Bae, Yong-Soo;Choi, Lynn
- Proceedings of the Korean Information Science Society Conference
- /
- 2012.06d
- /
- pp.208-210
- /
- 2012
일반적으로 비동기식 센서 네트워크 MAC 프로토콜들은 별도의 충돌회피 알고리즘을 사용하지 않기 때문에 동기식 MAC 프로토콜에 비해 충돌에 의한 성능 저하가 크게 나타난다. 본 논문에서는 충돌에 따른 비동기식 MAC 프로토콜의 성능 저하 문제를 최소화하기 위해 수신 노드가 충돌을 야기한 송신 노드들의 전송 순서를 제어하여 추가적인 충돌을 제거할 수 있는 비동기식 MAC 프로토콜용 충돌해결 알고리즘을 소개한다. 충돌을 감지한 수신 노드는 충돌을 일으킨 송신 노드들에 대한 정보를 수집하고 이를 이용하여 충돌이 발생하지 않는 전송 순서를 각 송신 노드에게 지정한다. 송신 노드들은 지정된 순서에 따라 독점적인 채널 사용 권한을 얻으며 이를 통해 송신 노드간의 충돌을 회피할 수 있다. 실험에 대한 분석에 따르면 본 논문에서 제안하는 충돌해결 알고리즘은 반복적인 충돌을 방지하여 그로 인해 낭비되는 시간과 대역폭을 줄이고 이를 통해 각 노드당 평균 데이터 처리량을 약 18.6% 증가시킬 수 있다.

Performance analysis of iterative groupwise equal-delay interference cancellation(IGEIC) for multiuser detection of coherent W-CDMA system (동기복조 W-CDMA 시스템의 다중 사용자 검출을 위한 반복 그룹단위 등지연 간섭제거(IGEIC) 알고리즘 성능해석)

구제길;최형진
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.27 no.3B
- /
- pp.179-187
- /
- 2002
This paper proposes and analyzes the iterative groupwise equal-delay interference cancellation(IGEIC) algorithm for coherent detection of an asynchronous wideband DS-CDMA system in a single cell over multipath fading channels. The IGEIC algorithm divide users in a system into several groups, and subtract out interference signal from the received signal as many as the number of users within a user group, iteratively. The IGEIC algorithm is also classified into the iterative groupwise equal-delay serial interference cancellation(IGESIC) algorithm and the iterative groupwise equal-delay parallel interference cancellation(IGEPIC) algorithm. In the case of perfect correlation for spreading codes, it shows that the performance of IGESIC and IGEPIC algorithm is the same after interference cancellation of as many as the number of users within a user group, but the performance of IGEPIC algorithm is superior to the IGESIC algorithm just before fecal cancellation within a user group. The results show that (he performance of the two proposed algorithms are also superior to the SIC algorithm by 3dB.
PDF KSCI

Improving Performance of Large Sparse Linear System Solvers On Distributed Memory Systems By Asynchronous Algorithms (비동기 알고리즘을 이용한 분산 메모리 시스템에서의 초대형 선형 시스템 해법의 성능 향상)

Park, Pil-Seong;Sin, Sun-Cheol
- The KIPS Transactions:PartA
- /
- v.8A no.4
- /
- pp.439-446
- /
- 2001
The main stream of parallel programming today is using synchronous algorithms, where processor synchronization for correct computation and workload balance are essential. Overall performance of the whole system is dependent upon the performance of the slowest processor, if workload is not well-balanced or heterogeneous clusters are used. Asynchronous iteration is a way to mitigate such problems, but most of the works done so far are for shared memory systems. In this paper, we suggest and implement a parallel large sparse linear system solver that improves performance on distributed memory systems like clusters by reducing processor idle times as much as possible by asynchronous iterations.
PDF

Implementation Of Asymmetric Communication For Asynchronous Iteration By the MPMD Method On Distributed Memory Systems (분산 메모리 시스템에서의 MPMD 방식의 비동기 반복 알고리즘을 위한 비대칭 전송의 구현)

Park Pil-Seong
- Journal of Internet Computing and Services
- /
- v.4 no.5
- /
- pp.51-60
- /
- 2003
Asynchronous iteration is a way to reduce performance degradation of some parallel algorithms due to load imbalance or transmission delay between computing nodes, which requires asymmetric communication between the nodes of different speeds. To implement such asynchronous communication on distributed memory systems, we suggest an MPMD method that creates an additional separate server process on each computing node, and compare it with an SPMD method that creates a single process per node.
PDF

The Signal Acquisition Algorithm for Ultra Wide-band Communication Systems (UWB 통신시스템에서 동기 획득 알고리즘)

Park, Dae-Heon;Kang, Beom-Jin;Park, Jang-Woo;Cho, Sung-Eon
- Journal of Advanced Navigation Technology
- /
- v.12 no.2
- /
- pp.146-153
- /
- 2008
Due to the extremely short pulse in the Ultra-Wideband (UWB) technology, the accurate synchronization acquisition method is very important for both high data-rate WPAN and low data-rate WPAN. In this paper, we propose the synchronization acquisition algorithm based on two-step signal search method to acquire the synchronization in the UWB multi-path channel. At the first step, the search window is divided by two and the window that has higher power is chosen as a next search window. This operation is repeated until the measure power of the search window is smaller than the threshold value. At the second step, we employ Linear Search algorithm to the search window obtained at the first step for fine search. The proposed algorithm is proved that the synchronization acquisition is faster than the parallel search algorithm and it shows good performance in environment of the SNR extreme changes by the simulation.
PDF

Reserved Slot Allocation Scheme for Voice Service in WATM MAC (무선 비동기 전송모드 매체 접근제어에서 음성서비스를 위한 예약 슬롯 할당 알고리즘)

김관웅;배성환;전병실
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.7
- /
- pp.101-108
- /
- 2001
In this paper we focus on dynamic reservation slot allocation scheme for supporting QoS of a voice traffic in WATM MAC. Especially, voice traffic is the most important real-time object, and so we propose a new MAC protocol for voice traffic over WATM networks in the multimedia environment. According to the characteristics of voice traffic which is repeatedly in silent state and active state, new protocol allocates reservation slots dynamically with respect to the number of silent voice source of which starting time is stored to the state table in base station (BS). The simulation results show that the proposed protocol has better performance than slotted ALOHA in average access delay, collision rate, better than NC-PRMA(Non Collision Packet Reservation Multiple Access) in band width efficiency, and can provide a certain level of QoS requirement by the given slot assignment even though the number of voice terminals is increased.
PDF

Automatic Recognition Algorithm for Linearly Modulated Signals Under Non-coherent Asynchronous Condition (넌코히어런트 비동기하에서의 선형 변조신호 자동인식 알고리즘)

Sim, Kyuhong;Yoon, Wonsik
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.18 no.10
- /
- pp.2409-2416
- /
- 2014
In this paper, an automatic recognition algorithm for linearly modulated signals like PSK, QAM under noncoherent asynchronous condition is proposed. Frequency, phase, and amplitude characteristics of digitally modulated signals are changed periodically. By using this characteristics, cyclic moments and higher order cumulants based features are utilized for the modulation recognition. Hierarchial decision tree method is used for high speed signal processing and totally 4 feature extraction parameters are used for modulation recognition. In the condition where the symbol number is 4,096, the recognition accuracy of the proposed algorithm is more than 95% at SNR 15dB. Also the proposed algorithm is effective to classify the signal which has carrier frequency and phase offset.
https://doi.org/10.6109/jkiice.2014.18.10.2409 인용 PDF KSCI

A Fault-Tolerant Linear System Solver in a Standard MPI Environment (표준 MPI 환경에서의 무정지형 선형 시스템 해법)

Park, Pil-Seong
- Journal of Internet Computing and Services
- /
- v.6 no.6
- /
- pp.23-34
- /
- 2005
In a large scale parallel computation, failures of some nodes or communication links end up with waste of computing resources, Several fault-tolerant MPI libraries have been proposed so far, but the programs written by using such libraries have a portability problem since fault-tolerant features are not supported by the MPI standard yet, In this paper, we propose an application-level fault-tolerant linear system solver that uses the asynchronous iteration algorithm and the standard MPI functions only, which does not have a portability problem and is more efficient by adopting a simplified recovery mechanism.
PDF

Efficient Parallel Algorithm for Gram-Schmidt Method

Kim, Sung-Kyung
- Journal of Korea Society of Industrial Information Systems
- /
- v.4 no.4
- /
- pp.88-93
- /
- 1999
Several Iterative methods are considered, Gram-Schmidt algerian for thin orthogonalization and Lanczos methodfor a few extreme eigenvalues. For these methods, a variants of method is derived for which only one synchronization point per on iteration is required; that is one global communication in a message passing distributed-memory machine per one iteration is required The variant is called restructured method, and restructured method has better parallel properties to the conventional method.
PDF

Search Result 17, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)