• 제목/요약/키워드: Parallel Communication

검색결과 1,117건 처리시간 0.028초

Parallel LDPC Decoding on a Heterogeneous Platform using OpenCL

  • Hong, Jung-Hyun;Park, Joo-Yul;Chung, Ki-Seok
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제10권6호
    • /
    • pp.2648-2668
    • /
    • 2016
  • Modern mobile devices are equipped with various accelerated processing units to handle computationally intensive applications; therefore, Open Computing Language (OpenCL) has been proposed to fully take advantage of the computational power in heterogeneous systems. This article introduces a parallel software decoder of Low Density Parity Check (LDPC) codes on an embedded heterogeneous platform using an OpenCL framework. The LDPC code is one of the most popular and strongest error correcting codes for mobile communication systems. Each step of LDPC decoding has different parallelization characteristics. In the proposed LDPC decoder, steps suitable for task-level parallelization are executed on the multi-core central processing unit (CPU), and steps suitable for data-level parallelization are processed by the graphics processing unit (GPU). To improve the performance of OpenCL kernels for LDPC decoding operations, explicit thread scheduling, vectorization, and effective data transfer techniques are applied. The proposed LDPC decoder achieves high performance and high power efficiency by using heterogeneous multi-core processors on a unified computing framework.

Performance Evaluation of Parallel Opportunistic Multihop Routing

  • Shin, Won-Yong
    • Journal of information and communication convergence engineering
    • /
    • 제12권3호
    • /
    • pp.135-139
    • /
    • 2014
  • Opportunistic routing was originally introduced in various multihop network environments to reduce the number of hops in such a way that, among the relays that decode the transmitted packet for the current hop, the one that is closest to the destination becomes the transmitter for the next hop. Unlike the conventional opportunistic routing case where there is a single active S-D pair, for an ad hoc network in the presence of fading, we investigate the performance of parallel opportunistic multihop routing that is simultaneously performed by many source-destination (S-D) pairs to maximize the opportunistic gain, thereby enabling us to obtain a logarithmic gain. We first analyze a cut-set upper bound on the throughput scaling law of the network. Second, computer simulations are performed to verify the performance of the existing opportunistic routing for finite network conditions and to show trends consistent with the analytical predictions in the scaling law. More specifically, we evaluate both power and delay with respect to the number of active S-D pairs and then, numerically show a net improvement in terms of the power-delay trade-off over the conventional multihop routing that does not consider the randomness of fading.

OpenCL을 활용한 CPU와 GPU 에서의 CMMB LDPC 복호기 병렬화 (Parallel LDPC Decoder for CMMB on CPU and GPU Using OpenCL)

  • 박주열;홍정현;정기석
    • 대한임베디드공학회논문지
    • /
    • 제11권6호
    • /
    • pp.325-334
    • /
    • 2016
  • Recently, Open Computing Language (OpenCL) has been proposed to provide a framework that supports heterogeneous computing platforms. By using an OpenCL framework, digital communication systems can support various protocols in a unified computing environment to achieve both high portability and high performance. This article introduces a parallel software decoder of Low Density Parity Check (LDPC) codes for China Multimedia Mobile Broadcasting (CMMB) on a heterogeneous platform. Each step of LDPC decoding has different parallelization characteristics. In this paper, steps suitable for task-level parallelization are executed on the CPU, and steps suitable for data-level parallelization are processed by the GPU. To improve the performance of the proposed OpenCL kernels for LDPC decoding operations, explicit thread scheduling, loop-unrolling, and effective data transfer techniques are applied. The proposed LDPC decoder achieves high performance by using heterogeneous multi-core processors on a unified computing framework.

인력선 프레임의 병렬화 위상 최적설계 (Parallelized Topology Design Optimization of the Frame of Human Powered Vessel)

  • 김현석;이기명;김민근;조선호
    • 대한조선학회논문집
    • /
    • 제47권1호
    • /
    • pp.58-66
    • /
    • 2010
  • Topology design optimization is a method to determine the optimal distribution of material that yields the minimal compliance of structures, satisfying the constraint of allowable material volume. The method is easy to implement and widely used so that it becomes a powerful design tool in various disciplines. In this paper, a large-scale topology design optimization method is developed using the efficient adjoint sensitivity and optimality criteria methods. Parallel computing technique is required for the efficient topology optimization as well as the precise analysis of large-scale problems. Parallelized finite element analysis consists of the domain decomposition and the boundary communication. The preconditioned conjugate gradient method is employed for the analysis of decomposed sub-domains. The developed parallel computing method in topology optimization is utilized to determine the optimal structural layout of human powered vessel.

재작업이 존재하는 이종병렬기계에서 생산효율을 위해 공정소요시간 단축을 목적으로 하는 작업할당 (Dispatching to Minimize Flow Time for Production Efficiency in Non-Identical Parallel Machines Environment with Rework)

  • 서정하;고효헌;김성식;백준걸
    • 대한산업공학회지
    • /
    • 제37권4호
    • /
    • pp.367-381
    • /
    • 2011
  • Reducing waste for the efficiency of production is becoming more important because of the rapidly changing market circumstances and the rising material and oil prices. The dispatching also has to consider the characteristic of production circumstance for the efficiency. The production circumstance has the non-identical parallel machines with rework rate since machines have different capabilities and deterioration levels in the real manufacturing field. This paper proposes a dispatching method, FTLR (Flow Time Loss Index with Rework Rate) for production efficiency. The goal of FTLR is to minimize flow time based on such production environments. FTLR predicts the flow time with rework rate. After assessing dominant position of expected flow time per each machine, FTLR performs dispatching to minimize flow time. Experiments compare various dispatch methods for evaluating FTLR with mean flow time, mean tardiness and max tardiness in queue.

Suppression of Parallel Plate Modes Using Edge-Located EBG Structure in High-Speed Power Bus

  • Cho, Jonghyun;Kim, Myunghoi
    • Journal of information and communication convergence engineering
    • /
    • 제14권4호
    • /
    • pp.252-257
    • /
    • 2016
  • An edge-located electromagnetic bandgap (EL-EBG) structure using a defected ground structure (DGS) is proposed to suppress resonant modes induced by edge excitation in a two-dimensional planar parallel plate waveguide (PPW). The proposed EL-DGS-EBG PPW significantly mitigates multiple transverse-magnetic (TM) modes in a wideband frequency range corresponding to an EBG stopband. To verify the wideband suppression, test vehicles of a conventional PPW, a PPW with a mushroom-type EBG structure, and an EL-DGS-EBG PPW are fabricated using a commercial process involving printed circuit boards (PCBs). Measurements of the input impedances show that multiple resonant modes of the previous PPWs are significantly excited through an input port located at a PPW edge. In contrast, resonant modes in the EL-DGS-EBG PPW are substantially suppressed over the frequency range of 0.5 GHz to 2 GHz. In addition, we have experimentally demonstrated that the EL-DGS-EBG PPW reduces the radiated emission from -24 dB to -44 dB as compared to the conventional PPW.

병렬유전자알고리즘을 이용한 탐지노드 선정문제의 에너지 효율성과 수렴성 향상에 관한 해석 (Analysis of Improved Convergence and Energy Efficiency on Detecting Node Selection Problem by Using Parallel Genetic Algorithm)

  • 성기택
    • 한국정보통신학회논문지
    • /
    • 제16권5호
    • /
    • pp.953-959
    • /
    • 2012
  • 센서네트워크에서는 다수의 유휴노드가 존재하며 네트워크의 이상행위 탐지는 이러한 유휴노드를 이용하여 구현될 수 있다. 최적화 문제로 정의된 탐지노드선정 문제에 대하여, 기존의 방법에서는 중앙처리방식의 유전자 알고리즘을 이용하였다. 본 논문에서는 최적 값으로의 수렴 성을 개선함과 동시에 에너지 효율성을 향상시키는 방법으로써 네트워크의 토폴로지 특성을 고려한 병렬유전자알고리즘을 이용한 방법을 제안하였다. 시뮬레이션을 통하여 제안한 방법이 기존의 방법에 비하여 최적 값으로의 수렴이 개선되었음과 에너지 효율적임을 확인하였다.

메시지의 상관관계를 이용한 분산병렬처리 기반의 소셜 네트워크 서비스 시각화 방법 (Visualization Method of Social Networks Service using Message correlations based on Distributed Parallel Processing)

  • 김용일;박선;류갑상
    • 한국정보통신학회논문지
    • /
    • 제17권5호
    • /
    • pp.1168-1173
    • /
    • 2013
  • 본 논문은 소셜 네트워크상의 내부관계와 외부관계를 반영하여 사용자간의 관계를 사용자 중심으로 계층적 시각화하는 새로운 클라우드 기반의 방법을 제안한다. 본논문의 시각화방법은 상관관계 행렬을 이용하여 사용자의 내부관계를 계산하여 소셜 네트워크상 사용자 중심의 관계 계층을 잘 나타내며, 소셜 네트워크의 외부 관계를 이용하여 사용자의 계층 관계에 접근 노드의 중요도를 반영한다. 제안방법의 사용자들은 소셜 네트워크상의 사용자 노드 관계가 계층적으로 시각화되기 때문에 사용자 관계를 잘 이해할 수 있다. 이외에 제안된 방법은 하둡(hadoop)과 하이프(hive)를 이용하여 분산저장 및 병렬로 계산하며, 계산 결과는 D3를 이용하여 계층적 그래프로 시각화한다.

멀티쓰레드 기반 병렬처리 구조를 이용한 TMN 에이젼트 플랫폼 설계 및 구현 (Design and Implementation of a TMN Agent Platform based on a Multi-thread Parallel Processing Architecture)

  • 김성우;김영탁
    • 한국정보과학회논문지:컴퓨팅의 실제 및 레터
    • /
    • 제5권6호
    • /
    • pp.793-800
    • /
    • 1999
  • TMN Agent Platform은 망 요소의 운영상태와 자원들을 GDMO에 따라 관리객체(Managed Object : MO)로 모델링 하고, 자원들의 현재 상태를 유지하며, 관리자(Manager)로부터의 망 관리 기능 요구에 따라 조작된다. 그러므로, 에이전트의 성능향상은 전체적인 통신망 관리의 성능향상에 직접적인 영향을 미친다.본 논문에서는 TMN 에이전트의 기능요구 사항을 분석하고, 이를 토대로 성능향상을 위해 멀티스레드 기법을 사용하는 병렬 처리 구조의 TMN Agent Platform의 기능구조를 제시한다. 또한 에이전트와 다양한 자원들간의 효율적인 메시지전달을 위한 체계를 제시하며, 구현된 TMN Agent Platform의 성능을 분석한다.Abstract TMN Agent manages the operational status and real-resources of network elements, such as switching nodes and transmission systems. It performs the requested management functions from manager and maintains consistent status data of real-resource. The performance of agent system affects directly the performance of network management operation. If the agent is implemented by sequential processing scheme with single process, the agent processing can be delayed or blocked according to the status of real-resources. This problem can be solved by parallel and distributed processing scheme.To improve the processing performance of TMN Agent, we propose a TMN Agent Platform's functional architecture that is based on parallel processing with multi-tread and effective message transferring scheme between agent and various real-resource. We analyze the performance of the implemented TMN Agent Platform.

The Auto-Tracking Communication Link Using Planar Active Rectrodirective Array

  • Kim, Gi-Rae
    • Journal of information and communication convergence engineering
    • /
    • 제5권2호
    • /
    • pp.121-125
    • /
    • 2007
  • In this paper, a planar active retrodirective four-element array with subharmonic phase conjugation mixers based on anti-parallel diode pairs (APDPs) is designed, and its application in the auto-tracking duplex communication link is presented. As compared to previous phase conjugation mixers using twice RF frequency for LO frequency, the proposed conjugation mixers need only half RF frequency so that it can be easily applied for millimeter-wave applications. The proposed architecture, which conventionally performs the function of the transmission of an incident signal back in the direction of its source, is modified in order to include a receive function. Experimental verification of these architectures is performed at 1GHz and the results from the prototypes are compared with a theoretical model.