• Title/Summary/Keyword: Parallel Implementation

Search Result 880, Processing Time 0.037 seconds

Design and Implementation of an Approximate Surface Lens Array System based on OpenCL (OpenCL 기반 근사곡면 렌즈어레이 시스템의 설계 및 구현)

  • Kim, Do-Hyeong;Song, Min-Ho;Jung, Ji-Sung;Kwon, Ki-Chul;Kim, Nam;Kim, Kyung-Ah;Yoo, Kwan-Hee
    • The Journal of the Korea Contents Association
    • /
    • v.14 no.10
    • /
    • pp.1-9
    • /
    • 2014
  • Generally, integral image used for autostereoscopic 3d display is generated for flat lens array, but flat lens array cannot provide a wide range of view for generated integral image because of narrow range of view. To make up for this flat lens array's weak point, curved lens array has been proposed, and due to technical and cost problem, approximate surface lens array composed of several flat lens array is used instead of ideal curved lens array. In this paper, we constructed an approximate surface lens array arranged for $20{\times}8$ square flat lens in 100mm radius sphere, and we could get about twice angle of view compared to flat lens array. Specially, unlike existing researches which manually generate integral image, we propose an OpenCL GPU parallel process algorithm for generating real-time integral image. As a result, we could get 12-20 frame/sec speed about various 3D volume data from $15{\times}15$ approximate surface lens array.

A Small Real-Time Radio Broadcasting System by Using Smart Phone (스마트폰을 이용한 소규모 실시간 라디오 방송 시스템)

  • Lee, Jae-Moon
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.12 no.5
    • /
    • pp.83-90
    • /
    • 2012
  • This paper is a research on the design and implementation of a small real-time radio broadcasting system by using smart phone based on Android. It was designed as the server-client structure, and used the progressive download of HTTP as methods of transferring data to further simplify the system. In order to realize the real-time broadcasting, the original audio source was divided with a short interval and captured to be compressed and stored into files. Then the client receives and plays the compressed files sequentially as it is downloaded. However, this method occurs two problems each of which is the loss of capturing the original source in the server and the discontinuity of playing the files in the client. We solved the problem in the server by separating the thread into two parallel threads of which is each captured and compressed/stored, also by using the double buffering method. The problem in the client was solved using MediaPlayer in Android and the file queue to store the multiple files.

NoC Energy Measurement and Analysis with a Cycle-accurate Energy Measurement Tool for Virtex-II FPGAs (네트워크-온-칩 설계의 전력 소모 분석을 위한 Virtex-II FPGA의 싸이클별 전력 소모 측정 도구 개발)

  • Lee, Hyung-Gyu;Chang, Nae-Hyuck
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.44 no.2
    • /
    • pp.86-94
    • /
    • 2007
  • The NoC (network-on-chip) approach is a promising solution to the increasing complexity of on-chip communication problems because of its high scalability. But, NoC applications generally consume a lot of power, because they require a large design space to accommodate many parallel IPs and network communication channels. It is not easy to analyze the power consumption of NoC applications with conventional simulation methods using simple power models. In addition, there are also many limitations in using sophisticated simulation models because they require long execution time and large efforts. In this paper, we apply a cycle-accurate energy measurement technique and tool to the FPGA prototypes, which are generally used to verify the correctness of SoC designs, as a practical indication of the power consumption of real NoC applications. An NoC-based JPEG encoder implementation is used as a case study to demonstrate the effectiveness of our approach.

Implementation of Efficient Power Method on CUDA GPU (CUDA 기반 GPU에서 효율적인 Power Method의 구현)

  • Kim, Jung-Hwan;Kim, Jin-Soo
    • Journal of the Korea Society of Computer and Information
    • /
    • v.16 no.2
    • /
    • pp.9-16
    • /
    • 2011
  • GPU computing is emerging in high performance application area since it can easily exploit massive parallelism in a way of cost-effective computing. The power method which finds the eigen vector of a given matrix is widely used in various applications such as PageRank for calculating importance of web pages. In this research we made the power method efficiently parallelized on GPU and also suggested how it can be improved to enhance its performance. The power method mainly consists of matrix-vector product and it can be easily parallelized. However, it should decide the convergence of the eigen vector and need scaling of the vector subsequently. Such operations incur several calls to GPU kernels and data movement between host and GPU memories. We improved the performance of the power method by means of reduced calls to GPU kernels, optimized thread allocation and enhanced decision operation for the convergence.

Design of Low Complexity and High Throughput Encoder for Structured LDPC Codes (구조적 LDPC 부호의 저복잡도 및 고속 부호화기 설계)

  • Jung, Yong-Min;Jung, Yun-Ho;Kim, Jae-Seok
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.46 no.10
    • /
    • pp.61-69
    • /
    • 2009
  • This paper presents the design results of a low complexity and high throughput LDPC encoder structure. In order to solve the high complexity problem of the LDPC encoder, a simplified matrix-vector multiplier is proposed instead of the conventional complex matrix-vector multiplier. The proposed encoder also adopts a partially parallel structure and performs column-wise operations in matrix-vector multiplication to achieve high throughput. Implementation results show that the proposed architecture reduces the number of logic gates and memory elements by 37.4% and 56.7%, compared with existing five-stage pipelined architecture. The proposed encoder also supports 800Mbps throughput at 40MHz clock frequency which is improved about three times more than the existing architecture.

Numerical investigations on stability evaluation of a jointed rock slope during excavation using an optimized DDARF method

  • Li, Yong;Zhou, Hao;Dong, Zhenxing;Zhu, Weishen;Li, Shucai;Wang, Shugang
    • Geomechanics and Engineering
    • /
    • v.14 no.3
    • /
    • pp.271-281
    • /
    • 2018
  • A jointed rock slope stability evaluation was simulated by a discontinuous deformation analysis numerical method to investigate the process and safety factors for different crack distributions and different overloading situations. An optimized method using Discontinuous Deformation Analysis for Rock Failure (DDARF) is presented to perform numerical investigations on the jointed rock slope stability evaluation of the Dagangshan hydropower station. During the pre-processing of establishing the numerical model, an integrated software system including AutoCAD, Screen Capture, and Excel is adopted to facilitate the implementation of the numerical model with random joint network. These optimizations during the pre-processing stage of DDARF can remarkably improve the simulation efficiency, making it possible for complex model calculation. In the numerical investigations on the jointed rock slope stability evaluations using the optimized DDARF, three calculation schemes have been taken into account in the numerical model: (I) no joint; (II) two sets of regular parallel joints; and (III) multiple sets of random joints. This model is capable of replicating the entire processes including crack initiation, propagation, formation of shear zones, and local failures, and thus is able to provide constructive suggestions to supporting schemes for the slope. Meanwhile, the overloading numerical simulations under the same three schemes have also been performed. Overloading safety factors of the three schemes are 5.68, 2.42 and 1.39, respectively, which are obtained by analyzing the displacement evolutions of key monitoring points during overloading.

Development of Multiscale Modeling Methods Coupling Molecular Dynamics and Stochastic Rotation Dynamics (분자동역학과 확률회전동역학을 결합한 멀티스케일 모델링 기법 개발)

  • Cha, Kwangho;Jung, Youngkyun
    • KIISE Transactions on Computing Practices
    • /
    • v.20 no.10
    • /
    • pp.534-542
    • /
    • 2014
  • Multiscale modeling is a new simulation approach which can manage different spatial and temporal scales of system. In this study, as part of multiscale modeling research, we propose the way of combining two different simulation methods, molecular dynamics(MD) and stochastic rotation dynamics(SRD). Our conceptual implementations are based on LAMMPS, one of the well-known molecular dynamics programs. Our prototype of multiscale modeling follows the form of the third party implementation of LAMMPS. It added MD to SRD in order to simulate the boundary area of the simulation box. Because it is important to guarantee the seamless simulation, we also designed the overlap zones and communication zones. The preliminary experimental results showed that our proposed scheme is properly worked out and the execution time is also reduced.

Design and Implementation of a Mobile Runtime Library for Execution of Large-scale Application (대용량 소프트웨어 실행을 위한 모바일 런타임 라이브러리 설계 및 구현)

  • Lee, Ye-In;Lee, Jong-Woo
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.1
    • /
    • pp.1-9
    • /
    • 2010
  • Today's growth of the mobile communication infrastructure made mobile computing systems like cellular phones came next to or surpassed the desktop PCs in popularity due to their mobility. Although the performance of mobile devices is now being improved continuously, it is a current common sense that compute intensive large-scale applications can hardly run on any kind of mobile handset devices. To clear up this problem, we decided to exploit the mobile cluster computing system and surveyed the existing ones first. We found out, however, that most of them are not the actual implementations but a mobile cluster infrastructure proposal or idea suggestions for reliable mobile clustering. To make cell phones participated in cluster computing nodes, in this paper, we propose a redesigned JPVM cluster computing engine and a set of WIPI mobile runtime functions interfacing with it. And we also show the performance evaluation results of real parallel applications running on our Mobile-JPVM cluster computing systems. We find out by the performance evaluation that large-scale applications can sufficiently run on mobile devices such as cellular phones when using our mobile cluster computing engine.

A Study on the Acupuncture & Moxibustion of Huangdineijing - focused on Concept of Time - (『황제내경(黃帝內經)』의 침구법(鍼灸法) 연구 - 시간(時間) 개념을 중심으로 -)

  • Lyu, Jeong-ah
    • Journal of Korean Medical classics
    • /
    • v.29 no.3
    • /
    • pp.57-74
    • /
    • 2016
  • Subject : Treatment of acupuncture & moxibustion in Huangdineijing Objectives : Discover the principles and substances of acupuncture & moxibustion treatment of Huangdineijing Methods : The author reviewed the Huangdineijing: Suwen, Lingshu to gain a realistic sense of the facts and the medical book's contents that deal with the practices of acupuncture & moxibustion treatment for clinical praxis. Conclusions : 1. The acupuncture & moxibustion of Huangdineijing can be defined as follows. A treatment for correcting of meridian's drift by proper selection of surgical site, using suitable implementation and appropriate manipulation skill from a correct judgement of meridian's flowing through sensing the pulse for diagnosis when the meridian's flowing which come and go constantly driven by the tidal order of defensive Qi tactually appeared hard or soft caused by diseases, thus bring out the soft & gradual 50 laps circulation of nutrient Qi moderately driven by defensive Qi. 2. Today's acupuncture & moxibustion treatments greatly differ from those of Huangdineijing in that we do not know the substance of that treatment which is correcting of meridian's drift and do not obligatorily carry out feeling of the pulse for diagnosis which is a measure & judgement of meridian's flowing against a preliminary treatment in parallel with treatment so reduce use and power of that treatment. 3. I could form a list of general principle from the substances of acupuncture & moxibustion of Huangdineijing like as the flowing or fate of meridian Qi(tidally going in order of defensive Qi and nutrient Qi), deficiency or excess of Meridian flowing, comparing diagnosis before and after treatment, bring out the soft & gradual Qi. That was imposing "the concept of time" on recognition and treatment to human body.

Design and Implementation of Enhanced Resonant Converter for EV Fast Charger

  • Ahn, Suk-Ho;Gong, Ji-Woong;Jang, Sung-Roc;Ryoo, Hong-Je;Kim, Duk-Heon
    • Journal of Electrical Engineering and Technology
    • /
    • v.9 no.1
    • /
    • pp.143-153
    • /
    • 2014
  • This paper presents a novel application of LCC resonant converter for 60kW EV fast charger and describes development of the high efficiency 60kW EV fast charger. The proposed converter has the advantage of improving the system efficiency especially at the rated load condition because it can reduce the conduction loss by improving the resonance current shape as well as the switching loss by increasing lossless snubber capacitance. Additionally, the simple gate driver circuit suitable for proposed topology is designed. Distinctive features of the proposed converter were analyzed depending on the operation modes and detail design procedure of the 10kW EV fast charger converter module using proposed converter topology were described. The proposed converter and the gate driver were identified through PSpice simulation. The 60kW EV fast charger which generates output voltage ranges from 50V to 500V and maximum 150A of output currents using six parallel operated 10kW converter modules were designed and implemented. Using 60kW fast charger, the charging experiments for three types of high-capacity batteries were performed which have a different charging voltage and current. From the simulation and experimental results, it is verified that the proposed converter topology can be effectively used as main converter topology for EV fast charger.