• Title/Summary/Keyword: Parallel Processing method

Parallel Processing of Multi-Core Processor and GPUs in Projection Step for Efficient Fluid Simulation (효율적인 유체 시뮬레이션을 위한 투영 단계에서의 멀티 코어 프로세서와 그래픽 프로세서의 병렬처리)

  • Kim, Sun-Tae;Jung, Hwi-Ryong;Hong, Jeong-Mo
    • The Journal of the Korea Contents Association
    • /
    • v.13 no.6
    • /
    • pp.48-54
    • /
    • 2013
  • State-of-the-art fluid simulation in computer graphics now employs heterogeneous parallelization across the CPU and GPU. In this paper, we present a novel CPU-GPU parallel algorithm that solves the projection step of fluid simulation more efficiently than existing sequential CPU-GPU processing. With the proposed method, fluid simulations that demand substantial computational resources can be carried out efficiently.

Parallel Processing based Image Identifier Generation (병렬처리 기반 정지영상 인식자 생성)

  • Ko, Mieun;Park, Je-Ho;Park, Young B.;Seo, Wontaek
    • Journal of the Semiconductor & Display Technology
    • /
    • v.16 no.1
    • /
    • pp.6-10
    • /
    • 2017
  • Recent advances in still image acquisition devices have penetrated widely into everyday life. As a result, the large volume of still images produced and shared in personal or mass storage needs to be managed effectively and efficiently. Human-devised or system-generated identifiers used to identify still images are at risk when those identifiers are unexpectedly changed or deleted. In this paper, we propose a parallel processing based method for generating still image identifiers from the internal features of the images.

Full Search Equivalent Motion Estimation Algorithm for General-Purpose Multi-Core Architectures

  • Park, Chun-Su
    • Journal of the Semiconductor & Display Technology
    • /
    • v.12 no.3
    • /
    • pp.13-18
    • /
    • 2013
  • Motion estimation is a key technique of modern video processing that significantly improves coding efficiency by exploiting the temporal redundancy between successive frames. Thread-level parallelism is a promising way to accelerate motion estimation on multithreaded general-purpose processors. In this paper, we propose a parallel motion estimation algorithm that parallelizes the motion search process of the current H.264/AVC encoder. The proposed algorithm is implemented using the OpenMP application programming interface (API) and can be easily integrated into the current encoder. The experimental results show that the proposed parallel algorithm can reduce the processing time of motion estimation by up to 65.08% without any penalty in rate-distortion (RD) performance.
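
The idea of parallelizing a full search across blocks can be illustrated with a minimal sketch. This is not the paper's algorithm or its OpenMP implementation: the function names (`sad`, `full_search`, `estimate`), the block size, the search range, and the synthetic frames are all assumptions made for illustration, and a Python thread pool stands in for OpenMP threads. Each block's exhaustive search is independent of the others, so the parallel run returns exactly the same motion vectors as the sequential one.

```python
from concurrent.futures import ThreadPoolExecutor

def sad(cur, ref, bx, by, dx, dy, n):
    """Sum of absolute differences between the n x n block of `cur` at (bx, by)
    and the block of `ref` displaced by (dx, dy)."""
    return sum(abs(cur[by + y][bx + x] - ref[by + dy + y][bx + dx + x])
               for y in range(n) for x in range(n))

def full_search(cur, ref, bx, by, n, r):
    """Test every displacement within +/- r and return the best (dx, dy)."""
    h, w = len(ref), len(ref[0])
    best = None
    for dy in range(-r, r + 1):
        for dx in range(-r, r + 1):
            # Skip candidates that fall outside the reference frame.
            if by + dy < 0 or by + dy + n > h or bx + dx < 0 or bx + dx + n > w:
                continue
            cost = sad(cur, ref, bx, by, dx, dy, n)
            if best is None or cost < best[0]:
                best = (cost, dx, dy)
    return best[1], best[2]

def estimate(cur, ref, n=4, r=2, workers=None):
    """Run the full search for every n x n block, sequentially or in a thread pool."""
    blocks = [(bx, by)
              for by in range(0, len(cur) - n + 1, n)
              for bx in range(0, len(cur[0]) - n + 1, n)]
    search = lambda b: full_search(cur, ref, b[0], b[1], n, r)
    if workers is None:
        return [search(b) for b in blocks]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(search, blocks))  # map preserves block order

# Hypothetical 8 x 8 test frames: `cur` is `ref` shifted one pixel to the left.
ref = [[(7 * x + 13 * y + x * y) % 31 for x in range(8)] for y in range(8)]
cur = [[ref[y][min(x + 1, 7)] for x in range(8)] for y in range(8)]

sequential = estimate(cur, ref)
parallel = estimate(cur, ref, workers=4)
print(sequential == parallel)
```

In CPython the thread pool does not speed up pure-Python arithmetic; the sketch only shows the decomposition. Because no block's search depends on another's result, distributing the blocks across threads leaves the chosen vectors unchanged, which is consistent with the abstract's report of no RD penalty.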

A Study on the Pixel-Parallel Usage Processing Using the Format Converter (포맷 변환기를 이용한 화소-병렬 화상처리에 관한 연구)

  • Kim, Hyeon-Gi;Lee, Cheon-Hui
    • The KIPS Transactions:PartA
    • /
    • v.9A no.2
    • /
    • pp.259-266
    • /
    • 2002
  • In this paper, we implement various image processing filters using a format converter. The design method is based on realizing a large processor-per-pixel array with integrated circuit technology. The integrated structures fall into two types: associative parallel processors and parallel-processing DRAM (or SRAM) cells. The layout pitch of the one-bit-wide logic is identical to the memory cell pitch, so that high-density PEs can be arrayed in the integrated structure. The format converter design implements the control path efficiently and can exploit advanced technology without complicated controller hardware. The sequence of array instructions is generated by the host computer before processing starts, and the instructions are stored in the unit controller. Once processing starts, the host computer launches the pixel-parallel operations from the stored instructions. We obtained three results: 1) simple smoothing suppresses higher spatial frequencies, reducing noise but also blurring edges; 2) a smoothing and segmentation process reduces noise while preserving sharp edges; and 3) median filtering can be applied to reduce image noise, eliminating spikes while maintaining sharp edges and preserving monotonic variations in pixel values.
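
The contrast between results 1) and 3) can be seen in a small one-dimensional sketch. This is a plain software illustration, not the paper's pixel-parallel hardware design: the function names, the window size of 3, the edge-replication padding, and the sample signal are all assumptions.

```python
from statistics import median

def _windows(signal, k):
    """Yield a length-k window around each sample, replicating the end values."""
    half = k // 2
    padded = [signal[0]] * half + list(signal) + [signal[-1]] * half
    for i in range(len(signal)):
        yield padded[i:i + k]

def median_filter(signal, k=3):
    """Replace each sample by the median of its k-sample neighbourhood."""
    return [median(w) for w in _windows(signal, k)]

def mean_filter(signal, k=3):
    """Simple smoothing: replace each sample by the mean of its neighbourhood."""
    return [sum(w) / k for w in _windows(signal, k)]

# Hypothetical scan line: a one-sample spike (90) followed by a step edge (10 -> 50).
signal = [10, 10, 10, 90, 10, 10, 50, 50, 50]

med = median_filter(signal)
avg = mean_filter(signal)
print(med)  # spike removed, step edge kept sharp
print(avg)  # spike smeared over its neighbours, step edge blurred
```

The median output is `[10, 10, 10, 10, 10, 10, 50, 50, 50]`: the spike disappears and the step stays sharp, while the mean filter spreads the spike over three samples and produces intermediate values at the edge.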

Decomposition Based Parallel Processing Technique for Efficient Collaborative Optimization (효율적 분산협동설계를 위한 분해 기반 병렬화 기법의 개발)

  • Park, Hyung-Wook;Kim, Sung-Chan;Kim, Min-Soo;Choi, Dong-Hoon
    • Proceedings of the KSME Conference
    • /
    • 2000.11a
    • /
    • pp.818-823
    • /
    • 2000
  • In practical design studies, most designers solve multidisciplinary problems with complex design structures. These problems involve hundreds of analyses and thousands of variables, and the order in which they are processed affects the speed of the whole design cycle. It is therefore very important for designers to reorder the original design processes to minimize total cost and time. This is accomplished by decomposing a large multidisciplinary problem into several multidisciplinary analysis subsystems (MDASS) and processing them in parallel. This paper proposes a new strategy, based on a genetic algorithm, for the parallel decomposition of multidisciplinary problems to raise design efficiency, and shows the relationship between decomposition and multidisciplinary design optimization (MDO) methodology.

A proposed parallel processing structure for robot motion control (로봇 운동 제어의 실시간 연산을 위한 병렬처리구조)

  • 고경철;조형석
    • Proceedings of the Institute of Control, Robotics and Systems Conference (제어로봇시스템학회:학술대회논문집)
    • /
    • 1988.10a
    • /
    • pp.1-5
    • /
    • 1988
  • Realizing high-quality robot control requires improving the computing speed of the controller. In this paper, a parallel processing method is considered for this purpose. A software algorithm for task scheduling is developed first, and an appropriate hardware structure is then proposed. The scheme is applied to calculating the inverse kinematics of a PUMA robot. The simulation results show that the computing time is reduced to 4.23 msec when three 8086/87 processors are used, compared with 10 msec for a single 8086/87.
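
Taking the two timings quoted in the abstract at face value, the implied speedup and parallel efficiency follow from the standard definitions. The symbols $T_1$, $T_3$, $S$ and $E$ are introduced here for illustration and do not appear in the paper.

```latex
S = \frac{T_1}{T_3} = \frac{10\ \text{msec}}{4.23\ \text{msec}} \approx 2.36,
\qquad
E = \frac{S}{3} \approx 0.79
```

That is, three processors give roughly 2.4 times the single-processor speed, or about 79% of ideal linear scaling.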

GPU-Based Parallel Collision Detection for Deformable Objects (변형 물체를 위한 GPU 기반 병렬 충돌 감지)

  • Sung, Nak-Jun;Kim, Min Sang;Hong, Min;Choi, Yoo-Joo
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.7 no.1
    • /
    • pp.25-32
    • /
    • 2018
  • Because of its heavy computational cost, deformable object simulation requires a more effective collision detection method than rigid body simulation. However, simply porting a CPU-based collision detection algorithm to the GPU cannot exploit the GPU's performance properly, so a collision detection algorithm and data structure optimized for the GPU environment are essential. In this paper, we therefore propose a GPU-based parallel collision detection algorithm for the mass-spring system, which is widely used to represent deformable objects. The proposed method uses a parallel algorithm and data structure to reduce collision detection cost through a GPU-based culling algorithm built on an AABB-Octree structure. We demonstrate the effectiveness of the proposed method by comparing it with a parallel intersection test of all triangle pairs. The experimental results show that the proposed method improves performance by about 24% on average. The proposed method is therefore expected to improve the performance of real-time simulation of deformable objects.
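
The culling idea can be sketched on the CPU in a few lines. This is a simplified illustration under stated assumptions, not the paper's GPU algorithm: the octree is omitted, and the names `aabb`, `overlaps` and `candidate_pairs` and the sample triangles are hypothetical. Only triangle pairs whose axis-aligned bounding boxes (AABBs) overlap are passed on to the expensive exact intersection test.

```python
from itertools import combinations

def aabb(tri):
    """Axis-aligned bounding box of a triangle given as three (x, y, z) vertices."""
    xs, ys, zs = zip(*tri)
    return (min(xs), min(ys), min(zs)), (max(xs), max(ys), max(zs))

def overlaps(a, b):
    """Two AABBs overlap only if their extents overlap on every axis."""
    (amin, amax), (bmin, bmax) = a, b
    return all(amin[i] <= bmax[i] and bmin[i] <= amax[i] for i in range(3))

def candidate_pairs(tris):
    """Broad phase: keep only the index pairs whose bounding boxes overlap."""
    boxes = [aabb(t) for t in tris]
    return [(i, j) for i, j in combinations(range(len(tris)), 2)
            if overlaps(boxes[i], boxes[j])]

# Hypothetical scene: triangles 0 and 1 are close together, triangle 2 is far away.
tris = [
    ((0, 0, 0), (1, 0, 0), (0, 1, 0)),
    ((0.5, 0.5, -0.5), (1.5, 0.5, 0.5), (0.5, 1.5, 0.5)),
    ((5, 5, 5), (6, 5, 5), (5, 6, 5)),
]

pairs = candidate_pairs(tris)
print(pairs, "of", len(tris) * (len(tris) - 1) // 2, "possible pairs")
```

This sketch still compares every pair of boxes. A spatial hierarchy such as the AABB-Octree named in the abstract would avoid most of those box comparisons as well.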

Virtual Flutter Test of a Spanwise Curved Wing Using CFD/CSD Integrated Coupling Method (CFD/CSD 통합 연계기법을 이용한 횡방향 곡률이 있는 날개의 가상 플러터 시험)

  • Oh, Se-Won;Lee, Jung-Jin;Kim, Dong-Hyun
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.16 no.4 s.109
    • /
    • pp.355-365
    • /
    • 2006
  • A coupled time-integration method with a staggered algorithm, based on computational structural dynamics (CSD), the finite element method (FEM) and computational fluid dynamics (CFD), has been developed to demonstrate physical vibration phenomena caused by dynamic aeroelastic excitations. Virtual flutter tests of the spanwise curved wing model have been conducted effectively using the present advanced computational method with a high-speed parallel processing technique. In addition, the present system can simultaneously produce a recorded data file for generating a virtual animation of the flutter safety test. The results of the virtual flutter tests are compared with experimental wind tunnel data. The results show that spanwise curvature tends to decrease the flutter dynamic pressure for the same flight condition.

Study on Parallel Processing for Efficient Flexible Multibody Analysis based on Subsystem Synthesis Method (병렬 처리를 이용한 부분 시스템 기반 유연다물체 동역학의 효율적인 해석 연구)

  • Han, Jong-Boo;Song, Hajun;Kim, Sung-Soo
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.41 no.6
    • /
    • pp.507-515
    • /
    • 2017
  • Flexible multibody simulations are widely used in industry to design mechanical systems. In flexible multibody dynamics, deformation coordinates are described either relative to a floating body reference frame or in the inertial reference frame. These deformation coordinates are generated by discretizing the body according to the finite element approach. The formulation of a flexible multibody system therefore always involves a very large number of degrees of freedom, and the numerical solution methods require a substantial amount of computational time. Parallel computational methods are one route to efficient computation. However, most parallel computational methods focus on the efficient solution of large systems of linear equations. Multibody analysis needs an efficient formulation that is suited to parallel computation. In this paper, we developed a subsystem synthesis method for a flexible multibody system and proposed efficient parallel computational schemes based on the OpenMP API. Simulations of a rotating blade system consisting of three identical blades were carried out with two different parallel computational schemes. Actual CPU times were measured to investigate the efficiency of the proposed parallel schemes.

Vehicle Headlight Alignment Calibration and Classification Using OpenMP (OpenMP를 이용한 차량 헤드라이트 얼라인먼트 보정 및 분류 방법)

  • Moon, Chang-Bae;Kim, Kun-Hong;Kim, Byeong-Man;Oh, Dukhwan
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.22 no.2
    • /
    • pp.61-70
    • /
    • 2017
  • In this paper, the classification speed of vehicle headlight modules is improved by CPU-based parallel processing using OpenMP. A classification method that extracts features from headlight modules after correcting their alignment is also proposed. To analyze the performance of the proposed method, its discrimination accuracy and processing speed were compared with those of a method using gray images and a method using line detection. In discrimination accuracy, the proposed method and the line detection method both performed well, but the proposed method outperformed the line detection method in processing speed. The gray-based method was the fastest in processing speed, but the proposed method was more accurate than the gray-based method.