• Title/Summary/Keyword: Parallel computation

Search Result 594, Processing Time 0.164 seconds

Parallel Prefix Computation and Sorting on a Recursive Dual-Net

  • Li, Yamin;Peng, Shietung;Chu, Wanming
    • Journal of Information Processing Systems
    • /
    • v.7 no.2
    • /
    • pp.271-286
    • /
    • 2011
  • In this paper, we propose efficient algorithms for parallel prefix computation and sorting on a recursive dual-net. The recursive dual-net $RDN^k$(B) for k > 0 has $(2n_o)^{2K}/2$ nodes and $d_0$ + k links per node, where $n_0$ and $d_0$ are the number of nod es and the node-degree of the base-network B, respectively. Assume that each node holds one data item, the communication and computation time complexities of the algorithm for parallel prefix computation on $RDN^k$(B), k > 0, are $2^{k+1}-2+2^kT_{comm}(0)$ and $2^{k+1}-2+2^kT_{comp}(0)$, respectively, where $T_{comm}(0)$ and $T_{comp}(0)$ are the communication and computation time complexities of the algorithm for parallel prefix computation on the base-network B, respectively. The algorithm for parallel sorting on $RDN^k$(B) is restricted on B = $Q_m$ where $Q_m$ is an m-cube. Assume that each node holds a single data item, the sorting algorithm runs in $O((m2^k)^2)$ computation steps and $O((km2^k)^2)$ communication steps, respectively.

DEVELOPMENT OF PARALLEL COMPUTATION METHOD FOR THE p VERSION IN THE FINITE ELEMENT METHOD

  • Kim, Chang-Geun;Cha, Ho-Jung
    • Journal of applied mathematics & informatics
    • /
    • v.6 no.2
    • /
    • pp.649-659
    • /
    • 1999
  • This paper presents a parallel implementation of stiff-ness matrix calculation based on the processor farm model on a net-work of workstations running PVM programming environment. As the computational characteristics of stiffnes matrix exhibits good po-tentials for effective prallel computation the performance improve-ment is show to be almost linear with the number of sorkstations involved in the computation.

SPARSE NULLSPACE COMPUTATION OF EQULILBRIUM MATRICES

  • Jang, Ho-Jong;Cha, Kyung-Joon
    • Communications of the Korean Mathematical Society
    • /
    • v.11 no.4
    • /
    • pp.1175-1185
    • /
    • 1996
  • We study the computation of sparse null bases of equilibrium matrices in the context of structural optimization and incompressible fluid flow. In our approach we emphasize the parallel computatin and examine the applications. New block decomposition and node ordering schemes are suggested, and numerical examples are considered.

  • PDF

Adaptive Application Component Mapping for Parallel Computation Offloading in Variable Environments

  • Fan, Wenhao;Liu, Yuan'an;Tang, Bihua
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.11
    • /
    • pp.4347-4366
    • /
    • 2015
  • Distinguished with traditional strategies which offload an application's computation to a single server, parallel computation offloading can promote the performance by simultaneously delivering the computation to multiple computing resources around the mobile terminal. However, due to the variability of communication and computation environments, static application component multi-partitioning algorithms are difficult to maintain the optimality of their solutions in time-varying scenarios, whereas, over-frequent algorithm executions triggered by changes of environments may bring excessive algorithm costs. To this end, an adaptive application component mapping algorithm for parallel computation offloading in variable environments is proposed in this paper, which aims at minimizing computation costs and inter-resource communication costs. It can provide the terminal a suitable solution for the current environment with a low incremental algorithm cost. We represent the application component multi-partitioning problem as a graph mapping model, then convert it into a pathfinding problem. A genetic algorithm enhanced by an elite-based immigrants mechanism is designed to obtain the solution adaptively, which can dynamically adjust the precision of the solution and boost the searching speed as transmission and processing speeds change. Simulation results demonstrate that our algorithm can promote the performance efficiently, and it is superior to the traditional approaches under variable environments to a large extent.

Parallel Computation Algorithm of Gauss Elimination in Power system Analysis (전력계통해석을 위한 자코비안행렬 가우스소거의병렬계산 알고리즘)

  • 서의석;오태규
    • The Transactions of the Korean Institute of Electrical Engineers
    • /
    • v.43 no.2
    • /
    • pp.189-196
    • /
    • 1994
  • This paper describes a parallel computing algorithm in Gauss elimination of Jacobian matrix to large-scale power system. The structure of Jacobian matrix becomes different according to ordering method of buses. In sequential computation buses are ordered to minimize the number of fill-in in the triangulation of the Jacobian matrix. The proposed method develops the parallelism in the Gauss elimination by using ND(nested dissection) ordering. In this procedure the level structure of the power system network is transformed to be long and narrow by using end buses which results in balance of computing load among processes and maximization of parallel computation. Each processor uses the sequential computation method to preserve the sqarsity of matrix.

  • PDF

Study of Efficient Parallel Computation of Cholesky's Method in FE Mesh (유한요소망에서의 효율적인 직접해법 병렬계산에 관한 연구)

  • Lee, H.B.;Choi, K.;Kim, H.J.;Jung, H.K.;Hahn, S.Y.
    • Proceedings of the KIEE Conference
    • /
    • 1996.07a
    • /
    • pp.68-70
    • /
    • 1996
  • In this paper, an efficient parallel computation method for solving large sparse systems of linear algebraic equations by using Cholesky's method in the finite element method is studied. The methods of minimizing the number of fill-ins in the factorization process of factorization are investigated for minimizing the amount of memory and computation time. The parallel programming is implemented under the PVM(Parallel Virtual Machine) environment. The method of load-distribution is studied for minimizing the computation time and the communication time.

  • PDF

Analysis of Three-dimensional Nonaxisymmetric Spin-up by Using Parallel Computation (병렬계산에 의한 비축대칭 3차원 스핀업 유동해석)

  • Park, Jae-Hyoun;Choi, Yoon-Hwan;Suh, Yong-Kweon
    • Proceedings of the KSME Conference
    • /
    • 2001.06e
    • /
    • pp.512-517
    • /
    • 2001
  • In this study, spin-up flows in a rectangular container are analysed by using three-dimensional computation. In the numerical computation, we use the parallel computer system of PC-cluster type. We compared our results with those obtained by two-dimensional computation. Effect of velocity and vorticity on the flow is studied. The result shows that two-dimensional solution is in good agreement with the 3-D result. Attention is given to the region where the 3-D flow is significant.

  • PDF

Optimal Control of Large-Scale Dynamic Systems using Parallel Processing (병렬처리를 이용한 대규모 동적 시스템의 최적제어)

  • Park, Ki-Hong
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.5 no.4
    • /
    • pp.403-410
    • /
    • 1999
  • In this study, a parallel algorithm has been developed that can quickly solve the optiaml control problem of large-scale dynamic systems. The algorithm adopts the sequential quadratic programming methods and achieves domain decomposition-type parallelism in computing sensitivities for search direction computation. A silicon wafer thermal process problem has been solved using the algorithm, and a parallel efficiency of 45% has been achieved with 16 processors. Practical methods have also been investigated in this study as a way to further speed up the computation time.

  • PDF

Application for parallel computation for finite element analysis of welding processes (용접공정 유한요소 해석의 병렬 처리 적용)

  • 임세영;김주완;최강혁
    • Proceedings of the KWS Conference
    • /
    • 2004.05a
    • /
    • pp.273-275
    • /
    • 2004
  • A parallel multi-frontal solver is developed for finite element analysis of an arc-welding process, which entails phase evolution, heat transfer, and deformations of structure. We verify the code via comparison to a commercial code,SYSWELD. Attention is focused on the implementation of the parallel solver using MPI library, on the speedup by parallel computation, and on the effectiveness of the solver in welding application

  • PDF

Parallel computation for debonding process of externally FRP plated concrete

  • Xu, Tao;Zhang, Yongbin;Liang, Z.Z.;Tang, Chun-An;Zhao, Jian
    • Structural Engineering and Mechanics
    • /
    • v.38 no.6
    • /
    • pp.803-823
    • /
    • 2011
  • In this paper, the three dimensional Parallel Realistic Failure Process Analysis ($RFPA^{3D}$-Parallel) code based on micromechanical model is employed to investigate the bonding behavior in FRP sheet bonded to concrete in single shear test. In the model, the heterogeneity of brittle disordered material at a meso-scale was taken into consideration in order to realistically demonstrate the mechanical characteristics of FRP-to-concrete. Modified Mohr-coulomb strength criterion with tension cut-off, where a stressed element can damage in shear or in tension, was adopted and a stiffness degradation approach was used to simulate the initiation, propagation and growth of microcracks in the model. In addition, a Master-Slave parallel operation control technique was adopted to implement the parallel computation of a large numerical model. Parallel computational results of debonding of FRP-concrete visually reproduce the spatial and temporal debonding failure progression of microcracks in FRP sheet bonded to concrete, which agrees well with the existing testing results in laboratory. The numerical approach in this study provides a useful tool for enhancing our understanding of cracking and debonding failure process and mechanism of FRP-concrete and our ability to predict mechanical performance and reliability of these FRP sheet bonded to concrete structures.