• Title/Summary/Keyword: and Parallel Processing


A New Synchronization Scheme for Parallel Processing of Loop with Constant and Variable Dependence Distance (불변 및 가변 종속거리를 갖는 루프의 병렬처리를 위한 새로운 동기화 기법)

  • 이광형;황종선;박두순
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.32B no.5
    • /
    • pp.693-701
    • /
    • 1995
  • In most application programs, loops account for most of the computation and are the most important source of parallelism. When loops are executed on multiprocessors, cross-iteration data dependences must be enforced by synchronization between processors. Existing synchronization schemes have been studied mainly for loops with constant dependence distance. When these schemes are applied to loops with variable dependence distance, they incur considerable overhead from unnecessary synchronization variables and the execution of useless synchronization instructions. Although various synchronization schemes for variable dependence distance exist, they carry substantial run-time overhead to compute synchronization information. In this paper, we present a new synchronization scheme, Synch-Free/Synch-Hold, for managing synchronization efficiently on loops with constant and variable dependence distance.


A Study on the Auto-diagnosis Plethysmograph by Novel Algorithm for Radial Pulse Detection (새로운 맥파 인식 알고리즘을 적용한 자동 맥파 시스템에 관한 연구)

  • 박승환;홍승홍
    • Journal of Biomedical Engineering Research
    • /
    • v.17 no.2
    • /
    • pp.241-246
    • /
    • 1996
  • This paper describes a new system that detects the radial pulse wave and supports the diagnosis of malfunctions of the cardiovascular system by analyzing the waveforms with a newly proposed algorithm. The system consists of a sensor part and a data processing part that incorporates the new detection algorithm. To acquire the radial pulse signal noninvasively, the system uses a new combinational fiber-optic sensor with a detecting part and a transmitting part. To analyze the characteristics of pulsation quantitatively, the proposed algorithm processes the ECG data and the differential pulse data in parallel. These concepts are based on the idea that three Q points of the ECG clearly delimit one entire period of the pulse even in abnormal cases, and that newly defined feature lines on the differential pulse can be used to recognize some significant points within one pulse period.


A Genetic Algorithm for Dynamic Job Shop Scheduling (동적 Job Shop 일정계획을 위한 유전 알고리즘)

  • 박병주;최형림;김현수;이상완
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.27 no.2
    • /
    • pp.97-109
    • /
    • 2002
  • Manufacturing environments in the real world are subject to many sources of change and uncertainty, such as new job releases, job cancellations, and changes in the processing time or start time of some operations. A realistic scheduling method should therefore properly reflect this dynamic environment. Based on the release times of jobs, the JSSP (Job Shop Scheduling Problem) can be classified into static and dynamic scheduling problems. In this research, we mainly consider the dynamic JSSP with continually arriving jobs. The goal of this research is to develop an efficient scheduling method based on a GA (Genetic Algorithm) to address the dynamic JSSP. We designed scheduling methods based on an SGA (Single Genetic Algorithm) and a PGA (Parallel Genetic Algorithm). The GA-based scheduling method is extended to address the dynamic JSSP. These algorithms are then tested for scheduling and rescheduling in the dynamic JSSP. The results are compared with those of dispatching rules. In comparison to dispatching rules, the GA approach produces better scheduling performance.

Inspection of Cracks on the Express Train Wheel Using a High Speed Scan Type Magnetic Camera (초고속 스캔형 자기카메라에 의한 고속열차 차륜 탐상)

  • Lee, Jin-Yi;Hwang, Ji-Seong;Kwon, Seok-Jin;Seo, Jung-Won
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.32 no.11
    • /
    • pp.943-950
    • /
    • 2008
  • A novel nondestructive testing (NDT) system that can detect cracks at high speed and with high spatial resolution is urgently required for inspecting small cracks on express train wheels. This paper proposes a high-speed scan-type magnetic camera that uses multiple amplifying circuits and a crack-indicating pulse output system. The linearly integrated Hall sensors are arrayed in parallel, and the Hall voltages from each sensor in the scanning direction are obtained and amplified. High-speed NDT can be achieved by using a dedicated analog-digital converter and microprocessor because the $\partial V_H/\partial x$ value, which provides the most important crack information, can be obtained by buffering and calculating. The effectiveness of the novel method was verified by examining cracks on a wheel specimen model.
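The abstract above says the $\partial V_H/\partial x$ value is obtained by buffering and calculating, but it does not give the computation. A minimal sketch of one way to do this is a forward difference over buffered samples from a single sensor. The function names, the sample spacing `dx`, and the threshold test are illustrative assumptions, not the authors' implementation.

```python
def scan_gradient(v_h, dx):
    """Forward-difference estimate of dV_H/dx along the scan direction.

    v_h: buffered Hall voltages from one sensor (hypothetical input).
    dx:  distance between consecutive samples (assumed constant).
    """
    if dx <= 0:
        raise ValueError("dx must be positive")
    return [(b - a) / dx for a, b in zip(v_h, v_h[1:])]


def crack_indices(v_h, dx, threshold):
    """Sample indices where |dV_H/dx| exceeds a threshold.

    The threshold rule is an assumption made for illustration only.
    """
    return [i for i, g in enumerate(scan_gradient(v_h, dx)) if abs(g) > threshold]
```

A sharp local change in Hall voltage appears as a pair of large gradients of opposite sign, which is why the sketch thresholds the absolute value.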

A Low-Complexity 128-Point Mixed-Radix FFT Processor for MB-OFDM UWB Systems

  • Cho, Sang-In;Kang, Kyu-Min
    • ETRI Journal
    • /
    • v.32 no.1
    • /
    • pp.1-10
    • /
    • 2010
  • In this paper, we present a fast Fourier transform (FFT) processor with four parallel data paths for multiband orthogonal frequency-division multiplexing ultra-wideband systems. The proposed 128-point FFT processor employs both a modified radix-$2^4$ algorithm and a radix-$2^3$ algorithm to significantly reduce the numbers of complex constant multipliers and complex Booth multipliers. It also employs substructure-sharing multiplication units instead of constant multipliers to efficiently conduct multiplication operations with only addition and shift operations. The proposed FFT processor is implemented and tested using 0.18 $\mu$m CMOS technology with a supply voltage of 1.8 V. The hardware-efficient 128-point FFT processor with four data streams can support a data processing rate of up to 1 Gsample/s while consuming 112 mW. The implementation results show that the proposed 128-point mixed-radix FFT architecture significantly reduces the hardware cost and power consumption in comparison to existing 128-point FFT architectures.
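The general idea of multiplying by a constant with only additions and shifts can be sketched in software as follows. This is not the paper's substructure-sharing unit. It only shows the underlying shift-and-add decomposition, and the fixed-point constant 181/256 (approximately $1/\sqrt{2}$) is an assumption chosen for illustration.

```python
def shift_add_multiply(x, constant):
    """Multiply integer x by a non-negative integer constant using shifts and adds."""
    if constant < 0:
        raise ValueError("constant must be non-negative")
    acc = 0
    shift = 0
    while constant:
        if constant & 1:          # this bit of the constant is set
            acc += x << shift     # add the correspondingly shifted copy of x
        constant >>= 1
        shift += 1
    return acc


def scale_by_inv_sqrt2(x):
    """Approximate x / sqrt(2) in 8-bit fixed point (181/256 ~ 0.707, assumed)."""
    return shift_add_multiply(x, 181) >> 8
```

Here 181 is binary 10110101, so the product uses five shifted additions and one final right shift, with no general multiplier.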

Auto-Tuning of Reference Model Based PID Controller Using Immune Algorithm

  • Kim, Dong-Hwa;Park, Jin-Ill
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.12 no.3
    • /
    • pp.246-254
    • /
    • 2002
  • In this paper, an auto-tuning scheme for a PID controller based on a reference model is studied for a process control system using an immune algorithm. To date, many sophisticated tuning algorithms have been tried in order to improve PID controller performance under difficult conditions. A number of approaches have also been proposed to implement mixed control structures that combine a PID controller with fuzzy logic. However, in actual plants, controllers are tuned manually through a trial-and-error procedure, and the derivative action is switched off, so they are difficult to tune. Since the immune system possesses a self-organizing and distributed memory, it is adaptive to its external environment and allows a PDP (Parallel Distributed Processing) network to complete patterns against the environmental situation. Simulation results reveal that the reference-model-based tuning by immune network suggested in this paper is an effective approach to search for optimal or near-optimal process control.

Scalable Prediction Models for Airbnb Listing in Spark Big Data Cluster using GPU-accelerated RAPIDS

  • Muralidharan, Samyuktha;Yadav, Savita;Huh, Jungwoo;Lee, Sanghoon;Woo, Jongwook
    • Journal of information and communication convergence engineering
    • /
    • v.20 no.2
    • /
    • pp.96-102
    • /
    • 2022
  • We aim to build predictive models for Airbnb's prices using GPU-accelerated RAPIDS in a big data cluster. The Airbnb Listings datasets are used for the predictive analysis. Several machine-learning algorithms have been adopted to build models that predict the price of Airbnb listings. We compare the results of traditional and big data approaches to machine learning for price prediction and discuss the performance of the models. We built big data models using Databricks Spark Cluster, a distributed parallel computing system. Furthermore, we implemented models on multiple GPUs using RAPIDS in the Spark cluster. The model was developed using the XGBoost algorithm, whereas other models were developed using traditional central processing unit (CPU)-based algorithms. This study compared all models in terms of accuracy metrics and computing time. We observed that the XGBoost model with RAPIDS using GPUs had the highest accuracy and computing time.

COMPUTATION OF SOMBOR INDICES OF OTIS(BISWAPPED) NETWORKS

  • Basavanagoud, B.;Veerapur, Goutam
    • Journal of the Chungcheong Mathematical Society
    • /
    • v.35 no.3
    • /
    • pp.205-225
    • /
    • 2022
  • In this paper, we derive closed-form analytical results for the first (a, b)-KA index, the Sombor index, the modified Sombor index, the first reduced (a, b)-KA index, the reduced Sombor index, the reduced modified Sombor index, the second reduced (a, b)-KA index, and the mean Sombor index mSOα for the OTIS biswapped networks, taking paths, wheel graphs, complete bipartite graphs, and r-regular graphs as basis graphs. Network theory plays a significant role in electronic and electrical engineering, in areas such as signal processing, networking, and communication theory. A topological index (TI) is a real number associated with a graph network that correlates chemical networks with a variety of physical and chemical properties as well as chemical reactivity. The Optical Transpose Interconnection System (OTIS) network has recently received increased interest due to its potential uses in parallel and distributed systems.
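For orientation, the Sombor index of a graph $G$ is commonly defined as $SO(G) = \sum_{uv \in E(G)} \sqrt{d_u^2 + d_v^2}$, where $d_u$ is the degree of vertex $u$. The sketch below computes it numerically for a plain path graph, one of the basis graphs named in the abstract. It does not construct the OTIS biswapped networks or reproduce the paper's closed-form results, and the helper names are illustrative.

```python
import math
from collections import defaultdict


def sombor_index(edges):
    """Sum of sqrt(d_u^2 + d_v^2) over all edges of a simple undirected graph."""
    degree = defaultdict(int)
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return sum(math.hypot(degree[u], degree[v]) for u, v in edges)


def path_edges(n):
    """Edge list of the path graph on vertices 0..n-1 (hypothetical helper)."""
    return [(i, i + 1) for i in range(n - 1)]
```

For a path on $n \ge 3$ vertices, the two end edges each contribute $\sqrt{5}$ and the $n - 3$ inner edges each contribute $2\sqrt{2}$, which gives a simple check against the numerical value.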

Two-Input Max/Min Circuit for Fuzzy Inference System

  • P. Laipasu;A. Chaikla;A. Jaruwanawat;P. Pannil;Lee, T.;V. Riewruja
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.105.3-105
    • /
    • 2001
  • In this paper, a current-mode scheme for two-input maximum (Max) and minimum (Min) operations, which is a useful building block for analog fuzzy inference systems, is presented. The Max and Min operations are incorporated in the same scheme with parallel processing. The proposed scheme comprises a MOS class AB/B configuration and current mirrors. Its simple structure can provide high efficiency. The scheme exhibits a very sharp transfer characteristic and high accuracy. It achieves high-speed operation and is suitable for real-time systems. The simulation results verifying the performance of the scheme agree with the expected values.


Binary Decision/Micro Programmable Controller and Its Application to Traffic Control System (BD/Micro 프로그래머블 제어기와 이의 교통제어 시스템 응용)

  • Woo, Kwang Bang;Kim, Yeong Il;An, Min Ock
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.23 no.5
    • /
    • pp.637-646
    • /
    • 1986
  • In this paper, we investigate and characterize the parallel processing of a hybrid system that interfaces a binary decision (BD) programmable controller with a microcomputer. The generation of analog signals and the estimation of data, which cannot be performed properly by the BD programmable controller alone, are carried out by the expanded controller using the BD/Micro P.C. In addition, the implementation of its hardware and software is considered.
