• Title/Summary/Keyword: memory interface

A NOVEL PARALLEL METHOD FOR SPECKLE MASKING RECONSTRUCTION USING THE OPENMP

  • LI, XUEBAO;ZHENG, YANFANG
    • Journal of The Korean Astronomical Society / v.49 no.4 / pp.157-162 / 2016
  • High-resolution reconstruction techniques such as speckle masking have been developed to enhance the spatial resolution of observational images from ground-based solar telescopes. Near real-time reconstruction performance has been achieved on a high-performance cluster using the Message Passing Interface (MPI). However, much of the time in such a speckle reconstruction is spent reconstructing solar subimages. We design and implement a novel parallel method for speckle masking reconstruction of a solar subimage on a shared-memory machine using OpenMP. Real tests are performed to verify the correctness of our code. We present the details of several parallel reconstruction steps. The parallel implementation of the various modules shows a large speed increase compared with the single-threaded serial implementation, and a speedup of about 2.5 is achieved in one subimage reconstruction. The timing results for reconstructing one subimage of 256×256 pixels show a clear advantage with a greater number of threads. This novel parallel method can be valuable in real-time reconstruction of solar images, especially after porting to a high-performance cluster.

DEVELOPMENT OF CFD PROGRAM FOR THE CONJUGATE HEAT TRANSFER ANALYSIS OF PMSM ELECTRIC MOTOR (PMSM 전동기 모터의 복합 열전달 해석을 위한 CFD 프로그램 개발)

  • Lee, Jung-Hee;Choi, Jong-Rak;Hur, Nahm-Keon;Kim, Joo-Han;Kim, Young-Kyoun
    • Korean Society of Computational Fluids Engineering: Conference Proceedings / 2011.05a / pp.488-493 / 2011
  • The objective of this study is to develop a program for analyzing the fluid flow and heat transfer of a PMSM electric motor. The program is intended mainly for users inexperienced in CFD analysis, so it must run using only the geometry data and the heat source of each part. An interface program that converts the given data into pre-processor instructions is developed. The conjugate heat transfer between the flow passage of the motor and the inner parts, consisting of the rotor and stator, is considered. To reduce computational time and memory storage, a cyclic boundary condition is applied. For the numerical simulation, the MRF (Multi-Reference Frame) method is used to account for the rotation of the rotor, and heat sources are applied to the copper, wire, and magnetic parts in the motor. Users can view on screen the velocity distributions and contours of quantities such as pressure, turbulent kinetic energy, turbulent dissipation rate and temperature.

Large-eddy simulation of channel flow using a spectral domain-decomposition grid-embedding technique (스펙트럴 영역분할 격자 삽입법을 이용한 채널유동의 큰 에디 모사)

  • Gang, Sang-Mo;Byeon, Do-Yeong;Baek, Seung-Uk
    • Transactions of the Korean Society of Mechanical Engineers B / v.22 no.7 / pp.1030-1040 / 1998
  • One of the main unresolved issues in large-eddy simulation (LES) of wall-bounded turbulent flows is the requirement of high spatial resolution in the near-wall region, especially in the spanwise direction. The high resolution required in the near-wall region is generally used throughout the computational domain, making simulations of high Reynolds number, complex-geometry flows prohibitive. A grid-embedding strategy using a nonconforming spectral domain-decomposition method is proposed to address this limitation. This method provides an efficient way of clustering grid points in the near-wall region with spectral accuracy. LES of transitional and turbulent channel flow has been performed to evaluate the proposed grid-embedding technique. The computational domain is divided into three subdomains to resolve the near-wall regions in the spanwise direction. Spectral patching collocation methods are used for the grid embedding, and appropriate conditions are suggested for the interface matching. Results of LES using the grid-embedding strategy are promising compared with LES using a global spectral method and with direct numerical simulation. Overall, the results show that the spectral domain-decomposition grid-embedding technique provides an efficient method for resolving the near-wall region in LES of complex flows of engineering interest, allowing significant savings in CPU time and memory.

Development of a Real-time Voice Dialing System Using Discrete Hidden Markov Models (이산 HMM을 이용한 실시간 음성인식 다이얼링 시스템 개발)

  • Lee, Se-Woong;Choi, Seung-Ho;Lee, Mi-Suk;Kim, Hong-Kook;Oh, Kwang-Cheol;Kim, Ki-Chul;Lee, Hwang-Soo
    • The Journal of the Acoustical Society of Korea / v.13 no.1E / pp.89-95 / 1994
  • This paper describes the development of a real-time voice dialing system that can recognize a vocabulary of around one hundred words in speaker-independent mode. The voice recognition algorithm in this system is implemented on a DSP board with a telephone interface, plugged into an IBM PC AT/486. On the DSP board, the procedures for feature extraction, vector quantization (VQ), and end-point detection are performed simultaneously in every 10 msec frame interval after the word starting point is detected, in order to satisfy real-time constraints. In addition, we optimize the VQ codebook size and the end-point detection procedure to reduce recognition time and memory requirements. The demonstration system was exhibited in MOBILAB of the Korean Mobile Telecom at the Taejon EXPO'93.

Behavior of Diffusion Layer Formation for TiNi/6061Al Smart Composites by Vacuum Hot Press (진공 Hot Press법에 의한 TiNi/6061Al 지적 복합재료의 확산층 형성거동)

  • Park, Kwang-Hoon;Park, Sung-Ki;Shin, Soon-Gi;Lee, Jun-Hee
    • Korean Journal of Materials Research / v.12 no.12 / pp.955-961 / 2002
  • 2.7 vol% TiNi/6061 Al composites reinforced with TiNi shape memory alloy were fabricated by vacuum hot pressing. The behavior of diffusion layer formation under various heat treatment conditions was investigated by OM, SEM, EPMA and XRD analysis. The thickness of the diffusion layer increased in proportion to the heat treatment time. The layer was formed by the mutual diffusion of TiNi and Al. Diffusion from the TiNi fiber into the Al matrix was faster than diffusion along the reverse path. More of the diffusion layer formed in the Al matrix. The EPMA and XRD results showed that the diffusion layer at the interface consisted of $Al_3$Ti and $Al_3$Ni.

Speech Interactive Agent on Car Navigation System Using Embedded ASR/DSR/TTS

  • Lee, Heung-Kyu;Kwon, Oh-Il;Ko, Han-Seok
    • Speech Sciences / v.11 no.2 / pp.181-192 / 2004
  • This paper presents an efficient speech interactive agent that renders smooth car navigation and telematics services by employing embedded automatic speech recognition (ASR), distributed speech recognition (DSR) and text-to-speech (TTS) modules, all while enabling safe driving. A speech interactive agent is essentially a conversational tool that provides command and control functions to drivers, such as navigation tasks, audio/video manipulation, and e-commerce services, through natural voice/response interactions between user and interface. While the benefits of automatic speech recognition and speech synthesis have become well known, the hardware resources involved are often limited and the internal communication protocols are too complex to achieve real-time responses. As a result, performance degradation always exists in the embedded H/W system. To implement a speech interactive agent that accommodates the demands of user commands in real time, we propose to optimize the hardware-dependent architectural code for speed-up. In particular, we propose a composite solution combining memory reconfiguration, efficient arithmetic operation conversion, and an effective out-of-vocabulary rejection algorithm, all made suitable for system operation under limited resources.

Olfactory Interaction based on ISO/IEC 23005 Standard

  • Choi, Jang-Sik;Chang, Sung-June;Lee, Hae-Ryong;Byun, Hyung-Gi
    • Journal of Sensor Science and Technology / v.26 no.5 / pp.297-300 / 2017
  • Realistic media, which comprise metadata for the five senses and provide enhanced experiences by stimulating our memory and sensations, have become increasingly pervasive in our daily lives. Many researchers and companies are developing their own authoring systems, running on different platforms, to serve realistic media, which results in compatibility issues among the systems. To tackle these issues, the International Organization for Standardization has standardized the interface, data format, protocol, API, etc. required to provide realistic media. In particular, the ISO/IEC 23005 standard, known as MPEG-V in SC29/WG 11, defines XML schemas for olfactory interaction based on the electronic nose (E-Nose) and scent display. In this paper, the MPEG-V standard for olfactory interaction is reviewed, and a data flow diagram that can be used for olfactory interaction based on the MPEG-V standard is designed. In addition, the necessary schemas related to the E-Nose sensor for olfactory interaction are provided.

Design and Evaluation of Data Input/output for Video Conference System (화상회의 시스템에서의 데이터 입출력 설계 및 평가)

  • 김현기
    • Journal of Korea Society of Industrial Information Systems / v.8 no.2 / pp.38-44 / 2003
  • In this paper, we analyze the architecture and input/output model of a video conference system and propose a method in which multimedia data is transferred simultaneously from the network interface card to both the main memory and the multimedia processor, in order to relieve the system bus bottleneck. The proposed method can reduce the number of system bus accesses, bus cycles, data transmission time and compression ratio of video data in the video conference system. We compared the performance of the proposed method with that of conventional methods in multi-party video conference systems. The simulation results showed that the proposed method reduced the transmission time of multimedia data compared with the conventional method.

AndroScope: An Insightful Performance Analyzer for All Software Layers of the Android-Based Systems

  • Cho, Myeongjin;Lee, Ho Jin;Kim, Minseong;Kim, Seon Wook
    • ETRI Journal / v.35 no.2 / pp.259-269 / 2013
  • Android has become the most popular platform for mobile devices. However, Android still has critical performance issues, such as "application not responding" errors and hiccups resulting from garbage collection. Many phone vendors have tried to resolve these problems by characterizing and improving performance. However, there are few insightful performance analysis tools for Android-based systems. This paper presents AndroScope, a performance analysis tool for both the Android platform (Dalvik virtual machine, core libraries, Android libraries, and even Linux kernels) and its applications. To the best of our knowledge, this is the first tool to collect and analyze performance data from all the software layers of Android-based systems. AndroScope offers a trace mechanism to collect deep and wide performance data such as hardware performance counters, time, and memory usage. In addition, the tool includes TraceBridge, a middleware for the fast handling of mass logs. Moreover, AndroScope offers a graphical user interface, integrated with the Android software development kit, to display a large volume of detailed performance data.

Low Power Trace Cache for Embedded Processor

  • Moon Je-Gil;Jeong Ha-Young;Lee Yong-Surk
    • Proceedings of the IEEK Conference / summer / pp.204-208 / 2004
  • The embedded market will continue to expand as customers seek more wearable and ubiquitous systems. Cellular telephones, PDAs, notebooks and portable multimedia devices could bring higher microprocessor revenues and more rewarding improvements in performance and functions. Battery capacity, however, is still increasing only slowly. Until a small practical fuel cell becomes available, microprocessor developers must come up with power-reduction methods. According to MPR 2003, the instruction and data caches of the ARM920T processor consume 44% of total processor power. The rest is split among the integer core, memory management units, bus interface unit and other essential CPU circuitry. The relationships among the CPU, peripherals and caches may change in the future. A processor working at a higher operating frequency will require larger cache RAM and consume more energy. In this paper, we propose an advanced low-power trace cache that caches traces of the dynamic instruction stream and reduces cache access times. We evaluate the performance of the trace cache and estimate its power, comparing it with a conventional cache.
