• Title/Summary/Keyword: One-chip processor

Search Result 109, Processing Time 0.02 seconds

FPGA-Based Hardware Accelerator for Feature Extraction in Automatic Speech Recognition

  • Choo, Chang;Chang, Young-Uk;Moon, Il-Young
    • Journal of information and communication convergence engineering
    • /
    • v.13 no.3
    • /
    • pp.145-151
    • /
    • 2015
  • We describe in this paper a hardware-based improvement scheme of a real-time automatic speech recognition (ASR) system with respect to speed by designing a parallel feature extraction algorithm on a Field-Programmable Gate Array (FPGA). A computationally intensive block in the algorithm is identified implemented in hardware logic on the FPGA. One such block is mel-frequency cepstrum coefficient (MFCC) algorithm used for feature extraction process. We demonstrate that the FPGA platform may perform efficient feature extraction computation in the speech recognition system as compared to the generalpurpose CPU including the ARM processor. The Xilinx Zynq-7000 System on Chip (SoC) platform is used for the MFCC implementation. From this implementation described in this paper, we confirmed that the FPGA platform is approximately 500× faster than a sequential CPU implementation and 60× faster than a sequential ARM implementation. We thus verified that a parallelized and optimized MFCC architecture on the FPGA platform may significantly improve the execution time of an ASR system, compared to the CPU and ARM platforms.

Parallel Load Techinques Application for Transcranial Magnetic Stimulation

  • Choi, Sun-Seob;Kim, Whi-Young
    • Journal of Magnetics
    • /
    • v.17 no.1
    • /
    • pp.27-32
    • /
    • 2012
  • Transcranial magnetic stimulation requires an electric field composed of dozens of V/m to achieve stimulation. The stimulation system is composed of a stimulation coil to form the electric field by charging and discharging a capacitor in order to save energy, thus requiring high-pressure kV. In particular, it is charged and discharged in capacitor to discharge through stimulation coil within a short period of time (hundreds of seconds) to generate current of numerous kA. A pulse-type magnetic field is formed, and eddy currents within the human body are triggered to achieve stimulation. Numerous pulse forms must be generated to initiate eddy currents for stimulating nerves. This study achieved high internal pressure, a high number of repetitions, and rapid switching of elements, and it implemented numerous control techniques via introduction of the half-bridge parallel load method. In addition it applied a quick, accurate, high-efficiency charge/discharge method for transcranial magnetic stimulation to substitute an inexpensive, readily available, commercial frequency condenser for a previously used, expensive, high-frequency condenser. Furthermore, the pulse repetition rate was altered to control energy density, and grafts compact, one-chip processor with simulation to stably control circuit motion and conduct research on motion and output characteristics.

Optimized Digital Proportional Integral Derivative Controller for Heating and Cooling Injection Molding System

  • Jeong, Byeong-Ho;Kim, Nam-Hoon;Lee, Kang-Yeon
    • Journal of Electrical Engineering and Technology
    • /
    • v.10 no.3
    • /
    • pp.1383-1388
    • /
    • 2015
  • Proportional integral derivative (PID) control is one of the conventional control strategies. Industrial PID control has many options, tools, and parameters for dealing with the wide spectrum of difficulties and opportunities in manufacturing plants. It has a simple control structure that is easy to understand and relatively easy to tune. Injection mold is warming up to the idea of cycling the tool surface temperature during the molding cycle rather than keeping it constant. This “heating and cooling” process has rapidly gained popularity abroad. However, it has discovered that raising the mold wall temperature above the resin’s glass-transition or crystalline melting temperature during the filling stage is followed by rapid cooling and improved product performance in applications from automotive to packaging to optics. In previous studies, optimization methods were mainly selected on the basis of the subjective experience. Appropriate techniques are necessary to optimize the cooling channels for the injection mold. In this study, a digital signal processor (DSP)-based PID control system is applied to injection molding machines. The main aim of this study is to optimize the control of the proposed structure, including a digital PID control method with a DSP chip in the injection molding machine.

The Study of Reclaimer of Antiseptic Solution for Winter-sowing Prevention of a Vehicle Disinfector at Livestock Farm (축산농가 차량소독기의 동파방지를 위한 약액 회수장치에 관한 연구)

  • Kim, W.;Lee, S.K.
    • Journal of Animal Environmental Science
    • /
    • v.13 no.1
    • /
    • pp.29-34
    • /
    • 2007
  • This study was conducted to develop a reclaimer of the vehicle disinfector to be used at livestock fm. The reclaimer was mainly consisted of ball-valves, geared motors and one-chip processor, and the purpose of the system was to prevent liquid freezing as well as decrease environmental pollution of antiseptic solution. The properly spraying pressure of the vehicle disinfector was found over 1.96 MPa at 1m of the spraying range. While certain amount of the antiseptic solution remained in the injection-pipes, the spray starting time was found not making any significant effect on the remained amount of the antiseptic solution. The amounts of the antiseptic solution remained in the injection-pipes were 50 ml and 270 ml in average, respectively with and without the use of the reclaimer. The reclaimer was the most effective when the connection of the injection-pipe and sprayer line was located below the side-injection-pipe and then connected to the injection-pipe located at the bottom of vehicles.

  • PDF

Application-specific Traffic Generator (응용 프로그램의 특성 반영이 가능한 트래픽 생성기)

  • Yeo, Phil-Koo;Cho, Keol;Yu, Dae-Chul;Hwang, Young-Si;Chung, Ki-Seok
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.48 no.9
    • /
    • pp.40-49
    • /
    • 2011
  • Integrating massive components and low-power policies have been actively investigated for system-on-chip designs. But in recent years, finding the optimal interconnection structure among heterogeneous components has emerged as a critical system design issue. Therefore, various simulation tools to model interconnection designs are being developed and performance evaluation of simulation is reflected in the real design. But most of the simulation environments employ traffic generation based on the mathematical probability functions, and such traffic generation cannot fully cover for various situations that may be occurred in the real system. Therefore, the demand for traffic pattern generation based on real applications is increasing. However, there have been few simulators that adopt application-specific traffic generators. This paper proposes a novel traffic generation method in simulating various interconnection structures for multi-processor system-on-chip design. The proposed traffic generation method can generate traffic patterns that can reflect the actual characteristics of the application and evaluate the performance of an interconnection structure under more realistic circumstance than traffic patterns using mathematical probability functions. By comparing the differences between the proposed method and the one based on mathematical probability functions, this paper shows advantages of the proposed traffic generation method.

Design and Implementation of Adaptive Beam-forming System for Wi-Fi Systems (무선랜 시스템을 위한 적응형 빔포밍 시스템의 설계 및 구현)

  • Oh, Joohyeon;Gwag, Gyounghun;Oh, Youngseok;Cho, Sungmin;Oh, Hyukjun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.18 no.9
    • /
    • pp.2109-2116
    • /
    • 2014
  • This paper presents the implementation and design of the advanced WI-FI systems with beam-forming antenna that radiate their power to the direction of user equipment to improve the overall throughput, contrast to the general WI-FI systems equipped with omni-antenna. The system consists of patch array antenna, DSP, FPGA, and Qualcomm's commercial chip. The beam-forming system on the FPGA utilizes the packet information from Qualcomm's commercial chip to control the phase shifters and attenuators of the patch array antenna. The PCI express interface has been used to maximize the communication speed between DSP and FPGA. The directions of arrival of users are managed using the database, and each user is distinguished by the MAC address given from the packet information. When the system wants to transmit a packet to one user, it forms beams to the direction of arrival of the corresponding user stored in the database to maximize the throughput. Directions of arrival of users are estimated using the received preamble in the packet to make its SINR as high as possible. The proposed beam-forming system was implemented using an FPGA and Qualcommm's commercial chip together. The implemented system showed considerable throughput improvement over the existing general AP system with omni-directional antenna in the multi-user communication environment.

Low-Power Cache Design by using Locality Buffer and Address Compression (지역 버퍼와 주소 압축을 통한 저전력 캐시 설계)

  • Kwak, Jong Wook
    • Journal of the Korea Society of Computer and Information
    • /
    • v.18 no.9
    • /
    • pp.11-19
    • /
    • 2013
  • Most modern computer systems employ cache systems in order to alleviate the access time gap between processor and memory system. The power dissipated by the cache systems becomes a significant part of the total power dissipated by whole microprocessor chip. Therefore, power reduction in the cache system becomes one of the important issues. Partial tag cache is the system for the least power consumption. The main power reduction for this method is due to the use of small partial tag matching, not full tag matching. In this paper, we first analyze the previous regular partial tag cache systems and propose a new address matching mechanism by using locality buffer and address compression. In simulation results, the proposed model shows 18% power reduction in average, still providing same performance level, compared to regular cache.

Hardware Architecture and its Design of Real-Time Video Compression Processor for Motion JPEG2000 (Motion JPEG2000을 위한 실시간 비디오 압축 프로세서의 하드웨어 구조 및 설계)

  • 서영호;김동욱
    • The Transactions of the Korean Institute of Electrical Engineers D
    • /
    • v.53 no.1
    • /
    • pp.1-9
    • /
    • 2004
  • In this paper, we proposed a hardware(H/W) structure which can compress and recontruct the input image in real time operation and implemented it into a FPGA platform using VHDL(VHSIC Hardware Description Language). All the image processing element to process both compression and reconstruction in a FPGA were considered each of them was mapped into a H/W with the efficient structure for FPGA. We used the DWT(discrete wavelet transform) which transforms the data from spatial domain to the frequency domain, because use considered the motion JPEG2000 as the application. The implemented H/W is separated to both the data path part and the control part. The data path part consisted of the image processing blocks and the data processing blocks. The image processing blocks consisted of the DWT Kernel for the filtering by DWT, Quantizer/Huffman Encoder, Inverse Adder/Buffer for adding the low frequency coefficient to the high frequency one in the inverse DWT operation, and Huffman Decoder. Also there existed the interface blocks for communicating with the external application environments and the timing blocks for buffering between the internal blocks. The global operations of the designed H/W are the image compression and the reconstruction, and it is operated by the unit or a field synchronized with the A/D converter. The implemented H/W used the 54%(12943) LAB(Logic Array Block) and 9%(28352) ESB(Embedded System Block) in the APEX20KC EP20K600CB652-7 FPGA chip of ALTERA, and stably operated in the 70MHz clock frequency. So we verified the real time operation. that is. processing 60 fields/sec(30 frames/sec).

High Performance Coprocessor Architecture for Real-Time Dense Disparity Map (실시간 Dense Disparity Map 추출을 위한 고성능 가속기 구조 설계)

  • Kim, Cheong-Ghil;Srini, Vason P.;Kim, Shin-Dug
    • The KIPS Transactions:PartA
    • /
    • v.14A no.5
    • /
    • pp.301-308
    • /
    • 2007
  • This paper proposes high performance coprocessor architecture for real time dense disparity computation based on a phase-based binocular stereo matching technique called local weighted phase-correlation(LWPC). The algorithm combines the robustness of wavelet based phase difference methods and the basic control strategy of phase correlation methods, which consists of 4 stages. For parallel and efficient hardware implementation, the proposed architecture employs SIMD(Single Instruction Multiple Data Stream) architecture for each functional stage and all stages work on pipelined mode. Such that the newly devised pipelined linear array processor is optimized for the case of row-column image processing eliminating the need for transposed memory while preserving generality and high throughput. The proposed architecture is implemented with Xilinx HDL tool and the required hardware resources are calculated in terms of look up tables, flip flops, slices, and the amount of memory. The result shows the possibility that the proposed architecture can be integrated into one chip while maintaining the processing speed at video rate.

Implementation of a System for RFID Education to be based on an EPC global Network Standard (EPC global Network 표준을 따르는 RFID 교육용 시스템의 구현)

  • Kim, Dae-Hee;Chung, Joong-Soo;Kim, Hyu-Chan;Jung, Kwang-Wook;Kim, Seog-Gyu
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.11
    • /
    • pp.90-99
    • /
    • 2009
  • This paper presents the implementation of RFID EPC global network educational system based on using 900MHz air interface between the reader and the active tag. The software of reader and the active tag is developed on embedded environment, and the software of PC controlling the reader is based on window OS operated as the server. The ATmega128 VLSI chip is used for the processor of the reader and the active tag. As the development environment, AVR compiler is used for the reader and the active tag of which the programming language is C. The visual C++language of the visual studio on the PC activated as the server is used for development language. Main functions of this system are to control tag containing EPC global Data by PC through the reader, to obtain information of tag through the internet and to read/write data on tag memory. Finally the data written from the active tag's memory is sent to the PC via the reader as "read" operation and compare the received data with one already sent to the tag. Software implementation of 900MHz EPC global RFID educational system is done on the basis of these functions.