• Title/Summary/Keyword: multi-cycle architecture

Search Result 59, Processing Time 0.024 seconds

An Improvement of Implementation Method for Multi-Layer AHB BusMatrix (ML-AHB 버스 매트릭스 구현 방법의 개선)

  • Hwang Soo-Yun;Jhang Kyoung-Sun
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.32 no.11_12
    • /
    • pp.629-638
    • /
    • 2005
  • In the System on a Chip design, the on chip bus is one of the critical factors that decides the overall system performance. Especially, in the case or reusing the IPs such as processors, DSPs and multimedia IPs that requires higher bandwidth, the bandwidth problems of on chip bus are getting more serious. Recently ARM proposes the Multi-Layer AHB BusMatrix that is a highly efficient on chip bus to solve the bandwidth problems. The Multi-Layer AHB BusMatrix allows parallel access paths between multiple masters and slaves in a system. This is achieved by using a more complex interconnection matrix and gives the benefit of increased overall bus bandwidth, and a more flexible system architecture. However, there is one clock cycle delay for each master in existing Multi-Layer AHB BusMatrix whenever the master starts new transactions or changes the slave layers because of the Input Stage and arbitration logic realized with Moore type. In this paper, we improved the existing Multi-Layer AHB BusMatrix architecture to solve the one clock cycle delay problems and to reduce the area overhead of the Input Stage. With the elimination of the Input Stage and some restrictions on the arbitration scheme, we tan take away the one clock cycle delay and reduce the area overhead. Experimental results show that the end time of total bus transaction and the average latency time of improved Multi-Layer AHB BusMatrix are improved by $20\%\;and\;24\%$ respectively. in ease of executing a number of transactions by 4-beat incrementing burst type. Besides the total area and the clock period are reduced by $22\%\;and\;29\%$ respectively, compared with existing Multi-layer AHB BusMatrix.

An Ameliorated Design Method of ML-AHB BusMatrix

  • Hwang, Soo-Yun;Jhang, Kyoung-Sun;Park, Hyeong-Jun;Bae, Young-Hwan;Cho, Han-Jin
    • ETRI Journal
    • /
    • v.28 no.3
    • /
    • pp.397-400
    • /
    • 2006
  • The multi-layer advanced high-performance bus (ML-AHB) BusMatrix proposed by ARM is an excellent architecture for applying embedded systems with low power. However, there is one clock cycle delay for each master in the ML-AHB BusMatrix of the advanced microcontroller bus architecture (AMBA) design kit (ADK) whenever a master starts new transactions or changes the slave layers. In this letter, we propose an improved design method to remove the one clock cycle delay in the ML-AHB BusMatrix of an ADK. We also remarkably reduce the total area and power consumption of the ML-AHB BusMatrix of an ADK with the elimination of the heavy input stages.

  • PDF

Spectral Shape in Accordance with the Magnitude and Distance of Earthquakes and Its Effect on Multi-DOF Structures (지진의 규모와 거리에 따른 스펙트럼 형상과 다자유도 구조물에 대한 영향)

  • Kim, Jin Woo;Kim, Dong Kwan;Kim, Ho Soo
    • Journal of the Earthquake Engineering Society of Korea
    • /
    • v.24 no.1
    • /
    • pp.49-57
    • /
    • 2020
  • In this study earthquake records were collected for rock conditions that do not reflect seismic amplification by soil from global earthquake databases such as PEER, USGS, and ESMD. The collected earthquake records were classified and analyzed based on the magnitude and distance of earthquakes. Based on the analyzed earthquakes, the design response spectrum shape, effective ground acceleration, and amplification ratios for each period band are presented. In addition, based on the analyzed data, the story shear force for 5F, 10F, 15F, and 20F were derived through an analysis of the elastic time history for multi-DOF structures. The results from analyzing the rock earthquake record show that the seismic load tends to be amplified greatly in the short period region, which is similar to results observed from the Gyeongju and Pohang earthquakes. In addition, the results of the multi-DOF structure analysis show that existing seismic design criteria can be underestimated and designed in the high-order mode of short- and medium-long cycle structures.

Modal parameter identification of tall buildings based on variational mode decomposition and energy separation

  • Kang Cai;Mingfeng Huang;Xiao Li;Haiwei Xu;Binbin Li;Chen Yang
    • Wind and Structures
    • /
    • v.37 no.6
    • /
    • pp.445-460
    • /
    • 2023
  • Accurate estimation of modal parameters (i.e., natural frequency, damping ratio) of tall buildings is of great importance to their structural design, structural health monitoring, vibration control, and state assessment. Based on the combination of variational mode decomposition, smoothed discrete energy separation algorithm-1, and Half-cycle energy operator (VMD-SH), this paper presents a method for structural modal parameter estimation. The variational mode decomposition is proved to be effective and reliable for decomposing the mixed-signal with low frequencies and damping ratios, and the validity of both smoothed discrete energy separation algorithm-1 and Half-cycle energy operator in the modal identification of a single modal system is verified. By incorporating these techniques, the VMD-SH method is able to accurately identify and extract the various modes present in a signal, providing improved insights into its underlying structure and behavior. Subsequently, a numerical study of a four-story frame structure is conducted using the Newmark-β method, and it is found that the relative errors of natural frequency and damping ratio estimated by the presented method are much smaller than those by traditional methods, validating the effectiveness and accuracy of the combined method for the modal identification of the multi-modal system. Furthermore, the presented method is employed to estimate modal parameters of a full-scale tall building utilizing acceleration responses. The identified results verify the applicability and accuracy of the presented VMD-SH method in field measurements. The study demonstrates the effectiveness and robustness of the proposed VMD-SH method in accurately estimating modal parameters of tall buildings from acceleration response data.

A Software And Hardware Scheme For Reducing The Branch Penalty In Parallel Computers (병렬구조 컴퓨터에서 Branch penalty를 감소시키기 위한 소프트웨어와 하드웨어 방법)

  • 함찬숙;조종현;조영일
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.30B no.11
    • /
    • pp.11-16
    • /
    • 1993
  • VLIW architecture capable of testing multiple conditions in a cycle must support an efficient mechanism for multi-way branches. This paper proposes a mechanism to speed up the execution of multi-way branches and an efficient memory packing method of instructions, which reduced the wasted memory space. Also, we develops a new compiler technique which can transform program segments that are not applied to multi-way branches into ones that are applied to multi-way branches. The benefits gained by the transformation are to reduce branch penalty and to increase instruction-level parallelism.

  • PDF

An Optimal Instruction Fetch Strategy for SMT Processors (SMT 프로세서에 최적화된 명령어 페치 전략에 관한 연구)

  • 홍인표;문병인;김문경;이용석
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.27 no.5C
    • /
    • pp.512-521
    • /
    • 2002
  • Recently, conventional superscalar RISC processors arrive their performance limit, and many researches on the next-generation architecture are concentrated on SMT(Simultaneous Multi-Threading). In SMT processors, multiple threads are executed simultaneously and share hardware resources dynamically. In this case, it is more important to supply instructions from multiple threads to processor core efficiently than ever. Because SMT architecture shows higher IPC(Instructions per cycle) than superscalar architecture, performance is influenced by fetch bandwidth and the size of fetch queue. Moreover, to use TLP(Thread Level Parallelism) efficiently, fetch thread selection algorithm and fetch bandwidth for each selected threads must be carefully designed. Thus, in this paper, the performance values influenced by these factors are analyzed. Based on the results, an optimal instruction fetch strategy for SMT processors is proposed.

Rapid Data Allocation Technique for Multiple Memory Bank Architectures (다중 메모리 뱅크 구조를 위한 고속의 자료 할당 기법)

  • 조정훈;백윤홍;최준식
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2003.10a
    • /
    • pp.196-198
    • /
    • 2003
  • Virtually every digital signal processors(DSPs) support on-chip multi- memory banks that allow the processor to access multiple words of data from memory in a single instruction cycle. Also, all existing fixed-point DSPs have irregular architecture of heterogeneous register which contains multiple register files that are distributed and dedicated to different sets of instructions. Although there have been several studies conducted to efficiently assign data to multi-memory banks, most of them assumed processors with relatively simple, homogeneous general-purpose resisters. Therefore, several vendor-provided compilers fer DSPs were unable to efficiently assign data to multiple data memory banks. thereby often failing to generate highly optimized code fer their machines. This paper presents an algorithm that helps the compiler to efficiently assign data to multi- memory banks. Our algorithm differs from previous work in that it assigns variables to memory banks in separate, decoupled code generation phases, instead of a single, tightly-coupled phase. The experimental results have revealed that our decoupled algorithm greatly simplifies our code generation process; thus our compiler runs extremely fast, yet generates target code that is comparable In quality to the code generated by a coupled approach

  • PDF

Brief Overview on Design Techniques and Architectures of SAR ADCs

  • Park, Kunwoo;Chang, Dong-Jin;Ryu, Seung-Tak
    • Journal of Semiconductor Engineering
    • /
    • v.2 no.1
    • /
    • pp.99-108
    • /
    • 2021
  • Successive Approximation Register (SAR) Analog-to-Digital Converters (ADC) seem to become the hottest ADC architecture during the past decade in implementing energy-efficient high performance ADCs. In this overview, we will review what kind of circuit techniques and architectural advances have contributed to place the SAR ADC architecture at its current position, beginning from a single SAR ADC and moving to various hybrid architectures. At the end of this overview, a recently reported compact and high-speed SAR-Flash ADC is introduced as one design example of SAR-based hybrid ADC architecture.

Implementation of Ethernet-Based High-Speed Data Communication for Multi-core DSP (멀티 코어 DSP를 위한 이더넷 기반 고속 데이터 통신 구현)

  • Nguyen, Dung Huy;Choi, Joon-Young
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.17 no.3
    • /
    • pp.185-190
    • /
    • 2022
  • We propose a high speed data communication method for motor drive systems with fast control cycle in order to collect state variables of motor control without degrading control performance. Ethernet is chosen for communication device, and multi-core DSP architecture is exploited for communication processing load distribution. The communication program including network protocol stack and motor control program are assigned to two separate cores, and data between two cores are exchanged using interrupt-based inter-process communication mechanism, which enables to achieve a high-speed communication performance without degrading the motor control performance. The performance of developed communication method is demonstrated by real experiments using TCP, UDP and Raw Socket protocols in an experimental setup consisting of TI's TMS320F28388D motor control card and MS Windows PC.

Design of High Performance Multi-mode 2D Transform Block for HEVC (HEVC를 위한 고성능 다중 모드 2D 변환 블록의 설계)

  • Kim, Ki-Hyun;Ryoo, Kwang-Ki
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.18 no.2
    • /
    • pp.329-334
    • /
    • 2014
  • This paper proposes the hardware architecture of high performance multi-mode 2D forward transform for HEVC which has same number of cycles for processing any type of four TUs and yield high throughput. In order to make the original image which has high pixel and high resolution into highly compressed image effectively, the transform technique of HEVC supports 4 kinds of pixel units, TUs and it finds the optimal mode after performs each transform computation. As the proposed transform engine uses the common computation operator which is produced by analyzing the relationship among transform matrix coefficients, it can process every 4 kinds of TU mode matrix operation with 35cycles equally. The proposed transform block was designed by Verilog HDL and synthesized by using TSMC 0.18um CMOS processing technology. From the results of logic synthesis, the maximum operating frequency was 400MHz and total gate count was 214k gates which has the throughput of 10-Gpels/cycle with the $4k(3840{\times}2160)@30fps$ image.