Search | Korea Science

Variable latency L1 data cache architecture design in multi-core processor under process variation

Kong, Joonho
- Journal of the Korea Society of Computer and Information
- /
- v.20 no.9
- /
- pp.1-10
- /
- 2015
In this paper, we propose a new variable latency L1 data cache architecture for multi-core processors. Our proposed architecture extends the traditional variable latency cache to be geared toward the multi-core processors. We added a specialized data structure for recording the latency of the L1 data cache. Depending on the added latency to the L1 data cache, the value stored to the data structure is determined. It also tracks the remaining cycles of the L1 data cache which notifies data arrival to the reservation station in the core. As in the variable latency cache of the single-core architecture, our proposed architecture flexibly extends the cache access cycles considering process variation. The proposed cache architecture can reduce yield losses incurred by L1 cache access time failures to nearly 0%. Moreover, we quantitatively evaluate performance, power, energy consumption, power-delay product, and energy-delay product when increasing the number of cache access cycles.
https://doi.org/10.9708/jksci.2015.20.9.001 인용 PDF KSCI

An Adaptive Energy-Efficient and Low-Latency MAC Protocol for Wireless Sensor Networks

Liu, Hao;Yao, Guoliang;Wu, Jianhui;Shi, Longxing
- Journal of Communications and Networks
- /
- v.12 no.5
- /
- pp.510-517
- /
- 2010
In this paper, an adaptive MAC protocol (variable load adaptive (VLA)-MAC) is proposed for wireless sensor networks. This protocol can achieve high energy efficiency and provide low latency under variable-traffic-load conditions. In the case of VLA-MAC, traffic load is measured online and used for adaptive adjustment. Sensor nodes transmit packets in bursts under high load conditions to alleviate packet accumulation and reduce latency. This also removes unnecessary listen action and decreases energy consumption in low load conditions. Simulation results show that the energy efficiency, latency, and throughput achieved by VLA-MAC are higher than those achieved by some traditional approaches.
PDF KSCI

A Variable Latency K'th Order Newton-Raphson's Floating Point Number Divider (가변 시간 K차 뉴톤-랍손 부동소수점 나눗셈)

Cho, Gyeong-Yeon
- IEMEK Journal of Embedded Systems and Applications
- /
- v.9 no.5
- /
- pp.285-292
- /
- 2014
The commonly used Newton-Raphson's floating-point number divider algorithm performs two multiplications in one iteration. In this paper, a tentative K'th Newton-Raphson's floating-point number divider algorithm which performs K times multiplications in one iteration is proposed. Since the number of multiplications performed by the proposed algorithm is dependent on the input values, the average number of multiplications per an operation in single precision and double precision divider is derived from many reciprocal tables with varying sizes. In addition, an error correction algorithm, which consists of one multiplication and a decision, to get exact result in divider is proposed. Since the proposed algorithm only performs the multiplications until the error gets smaller than a given value, it can be used to improve the performance of a floating point number divider unit. Also, it can be used to construct optimized approximate reciprocal tables.
https://doi.org/10.14372/IEMEK.2014.9.5.285 인용 PDF KSCI

Enhanced Prediction Algorithm for Near-lossless Image Compression with Low Complexity and Low Latency

Son, Ji Deok;Song, Byung Cheol
- IEIE Transactions on Smart Processing and Computing
- /
- v.5 no.2
- /
- pp.143-151
- /
- 2016
This paper presents new prediction methods to improve compression performance of the so-called near-lossless RGB-domain image coder, which is designed to effectively decrease the memory bandwidth of a system-on-chip (SoC) for image processing. First, variable block size (VBS)-based intra prediction is employed to eliminate spatial redundancy for the green (G) component of an input image on a pixel-line basis. Second, inter-color prediction (ICP) using spectral correlation is performed to predict the R and B components from the previously reconstructed G-component image. Experimental results show that the proposed algorithm improves coding efficiency by up to 30% compared with an existing algorithm for natural images, and improves coding efficiency with low computational cost by about 50% for computer graphics (CG) images.
https://doi.org/10.5573/IEIESPC.2016.5.2.143 인용 PDF KSCI

Refined fixed granularity algorithm on Networks of Workstations (NOW 환경에서 개선된 고정 분할 단위 알고리즘)

Gu, Bon-Geun
- The KIPS Transactions:PartA
- /
- v.8A no.2
- /
- pp.117-124
- /
- 2001
At NOW (Networks Of Workstations), the load sharing is very important role for improving the performance. The known load sharing strategy is fixed-granularity, variable-granularity and adaptive-granularity. The variable-granularity algorithm is sensitive to the various parameters. But Send algorithm, which implements the fixed-granularity strategy, is robust to task granularity. And the performance difference between Send and variable-granularity algorithm is not substantial. But, in Send algorithm, the computing time and the communication time are not overlapped. Therefore, long latency time at the network has influence on the execution time of the parallel program. In this paper, we propose the preSend algorithm. In the preSend algorithm, the master node can send the data to the slave nodes in advance without the waiting for partial results from the slaves. As the master node sent the next data to the slaves in advance, the slave nodes can process the data without the idle time. As stated above, the preSend algorithm can overlap the computing time and the communication time. Therefore we reduce the influence of the long latency time at the network and the execution time of the parallel program on the NOW. To compare the execution time of two algorithms, we use the $320{\times}320$ matrix multiplication. The comparison results of execution times show that the preSend algorithm has the shorter execution time than the Send algorithm.
PDF

Efficient Message Scheduling for WDM Optical Networks with Minimizing Flow Time

Huang, Xiaohong;Ma, Maode
- Journal of Communications and Networks
- /
- v.6 no.2
- /
- pp.147-155
- /
- 2004
In this paper, we propose an efficient sequencing technique, namely minimum Row time scheduling (MFTS), to manage variable-Iength message transmissions for single-hop passive starcoupled WDM optical networks. By considering not only the message length but also the state of the receivers and the tuning latency, the proposed protocol can reduce the average delay of the network greatly. This paper also introduces a new channel assignment technique latency minimizing scheduling (LMS), which aims to reduce the scheduling latency. We evaluate the proposed algorithm, using extensive discrete-event simulations, by comparing its performance with shortest job first (SJF) algorithm. We find that significant improvement in average delay could be achieved by MFTS algorithm. By combining the proposed message sequencing technique with the channel selection technique, the performance of the optical network could be further improved.
PDF KSCI

Decision Tree of Occupational Lung Cancer Using Classification and Regression Analysis

Kim, Tae-Woo;Koh, Dong-Hee;Park, Chung-Yill
- Safety and Health at Work
- /
- v.1 no.2
- /
- pp.140-148
- /
- 2010
Objectives: Determining the work-relatedness of lung cancer developed through occupational exposures is very difficult. Aims of the present study are to develop a decision tree of occupational lung cancer. Methods: 153 cases of lung cancer surveyed by the Occupational Safety and Health Research Institute (OSHRI) from 1992-2007 were included. The target variable was whether the case was approved as work-related lung cancer, and independent variables were age, sex, pack-years of smoking, histological type, type of industry, latency, working period and exposure material in the workplace. The Classification and Regression Test (CART) model was used in searching for predictors of occupational lung cancer. Results: In the CART model, the best predictor was exposure to known lung carcinogens. The second best predictor was 8.6 years or higher latency and the third best predictor was smoking history of less than 11.25 pack-years. The CART model must be used sparingly in deciding the work-relatedness of lung cancer because it is not absolute. Conclusion: We found that exposure to lung carcinogens, latency and smoking history were predictive factors of approval for occupational lung cancer. Further studies for work-relatedness of occupational disease are needed.
https://doi.org/10.5491/SHAW.2010.1.2.140 인용 PDF

Seamless Video Switching System for Service Compatible 3DTV Broadcasting

Kim, Sangjin;Jeon, Taehyun
- ETRI Journal
- /
- v.38 no.5
- /
- pp.847-857
- /
- 2016
Broadcasting services such as multi/single channel HDTV and 3DTV/2DTV use a multi-channel encoder that changes the bitrate and composition of the video service depending on the time. However, this type of multi-channel encoder could cause a longer latency owing to the variable bitrate and relatively bigger size of the buffers, which results in the same delay as in 3DTV even for a conventional DTV service. On the other hand, systems built based on separate encoders, each of which is optimized for the target service, might not have such latency problems. Nevertheless, there might be a distortion problem in the image and sound at the time of a switchover between two encoders with different output bitrates and group of picture structures. This paper proposes a system that can realize a seamless video service conversion using two different video encoders optimized for each video service. An overall functional description of the video service change control server, which is a main control block for the proposed system, is also provided. The experiment results confirm the seamless switchover and reduced broadcasting latency of DTV services compared with a broadcasting system composed of a multi-channel encoder system.
https://doi.org/10.4218/etrij.16.2615.0036 인용 PDF KSCI KPUBS

Reduced Complexity-and-Latency Variable-to-Check Residual Belief Propagation for LDPC Codes (LDPC 부호를 위한 복잡도와 대기시간을 낮춘 VCRBP 알고리즘)

Kim, Jung-Hyun;Song, Hong-Yeop
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.34 no.6C
- /
- pp.571-577
- /
- 2009
This paper proposes some new improved versions of node-wise VCRBP algorithm for low-density parity-check (LDPC) codes, called forced-convergence node-wise VCRBP algorithm and sign based node-wise VCRBP, both of which significantly reduce the decoding complexity and latency, with only negligible deterioration in error correcting performance.
PDF KSCI

Design of a Variable Shortened and Punctured RS Decoder (단축 및 펑처링 기반의 가변형 RS 복호기 설계)

Song Moon-Kyou;Kong Min-Han;Lim Myoung-Seob
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.31 no.8C
- /
- pp.763-770
- /
- 2006
In this paper, a variable Reed-Solomon(RS) decoder with erasure decoding functionality is designed based on the modified Euclid's algorithm(MEA). The variability of the decoder is implemented through shortening and puncturing based on the RS(124, 108, 8) code, other than the primitive RS(255, 239, 8) code. This leads to shortening the decoding latency. The decoder performs 4-step pipelined operation, where each step is designed to be clocked by an independent clock. Thus by using a faster clock for the MEA block, the complexity and the decoding latency can be reduced. It can support both continuous- and burst-mode decoding. It has been designed in VHDL and synthesized in an FPGA chip, consuming 3,717 logic cells and 2,048-bit memories. The maximum decoding throughput is 33 MByte/sec.
PDF KSCI

Search Result 62, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)