Search | Korea Science

Instructions and Data Prefetch Mechanism using Displacement History Buffer (변위 히스토리 버퍼를 이용한 명령어 및 데이터 프리페치 기법)

Jeong, Yong Su;Kim, JinHyuk;Cho, Tae Hwan;Choi, SangBang
- Journal of the Institute of Electronics and Information Engineers
- /
- v.52 no.10
- /
- pp.82-94
- /
- 2015
In this paper, we propose hardware prefetch mechanism with an efficient cache replacement policy by giving priority to the trigger block in which a spatial region and producing a spatial region by using the displacement field. It could be taken into account the sequence of the program since a history is based on the trigger block of history record, and it could be quickly prefetching the instructions or data address by adding a stored value to the trigger address and displacement field since a history is stored as a displacement value. Also, we proposed a method of replacing at random by the cache replacement policy from the low priority block when the cache area is full after giving priority to the trigger block. We analyzed using the memory simulator program gem5 and PARSEC benchmark to assess the performance of the hardware prefetcher. As a result, compared to the existing hardware prefecture to generate the spatial region using a bit vector, L1 data cache miss rate was reduced about 44.5% on average and an average of 26.1% of L1 instruction misses occur. In addition, IPC (Instruction Per Cycle) showed an improvement of about 23.7% on average.
https://doi.org/10.5573/ieie.2015.52.10.082 인용 PDF KSCI

Performance Relationship Analysis in Map Block Number of NAND Flash Storage Device Using Map Cache Techniques (맵 캐시 기법을 사용하는 낸드 플래시 저장장치의 맵 블록 개수에 따른 성능 관계 분석)

Lee, Daeyong;Song, Yong Ho
- Proceedings of the Korea Information Processing Society Conference
- /
- 2016.10a
- /
- pp.22-25
- /
- 2016
맵 캐시 기법을 사용하는 낸드 플래시 저장장치는 맵 데이터를 저장하기 위한 공간을 필요로 한다. 이 공간을 맵 블록이라 브르며 시스템 유지 및 성능 개선을 위해 사용되는 낸드 블록의 일부를 점유한다. 맵 블록의 개수가 너무 많을 경우 시스템 유지에 블록이 부족해지기 때문에 전반적인 성능이 하락하게 된다. 하지만 맵 블록이 너무 적은 경우에도 전체 맵 데이터를 유지하기 위한 동작이 과도하게 수행되어 성능이 크게 하락하는 문제가 발생한다. 본 논문은 맵 블록 개수에 따른 성능 변화를 분석하고 최적의 맵 블록 개수를 제안한다.
https://doi.org/10.3745/PKIPS.y2016m10a.22 인용 PDF

A Performance Evaluation of Value Predictors in a Superscalar Processor (슈퍼스칼라 프로세서에서 값 예측기의 성능평가)

전병찬;박희룡;이상정
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.10c
- /
- pp.10-12
- /
- 2001
와이드 이슈 슈퍼스칼라 프로세서에서 값 예측기는 한 명령어의 결과를 미리 예측하여 명령들 간의 데이터 종속관계를 극복하고 실행함으로써 명령어 수준 병렬성(Instruction Level Parallesim ILP)을 향상시키는 기법이다. 본 논문에서는 명령어 수준 병렬성을 이용하여 성능을 향상시키기 위하여 데이터 값을 미리 예측하여 병렬로 이슈하고 수행하는 값 예측기의 성능을 비교분석 한다. 먼저 값 예측기 종류별로 성능을 측정한다 그리고 테이블의 갱신시점, 트레이스 캐시 유무 및 명령윈도우 크기에 따른 값 예측기의 성능영향을 평가분석 한다. 성능분석 결과 최근 값 예측기가 간소한 하드웨어 구성에도 불구하고 우수한 성능을 보였다. 그리고 예측테이블 갱신시점과 트레이스캐시의 사용이 값 예측기의 성능향상에 영향을 주었다.
PDF

Semantic Caching Method that consider Layer and Region for Disconnected Operation in Location Based Service (위치 기반 서비스에서 비연결 연산을 위한 Layer와 Region을 고려한 의미론적 캐싱 기법)

이상철;이충호;김재홍;배해영
- Proceedings of the Korean Information Science Society Conference
- /
- 2001.10a
- /
- pp.301-303
- /
- 2001
현재 무선단말기 보급의 확대와 시간과 공간의 제약을 뛰어넘는 장점으로 무선 인터넷 환경이 급속도로 발전하고 있다. 그러나, 본 논문의 연구 분야인 모바일 지리정보 시스템(Mobile Geographic Information System)에서 무선인터넷을 통하여 실시간 지리 정보를 얻기 위해 사용자의 무선단말기와 서버 사이에 빈번한 데이터 송수신이 이뤄져야 하며 데이터의 빠른 변화를 수용해야 한다. 그러나, 아직 현저히 낮은 대역폭을 가진 채널을 통해 통신하며, 비싼 이용 요금과 오류로 인한 자발적 또는 비자발적 연결 끊김 현상 등은 서비스 제공에 한계를 가진다. 그러므로 본 논문에서는 이동 컴퓨팅 환경의 연구를 통해 연결 끊김 현상에서도 클라이언트의 지속적인 서비스 운영을 위한 비연결 연산으로 의미론적 캐시 기법과 캐시 교체 전략에 대해 알아보며, 본 논문에서는 Layer와 Region을 고려한 의미론적 캐싱 기법을 제안하여 무선 환경에서 동적인 지도서비스를 위한 기법을 제시하였다. 이 연구를 통해 아직 많은 한계를 지닌 무선인터넷 환경에서 지리정보뿐 아니라 위치기반의 다양한 서비스 분야에 응용될 수 있으리라 기대된다.
PDF

A Cache Managing Strategy for Fast Media Data Access (미디어 데이터의 빠른 참조를 위한 캐시 운영 전략)

Moon, Hyun-Ju;Kim, Suk-il
- The KIPS Transactions:PartA
- /
- v.11A no.1
- /
- pp.11-20
- /
- 2004
Multimedia data processing in streaming pattern contains high spatial locality and low temporal locality. This paper has proposed a dynamic data prefetching scheme that fully exploits the regularity between memory addresses referred consecutively. Compared to the existing data Prefetching scheme, the Proposed scheme can reduce data Prefetching error when an application divides an way into smaller blocks and processes them block by block. Experimental results on various media benchmark programs show the proposed scheme predicts memory addresses more accurately and results in better performance than existing prefetching schemes.
https://doi.org/10.3745/KIPSTA.2004.11A.1.011 인용 PDF KSCI

Multi-layer Caching Scheme Considering Sub-graph Usage Patterns (서브 그래프의 사용 패턴을 고려한 다중 계층 캐싱 기법)

Yoo, Seunghun;Jeong, Jaeyun;Choi, Dojin;Park, Jaeyeol;Lim, Jongtae;Bok, Kyoungsoo;Yoo, Jaesoo
- The Journal of the Korea Contents Association
- /
- v.18 no.3
- /
- pp.70-80
- /
- 2018
Due to the recent development of social media and mobile devices, graph data have been using in various fields. In addition, caching techniques for reducing I/O costs in the process of large capacity graph data have been studied. In this paper, we propose a multi-layer caching scheme considering the connectivity of the graph, which is the characteristics of the graph topology, and the history of the past subgraph usage. The proposed scheme divides a cache into Used Data Cache and Prefetched Cache. The Used Data Cache maintains data by weights according to the frequently used sub-graph patterns. The Prefetched Cache maintains the neighbor data of the recently used data that are not used. In order to extract the graph patterns, their past history information is used. Since the frequently used sub-graphs have high probabilities to be reused, they are cached. It uses a strategy to replace new data with less likely data to be used if the memory is full. Through the performance evaluation, we prove that the proposed caching scheme is superior to the existing cache management scheme.
https://doi.org/10.5392/JKCA.2018.18.03.070 인용 PDF KSCI

Cache Memory and Replacement Algorithm Implementation and Performance Comparison

Park, Na Eun;Kim, Jongwan;Jeong, Tae Seog
- Journal of the Korea Society of Computer and Information
- /
- v.25 no.3
- /
- pp.11-17
- /
- 2020
In this paper, we propose practical results for cache replacement policy by measuring cache hit and search time for each replacement algorithm through cache simulation. Thus, the structure of each cache memory and the four types of alternative policies of FIFO, LFU, LRU and Random were implemented in software to analyze the characteristics of each technique. The paper experiment showed that the LRU algorithm showed hit rate and search time of 36.044% and 577.936ns in uniform distribution, 45.636% and 504.692ns in deflection distribution, while the FIFO algorithm showed similar performance to the LRU algorithm at 36.078% and 554.772ns in even distribution and 45.662% and 489.574ns in bias distribution. Then LFU followed, Random algorithm was measured at 30.042% and 622.866ns at even distribution, 36.36% at deflection distribution and 553.878ns at lowest performance. The LRU replacement method commonly used in cache memory has the complexity of implementation, but it is the most efficient alternative to conventional alternative algorithms, indicating that it is a reasonable alternative method considering the reference information of data.
https://doi.org/10.9708/jksci.2020.25.03.011 인용 PDF KSCI

A Study on Improvement of Buffer Cache Performance for File I/O in Deep Learning (딥러닝의 파일 입출력을 위한 버퍼캐시 성능 개선 연구)

Jeongha Lee;Hyokyung Bahn
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.24 no.2
- /
- pp.93-98
- /
- 2024
With the rapid advance in AI (artificial intelligence) and high-performance computing technologies, deep learning is being used in various fields. Deep learning proceeds training by randomly reading a large amount of data and repeats this process. A large number of files are randomly repeatedly referenced during deep learning, which shows different access characteristics from traditional workloads with temporal locality. In order to cope with the difficulty in caching caused by deep learning, we propose a new sampling method that aims at reducing the randomness of dataset reading and adaptively operating on existing buffer cache algorithms. We show that the proposed policy reduces the miss rate of the buffer cache by 16% on average and up to 33% compared to the existing method, and improves the execution time by up to 24%.
https://doi.org/10.7236/JIIBC.2024.24.2.93 인용 PDF HTML

Design and implementation of a cache manager for pipeline time-series data (배관 시계열 데이터를 위한 캐시 관리자의 설계 및 구현)

Kim, Seon-Hyo;Kim, Won-Sik;Shin, Je-Yong;Han, Wook-Shin
- Proceedings of the Korea Information Processing Society Conference
- /
- 2005.11a
- /
- pp.109-112
- /
- 2005
배관에 생기는 구멍이나 틈은 대형 사고의 원인이 될 수 있다. 이러한 배관의 결함을 찾기 위해서는 먼저 센서를 부착한 배관 탐사 장비를 배관에 통과시키고, 배관을 통과하는 중에 센서가 읽은 정보들을 배관 탐사 장비의 하드 디스크에 저장한다. 배관 통과가 완료된 후, 분석가는 분석 프로그램을 사용하여 탐사 장비에서 얻은 데이터에서 결함을 수동적으로 찾는다. 분석가가 데이터를 분석할 때 일반적으로 두 가지 패턴이 존재한다. 첫 번째 패턴은 일정한 구간의 센서 데이터를 순차적으로 분석하는 패턴이고, 두 번째 패턴은 현재 구간에서 이전 구간으로 되돌아가서 다시 분석하는 반복적인 패턴이다. 현재까지 만족할 만 한 수준으로 자동적으로 분석이 되지 않으므로, 분석가는 수작업으로 분석을 하는 경우가 많은데 이로 인해 최근에 읽은 부분을 전후 반복해서 액세스하는 반복적인 패턴이 많이 사용된다. 반복적 패턴의 경우 시스템의 성능을 향상시키기 위해, 이전에 읽은 배관 센서 데이터를 캐싱 할 필요가 있다. 그러나 기존의 분석 소프트웨어에는 캐싱 기능이 없으므로 반복적 패턴일 경우 데이터베이스에서 동일한 데이터를 반복적으로 읽는 문제를 가지고 있다. 본 논문에서는 배관 센서 데이터를 효율적으로 관리하는 캐쉬 관리자를 설계하고 구현하였다. 세부적으로는, 배관 센서 데이터를 시계열 데이터로 간주하고, 시계열 데이터에 대한 캐시 관리자를 제안하였다. 본 논문은 배관 탐사 장비에서 획득한 데이터들을 시계열 데이터로 간주하여 데이터베이스 측면에서 이러한 문제들을 접근하였다는 점에서 의미가 있으며, 향후 이 분야에 대한 많은 연구들이 나올 것으로 기대한다.
PDF

An Address Translation Technique Large NAND Flash Memory using Page Level Mapping (페이지 단위 매핑 기반 대용량 NAND플래시를 위한 주소변환기법)

Seo, Hyun-Min;Kwon, Oh-Hoon;Park, Jun-Seok;Koh, Kern
- Journal of KIISE:Computing Practices and Letters
- /
- v.16 no.3
- /
- pp.371-375
- /
- 2010
SSD is a storage medium based on NAND Flash memory. Because of its short latency, low power consumption, and resistance to shock, it's not only used in PC but also in server computers. Most SSDs use FTL to overcome the erase-before-overwrite characteristic of NAND flash. There are several types of FTL, but page mapped FTL shows better performance than others. But its usefulness is limited because of its large memory footprint for the mapping table. For example, 64MB memory space is required only for the mapping table for a 64GB MLC SSD. In this paper, we propose a novel caching scheme for the mapping table. By using the mapping-table-meta-data we construct a fully associative cache, and translate the address within O(1) time. The simulation results show more than 80 hit ratio with 32KB cache and 90% with 512KB cache. The overall memory footprint was only 1.9% of 64MB. The time overhead of cache miss was measured lower than 2% for most workload.
PDF KSCI

Search Result 256, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)