• Title/Summary/Keyword: memory latency

Search Result 361, Processing Time 0.03 seconds

Eager Data Transfer Mechanism for Reducing Communication Latency in User-Level Network Protocols

  • Won, Chul-Ho;Lee, Ben;Park, Kyoung;Kim, Myung-Joon
    • Journal of Information Processing Systems
    • /
    • v.4 no.4
    • /
    • pp.133-144
    • /
    • 2008
  • Clusters have become a popular alternative for building high-performance parallel computing systems. Today's high-performance system area network (SAN) protocols such as VIA and IBA significantly reduce user-to-user communication latency by implementing protocol stacks outside of operating system kernel. However, emerging parallel applications require a significant improvement in communication latency. Since the time required for transferring data between host memory and network interface (NI) make up a large portion of overall communication latency, the reduction of data transfer time is crucial for achieving low-latency communication. In this paper, Eager Data Transfer (EDT) mechanism is proposed to reduce the time for data transfers between the host and network interface. The EDT employs cache coherence interface hardware to directly transfer data between the host and NI. An EDT-based network interface was modeled and simulated on the Linux-based, complete system simulation environment, Linux/SimOS. Our simulation results show that the EDT approach significantly reduces the data transfer time compared to DMA-based approaches. The EDTbased NI attains 17% to 38% reduction in user-to-user message time compared to the cache-coherent DMA-based NIs for a range of message sizes (64 bytes${\sim}$4 Kbytes) in a SAN environment.

Partioning for hardwae-software codesign (하드웨어-소프트웨어 통합 설계를 위한 분할)

  • 윤경로;박동하;신현철
    • Journal of the Korean Institute of Telematics and Electronics A
    • /
    • v.33A no.7
    • /
    • pp.261-268
    • /
    • 1996
  • Hardware-software codesign becomes improtant to effectively sagisfy perfomrance goals, because designers can trade-off in the way hardware and software components work teogether to exhibit a specified behavior. In this paper, a hardware-software pratitioning algorithm is presetned, in which the system behavioral description containing a mixture of hardware and software components is partitioned into hardware part and software part. The partitioning algorithm tries to minimize the given cost function under constraints on hardware resources or latency. Recursive moving of operations between the hardware and software parts is used to find a near optimum partition and the list scheduling approach is used to estimate the hardware area and latency. Since memory may take substantial protion of the hardware part, memory cost is included in sthe hardware cost. Experimental resutls show that our algorithm is effective.

  • PDF

A Locality-Aware Write Filter Cache for Energy Reduction of STTRAM-Based L1 Data Cache

  • Kong, Joonho
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.16 no.1
    • /
    • pp.80-90
    • /
    • 2016
  • Thanks to superior leakage energy efficiency compared to SRAM cells, STTRAM cells are considered as a promising alternative for a memory element in on-chip caches. However, the main disadvantage of STTRAM cells is high write energy and latency. In this paper, we propose a low-cost write filter (WF) cache which resides between the load/store queue and STTRAM-based L1 data cache. To maximize efficiency of the WF cache, the line allocation and access policies are optimized for reducing energy consumption of STTRAM-based L1 data cache. By efficiently filtering the write operations in the STTRAM-based L1 data cache, our proposed WF cache reduces energy consumption of the STTRAM-based L1 data cache by up to 43.0% compared to the case without the WF cache. In addition, thanks to the fast hit latency of the WF cache, it slightly improves performance by 0.2%.

Hardware Platforms for Flash Memory/NVRAM Software Development

  • Nam, Eyee-Hyun;Choi, Ki-Seok;Choi, Jin-Yong;Min, Hang-Jun;Min, Sang-Lyul
    • Journal of Computing Science and Engineering
    • /
    • v.3 no.3
    • /
    • pp.181-194
    • /
    • 2009
  • Flash memory is increasingly being used in a wide range of storage applications because of its low power consumption, low access latency, small form factor, and high shock resistance. However, the current platforms for flash memory software development do not meet the ever-increasing requirements of flash memory applications. This paper presents three different hardware platforms for flash memory/NVRAM (non-volatile RAM) software development that overcome the limitations of the current platforms. The three platforms target different types of host system and provide various features that facilitate the development and verification of flash memory/NVRAM software. In this paper, we also demonstrate the usefulness of the three platforms by implementing three different types of storage system (one for each platform) based on them.

Multi-access Edge Computing Scheduler for Low Latency Services (저지연 서비스를 위한 Multi-access Edge Computing 스케줄러)

  • Kim, Tae-Hyun;Kim, Tae-Young;Jin, Sunggeun
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.15 no.6
    • /
    • pp.299-305
    • /
    • 2020
  • We have developed a scheduler that additionally consider network performance by extending the Kubernetes developed to manage lots of containers in cloud computing nodes. The network delay adapt characteristics of the compute nodes were learned during server operation and the learned results were utilized to develop placement algorithm by considering the existing measurement units, CPU, memory, and volume together, and it was confirmed that the low delay network service was provided through placement algorithm.

Multi-mode Embedded Compression Algorithm and Architecture for Code-block Memory Size and Bandwidth Reduction in JPEG2000 System (JPEG2000 시스템의 코드블록 메모리 크기 및 대역폭 감소를 위한 Multi-mode Embedded Compression 알고리즘 및 구조)

  • Son, Chang-Hoon;Park, Seong-Mo;Kim, Young-Min
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.46 no.8
    • /
    • pp.41-52
    • /
    • 2009
  • In Motion JPEG2000 encoding, huge bandwidth requirement of data memory access is the bottleneck in required system performance. For the alleviation of this bandwidth requirement, a new embedded compression(EC) algorithm with a little bit of image quality drop is devised. For both random accessibility and low latency, very simple and efficient entropy coding algorithm is proposed. We achieved significant memory bandwidth reductions (about 53${\sim}$81%) and reduced code-block memory to about half size through proposed multi-mode algorithms, without requiring any modification in JPEG2000 standard algorithm.

Effects of Treadmill Exercise on Memory, Hippocampal Cell Proliferation, BDNF, TrkB, and Forebrain Cholinergic Cells in Adolescent Rats (트레드밀 운동이 청소년기 흰쥐의 기억력과 해마 신경세포생성, BDNF, TrkB, 그리고 전뇌 콜린 세포에 미치는 영향)

  • Lee, Hee-Hyuk
    • Journal of Life Science
    • /
    • v.19 no.3
    • /
    • pp.403-410
    • /
    • 2009
  • This study investigated the effects of treadmill exercise on memory ability, cell proliferation, BDNF, and TrkB in the hippocampus and forebrain cholinergic cells in adolescent rats. Male Sprague-Dawley rats (4 weeks old) were randomly assigned to the following two groups: the sedentary group (n=10) and the exercise group (n=10). Rats in the exercise group were forced to run on a treadmill for 30 min, five times per week for 4 weeks. The latency of the step-through avoidance task was used in order to evaluate memory ability. Hippocampal brain-derived neurotrophic factor (BDNF) and tropomyosin-related kinase B (TrkB) expression were assessed by Western blotting. Hippocampal cell proliferation and forebrain cholinergic cells were assessed by immunohistochemistry. The present study showed that treadmill running during the adolescent period significantly improved memory capability, increased hippocampal cell proliferation, up-regulated hippocampal BDNF and TrkB expression, and enhanced the number of forebrain cholinergic cells. These results suggest that regular exercise during the adolescent period may enhance memory function.

Strain-dependent Differences of Locomotor Activity and Hippocampus-dependent Learning and Memory in Mice

  • Kim, Joong-Sun;Yang, Mi-Young;Son, Yeong-Hoon;Kim, Sung-Ho;Kim, Jong-Choon;Kim, Seung-Joon;Lee, Yong-Duk;Shin, Tae-Kyun;Moon, Chang-Jong
    • Toxicological Research
    • /
    • v.24 no.3
    • /
    • pp.183-188
    • /
    • 2008
  • The behavioral phenotypes of out-bred ICR mice were compared with those of in-bred C57BL/6 and BALB/c mice. In particular, this study examined the locomotor activity and two forms of hippocampus-dependent learning paradigms, passive avoidance and object recognition memory. The basal open-field activity of the ICR strain was greater than that of the C57BL/6 and BALB/c strains. In the passive avoidance task, all the mice showed a significant increase in the cross-over latency when tested 24 hours after training. The strength of memory retention in the ICR mice was relatively weak and measurable, as indicated by the shorter cross-over latency than the C57BL/6 and BALB/c mice. In the object recognition memory test, all strains had a significant preference for the novel object during testing. The index for the preference of a novel object was lower for the ICR and BALB/c mice. Nevertheless, the variance and the standard deviation in these strains were comparable. Overall, these results confirm the strain differences on locomotor activity and hippocampus-dependent learning and memory in mice.

Improvement of Memory Impairment by L-Theanine Through Inhibition of Acetylcholinesterase Activity in Mice (마우스에서 L-Theanine의 기억력 회복능 및 Acetylcholinesterase 활성 억제)

  • Yuk, Dong-Yeon;Kim, Tae-Il;Park, Sang-Gi;Park, Hyoung-Kook;Yoon, Yeo-Kyeung;Hong, Jin-Tae
    • YAKHAK HOEJI
    • /
    • v.51 no.6
    • /
    • pp.409-414
    • /
    • 2007
  • Acetylcholinesterase (AChE) plays a role in the progression of Alzheimer's disease (AD). In this study, we examined the improving effect of L-theanine, a major amino acid in Japanese green tea (Camellia sinensis) on the scopolamine (1 mg/kg/mouse)-induced memory dysfunction in mice. Treatment with L-theanine (2, 4 mg/kg/mouse p.o.) in the drinking water for 7 days reversed the scopolamine-induced latency time and distance in the water maze test, latency time in the passive avoidance test, and inhibited AChE activity. This study suggests that L-theanine may be a useful agent for prevention of progression of AD.

Reliability Optimization Technique for High-Density 3D NAND Flash Memory Using Asymmetric BER Distribution (에러 분포의 비대칭성을 활용한 대용량 3D NAND 플래시 메모리의 신뢰성 최적화 기법)

  • Myungsuk Kim
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.18 no.1
    • /
    • pp.31-40
    • /
    • 2023
  • Recent advances in flash technologies, such as 3D processing and multileveling schemes, have successfully increased the flash capacity. Unfortunately, these technology advances significantly degrade flash's reliability due to a smaller cell geometry and a finer-grained cell state control. In this paper, we propose an asymmetric BER-aware reliability optimization technique (aBARO), new flash optimization that improves the flash reliability. To this end, we first reveal that bit errors of 3D NAND flash memory are highly skewed among flash cell states. The proposed aBARO exploits the unique per-state error model in flash cell states by selecting the most error-prone flash states and by forming narrow threshold voltage distributions (for the selected states only). Furthermore, aBARO is applied only when the program time (tPROG) gets shorter when a flash cell becomes aging, thereby keeping the program latency of storage systems unchanged. Our experimental results with real 3D MLC and TLC flash devices show that aBARO can effectively improve flash reliability by mitigating a significant number of bit errors. In addition, aBARO can also reduce the read latency by 40%, on average, by suppressing the read retries.