• Title/Summary/Keyword: Local memory

Remote Cache Replacement Policy using Processor Locality in Multi-Processor System (다중 프로세서 시스템에서 프로세서 지역성을 이용한 원격 캐쉬 교체 정책)

  • Han Sang Yoon;Kwak Jong Wook;Jhang Seong Tae;Jhon Chu Shik
    • Journal of KIISE: Computer Systems and Theory / v.32 no.11_12 / pp.541-556 / 2005
  • Memory access latency has been a primary cause of performance degradation in both single-processor and multi-processor systems. In a distributed shared-memory system in particular, remote memory access latency is much higher than local memory access latency. To address this problem, a multi-level cache architecture that includes a remote cache has been proposed for multi-processor systems. In this paper, we propose a new cache replacement policy that improves the performance of a multi-processor system with a remote cache. If the multi-level cache maintains the multi-level inclusion (MLI) property and uses the LRU (Least Recently Used) replacement policy, the LRU information of the higher-level cache (a processor cache) can differ from that of the lower-level cache (a remote cache). In this situation, replacing a remote cache line can force the replacement of a processor cache line that the processor is still using, which is a major cause of performance degradation for the whole system. To alleviate this disadvantage of the LRU replacement policy, the new policy analyzes the remote memory access pattern of the processors in each node and uses this information to reduce the number of invalidations of useful cache lines in the higher-level cache. Compared with the general LRU replacement policy, the new remote cache replacement policy improves performance by up to $3.5\%$ and by $2.5\%$ on average on the SPLASH-2 benchmarks.
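
The problem described in this abstract can be illustrated with a toy model. The sketch below is not the paper's policy; it only reproduces the baseline behaviour the abstract criticizes, under the assumptions that both caches are fully associative, that hits in the processor cache do not update the remote cache's LRU order, and that inclusion is enforced by back-invalidation. The class name `InclusiveHierarchy` and its fields are hypothetical.

```python
from collections import OrderedDict


class InclusiveHierarchy:
    """Toy two-level inclusive cache: processor cache (L1) above a remote cache (RC).

    Assumption: both levels are fully associative and use plain LRU.
    """

    def __init__(self, l1_size, rc_size):
        self.l1 = OrderedDict()  # oldest entry first, most recent last
        self.rc = OrderedDict()
        self.l1_size = l1_size
        self.rc_size = rc_size
        self.back_invalidations = 0

    def access(self, addr):
        if addr in self.l1:
            # Processor-cache hit: the remote cache never sees this access,
            # so its LRU order for addr becomes stale.
            self.l1.move_to_end(addr)
            return "l1_hit"

        if addr in self.rc:
            self.rc.move_to_end(addr)
            result = "rc_hit"
        else:
            if len(self.rc) >= self.rc_size:
                victim, _ = self.rc.popitem(last=False)  # evict RC's LRU line
                if victim in self.l1:
                    # Inclusion forces the line out of L1, even if it is hot there.
                    del self.l1[victim]
                    self.back_invalidations += 1
            self.rc[addr] = True
            result = "miss"

        if len(self.l1) >= self.l1_size:
            self.l1.popitem(last=False)  # ordinary L1 LRU eviction
        self.l1[addr] = True
        return result


h = InclusiveHierarchy(l1_size=2, rc_size=3)
trace = ["A", "B", "A", "C", "A", "D", "A"]
results = [h.access(a) for a in trace]
print(results, h.back_invalidations)
```

In this trace, line `A` is the most frequently used line in the processor cache, yet it is the oldest line from the remote cache's point of view, so the access to `D` evicts it from both levels and the final access to `A` misses. A policy that takes processor-side usage into account, as the abstract proposes, aims to avoid evictions of this kind.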

Development of Machine Learning based Flood Depth and Location Prediction Model (머신러닝을 이용한 침수 깊이와 위치예측 모델 개발)

  • Ji-Wook Kang;Jong-Hyeok Park;Soo-Hee Han;Kyung-Jun Kim
    • The Journal of the Korea institute of electronic communication sciences / v.18 no.1 / pp.91-98 / 2023
  • With flood damage increasing because of frequent localized heavy rain, flood prediction research is being conducted to prevent flood damage in advance. In this paper, we present a machine-learning scheme for developing a flood depth and location prediction model using real-time rainfall data. The scheme proposes a dataset configuration method that uses these data as input, can robustly represent various rainfall distribution patterns, and trains the model with less memory. The data consist of two types: valid total data and valid local data. The valid total data, which have a significant effect on flooding, predicted the flooding location well but tended to produce different values when predicting specific rainfall patterns. The valid local data represent areas that only partially affect flooding. The valid local data were learned well with the fixed point method, but the flooding location was not indicated accurately with the arbitrary point method. This study is expected to help prevent considerable damage by predicting the depth and location of flooding in real time.

NORMOBARIC OXYGEN($O_2$) ADMINISTRATION EFFECT ON ATTENTION AND MEMORY FUNCTION IN TEENAGE ADOLESCENTS (10대 청소년의 주의력과 기억능력에 미치는 정상기압 산소흡입 효과)

  • Kim, Byung-Hyo;Kim, Young-Mi;Cho, Soo-Churl;Kim, Boong-Nyun
    • Journal of the Korean Academy of Child and Adolescent Psychiatry / v.13 no.1 / pp.76-84 / 2002
  • Objectives: This study was conducted to investigate the effect of oxygen on attention and memory functions in healthy adolescents. Methods: The subjects were recruited through local advertisement. All subjects were students attending ordinary middle and high schools. Their degree of achievement was average or below average. Before the study, its nature and purpose were fully explained to the patients and their parents, and written informed consent was obtained from each child's parent and written assent from each child for the entire procedure. The Ethics Committee and Clinical Research Committee of Gyeongsang National University Hospital approved the protocol. For baseline assessment, all subjects received tests for attention and memory. All tests were conducted by a certified psychologist. The Stroop test, the continuous performance test, and the trail making test A and B were used to evaluate attention. For memory, we used the Memory Assessment Scale (MAS), a standardized memory assessment tool. Ten to fourteen days after the initial assessment, the same tests were applied to the same subjects following 5 minutes of oxygen inhalation. Results: 1) Attention test: Improved performance in trail making part B and the Stroop test was found in the normobaric oxygen inhalation group compared to the air inhalation group. Improved reaction time in those tests seemed to reflect enhanced executive prefrontal activity. 2) Memory test: More words and digits were memorized, as measured by the short-term memory subscale score of the MAS, in the oxygen inhalation group compared to the air inhalation group. This finding suggested improved working memory function after oxygen inhalation. Conclusion: Though they should be interpreted cautiously, these results suggest that normobaric oxygen inhalation could enhance the executive function and working memory of the prefrontal lobe. Further study, however, should be performed to investigate the mechanism by which oxygen produces cognitive enhancement.

A Scalable Parallel Labeling Algorithm on Mesh Connected SIMD Computers (메쉬 구조형 SIMD 컴퓨터 상에서 신축적인 병렬 레이블링 알고리즘)

  • 박은진;이갑섭;성효경;최흥문
    • Proceedings of the IEEK Conference / 1998.10a / pp.731-734 / 1998
  • A scalable parallel algorithm is proposed for efficient image component labeling with local operators on a mesh connected SIMD computer. In contrast to conventional parallel labeling algorithms, where a single pixel is assigned to each PE, the algorithm presented here is scalable and can assign an $m\times m$ pixel set to each PE according to the input image size. The assigned pixel set is converted to a single pixel that has a representative value, so the required memory and processing time can be greatly reduced. For an $N\times N$ image, if an $m\times m$ pixel set is assigned to each PE of a $P\times P$ mesh, where $P=N/m$, the time complexity due to communication between PEs and the computation complexity are reduced to $O(P\log P)$ bit operations and $O(P)$ bit operations, respectively, each of which is $1/m$ of that of the conventional method. This method also reduces the amount of memory in each PE to $O(P)$ and can decrease the number of PEs to $O(P^2)=\Theta(N^2/m^2)$, compared with $O(N^2)$ for the conventional method. Because the proposed parallel labeling algorithm is scalable, it can adapt to an increase in image size without any hardware change to the given mesh connected SIMD computer.
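
The sizing relations stated in this abstract ($P=N/m$, $P^2$ PEs instead of $N^2$) can be checked with a few lines of arithmetic. The sketch below covers only those relations, not the labeling algorithm itself; the function name `mesh_parameters` and the returned keys are hypothetical, and it assumes that $m$ divides $N$ evenly.

```python
def mesh_parameters(n, m):
    """Return mesh sizing for an n x n image with an m x m pixel set per PE.

    Assumption: m divides n exactly, so that P = n / m is an integer.
    """
    if n <= 0 or m <= 0 or n % m != 0:
        raise ValueError("n and m must be positive and m must divide n")
    p = n // m
    return {
        "P": p,                      # mesh is P x P
        "pes": p * p,                # PEs needed by the scalable scheme
        "pes_conventional": n * n,   # one PE per pixel
        "pe_reduction": m * m,       # ratio of the two PE counts
    }


print(mesh_parameters(512, 4))
```

For a 512 x 512 image with 4 x 4 pixel sets, this gives a 128 x 128 mesh, that is, 16384 PEs rather than 262144.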

Polyadenylation-Dependent Translational Control of New Protein Synthesis at Activated Synapse

  • Shin Chan-Young;Yang Sung-Il;Kim Kyun-Hwan;Ko Kwang-Ho
    • Biomolecules & Therapeutics / v.14 no.2 / pp.75-82 / 2006
  • Synaptic plasticity, which is a long-lasting change in synaptic efficacy, underlies many neural processes such as learning and memory. It has long been acknowledged that new protein synthesis is essential for both the expression of synaptic plasticity and memory formation and storage. Most research in this field has focused on the events regulating transcriptional activation of gene expression in the cell body and nucleus. Given the extremely differentiated structure of a neuron in the CNS, a neuron faces the formidable task of overcoming spatial and temporal constraints to deliver newly synthesized proteins to specific activated synapses among thousands of others, which are sometimes several millimeters away from the cell body. Recent advances in synaptic neurobiology have shown that almost all the machinery required for new protein translation is localized inside, or at least in the vicinity of, postsynaptic compartments. These findings led to the hypothesis that dormant mRNAs are translationally activated locally at the activated synapse, which may enable rapid and delicate control of new protein synthesis at activated synapses. In this review, we describe the mechanism of local translational control at activated synapses, focusing on the role of cytoplasmic polyadenylation of dormant mRNAs.

Enhanced VLAD

  • Wei, Benchang;Guan, Tao;Luo, Yawei;Duan, Liya;Yu, Junqing
    • KSII Transactions on Internet and Information Systems (TIIS) / v.10 no.7 / pp.3272-3285 / 2016
  • Recently, the Vector of Locally Aggregated Descriptors (VLAD) has been proposed to index images by compact representations. It encodes powerful local descriptors and significantly improves search performance with less memory compared with the state of the art. However, its performance relies heavily on the size of the codebook used to generate the VLAD representation: better accuracy requires a higher-dimensional representation and therefore more memory overhead. In this paper, we enhance the VLAD image representation by using two-level hierarchical codebooks. This provides more accurate search performance while keeping the VLAD size unchanged. In addition, the hierarchical codebooks are used to construct multiple inverted files for more accurate non-exhaustive search. Experimental results show that our method significantly improves both the VLAD image representation and non-exhaustive search.
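
For context, the baseline VLAD encoding that this abstract builds on can be sketched as follows. This is the standard single-codebook formulation (assign each local descriptor to its nearest codeword, sum the residuals per codeword, concatenate, L2-normalize), not the hierarchical-codebook method of the paper. The function name `vlad` and the toy inputs are hypothetical, and plain Python lists are used in place of an array library to keep the sketch dependency-free.

```python
import math


def vlad(descriptors, codebook):
    """Encode local descriptors as one VLAD vector of length k * d.

    descriptors: list of d-dimensional lists (local features of one image)
    codebook:    list of k d-dimensional lists (assumed to be pre-trained centroids)
    """
    k, d = len(codebook), len(codebook[0])
    acc = [[0.0] * d for _ in range(k)]
    for x in descriptors:
        # Nearest codeword by squared Euclidean distance.
        nearest = min(
            range(k),
            key=lambda i: sum((a - b) ** 2 for a, b in zip(x, codebook[i])),
        )
        # Accumulate the residual x - c for that codeword.
        for j in range(d):
            acc[nearest][j] += x[j] - codebook[nearest][j]
    flat = [v for row in acc for v in row]
    norm = math.sqrt(sum(v * v for v in flat))
    return [v / norm for v in flat] if norm > 0 else flat


codebook = [[0.0, 0.0], [10.0, 10.0]]
descriptors = [[1.0, 0.0], [0.0, 1.0], [11.0, 10.0]]
print(len(vlad(descriptors, codebook)))
```

The output length is the codebook size times the descriptor dimension, which is why, as the abstract notes, a larger codebook means a higher-dimensional representation and more memory.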

A SMA-based morphing flap: conceptual and advanced design

  • Ameduri, Salvatore;Concilio, Antonio;Pecora, Rosario
    • Smart Structures and Systems / v.16 no.3 / pp.555-577 / 2015
  • This work describes the development of a morphing flap actuated through load-bearing shape memory alloy (SMA) elements. Starting from aerodynamic specifications that prescribe the morphed shape enhancing the aerodynamic efficiency of the flap, a suitable actuation architecture able to affect the curvature was identified. Each rib of the flap was split into three elastic elements, called "cells", connected to each other in series and providing the bending stiffness of the structure. The edges of each cell are linked to SMA elements, whose contraction rotates the cell itself and increases the local curvature of the flap airfoil. The cells are made of two metallic plates crossing each other to form a characteristic "X" configuration; good flexibility and an acceptable stress concentration level were obtained by not connecting the plates in the crossing zone. After identifying the main design parameters of the structure (i.e. plate relative angle, thickness and depth; SMA length, cross section and connections to the cell), an optimization was performed with the aim of enhancing the achievable rotation of the cell and its ability to absorb the external aerodynamic loads while, at the same time, containing the stress level and the weight. The conceptual scheme of the architecture was then reinterpreted in view of a practical realization of the prototype. Implementation issues (SMA-cell connection and relative rotation of the cells to compensate for the impressed inflection that ensures the SMA pre-load) were considered. The morphing performance of the prototype was investigated through a detailed FE model under the most severe load conditions.

Performance Analysis of a Multiprocessor System Using Simulator Based on Parsec (Parsec 기반 시뮬레이터를 이용한 다중처리시스템의 성능 분석)

  • Lee Won-Joo;Kim Sun-Wook;Kim Hyeong-Rae
    • Journal of the Korea Society of Computer and Information / v.11 no.2 s.40 / pp.35-42 / 2006
  • In this paper, we implement a new simulator, based on Parsec, for performance analysis of a distributed shared memory multiprocessor system for parallel digital signal processing. The key feature of this simulator is that it is suited to simulating a system that uses the DMA function of the TMS320C6701 DSP chip and local memory with fast access time. In addition, because performance parameters are easy to adjust and hardware components are easy to reconfigure, the performance of the system can be analyzed in various execution environments. In the simulation, FFT, 2D FFT, matrix multiplication, and FIR filter, which are widely used DSP algorithms, were employed. Using our simulator, results were recorded for different numbers of processors, data sizes, and hardware configurations. The performance of our simulator was verified by comparing the recorded results.

Implementation of GPU based MPEG-2 Decoder (GPU 기반의 MPEG-2 디코더의 구현)

  • Kim, Kyung-Su;Kim, Hong-Sik;Kim, Cheong-Ghil;Park, Woo-Chan
    • Journal of Digital Contents Society / v.9 no.3 / pp.371-377 / 2008
  • Recently, GPU performance has been increasing much faster than CPU performance, and GPUs are used for a variety of application programs. In this paper, an MPEG-2 decoder is implemented in Cg, a GPU programming language. The proposed method performs block rendering on texture data according to the video standard, achieving very high parallelism by using the GPU pipeline, which is a stream processing structure. To reduce the data bandwidth between system memory and the GPU, the local memory of the graphics card is used. In the experiments, the proposed scheme performs more than 2 times better than a CPU-based scheme.

Performance Analysis of a Distributed Shared Memory Multiprocessor System Using PARSEC (PARSEC을 이용한 분산공유메모리 다중프로세서 시스템의 성능분석)

  • Park, Joon-Seok;Jeon, Chang-Ho
    • The Transactions of the Korea Information Processing Society / v.7 no.10 / pp.3049-3054 / 2000
  • In this paper, the effects of hardware components and runtime environments on the overall performance of a distributed shared memory system are analyzed through simulation. In the simulation, the system is modeled using PARSEC [1,2] to match the real runtime environment closely, and the 2D FFT is virtually executed on it. The simulation results show that minor hardware components such as bus interfaces and the local bus of a processor, which are usually ignored or neglected when analyzing performance, have a significant impact on overall system performance. Performance variations caused by runtime environment factors such as loop overhead and code optimization are also analyzed quantitatively.
