통합 검색 | Korea Science

다중스레드 모델의 성능 향상을 위한 가용 레지스터 기반 캐슁 기법 (A Register-Based Caching Technique for the Advanced Performance of Multithreaded Models)

고훈준;권영필;유원희
- 정보처리학회논문지A
- /
- 제8A권2호
- /
- pp.107-116
- /
- 2001
A multithreaded model is a hybrid one which combines locality of execution of the von Neumann model with asynchronous data availability and implicit parallelism of the dataflow model. Much researches that have been made toward the advanced performance of multithreaded models are about the cache memory which have been proved to be efficient in the von Neumann model. To use an instruction cache or operand cache, the multithreaded models must have cache memories. If cache memories are added to the multithreaded model, they may have the disadvantage of high implementation cost in the mode. To solve these problems, we did not add cache memory but applied the method of executing the caching by using available registers of the multithreaded models. The available register-based caching method is one that use the registers which are not used on the execution of threads. It may accomplish the same effect as the cache memory. The multithreaded models can compute the number of available registers to be used during the process of the register optimization, and therefore this method can be easily applied on the models. By applying this method, we can also remove the access conflict and the bottleneck of frame memories. When we applied the proposed available register-based caching method, we found that there was an improved performance of the multithreaded model. Also, when the available-register-based caching method is compared with the cache based caching method, we found that there was the almost same execution overhead.
PDF

Evaluation Of The Content-Based Packet Scheduling Policies On The Multithreaded Multiprocessor Network System

Yim Kangbin
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2004년도 ICEIC The International Conference on Electronics Informations and Communications
- /
- pp.39-41
- /
- 2004
In this paper, I propose a thread scheduling policy for faster packet processing on the network processors with multithreaded multiprocessor architecture. To implement the proposed policy, I derived several basic parameters related to the thread scheduling and included a new parameter representing the packet contents and the features of the multithreaded architecture. Through the empirical study using a network processor, I proved the proposed scheduling ploicy provides better throughput and load balancing compared to the generally used thread scheduling policy.
PDF

A Process Algebra-Based Detection Model for Multithreaded Programs in Communication System

Wang, Tao;Shen, Limin;Ma, Chuan
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- 제8권3호
- /
- pp.965-983
- /
- 2014
Concurrent behaviors of multithreaded programs cannot be described effectively by automata-based models. Thus, concurrent program intrusion attempts cannot be detected. To address this problem, we proposed the process algebra-based detection model for multithreaded programs (PADMP). We generate process expressions by static binary code analysis. We then add concurrency operators to process expressions and propose a model construction algorithm based on process algebra. We also present a definition of process equivalence and behavior detection rules. Experiments demonstrate that the proposed method can accurately detect errors in multithreaded programs and has linear space-time complexity. The proposed method provides effective support for concurrent behavior modeling and detection.
https://doi.org/10.3837/tiis.2014.03.014 인용 PDF KSCI KPUBS HTML

Robust Multithreaded Object Tracker through Occlusions for Spatial Augmented Reality

Lee, Ahyun;Jang, Insung
- ETRI Journal
- /
- 제40권2호
- /
- pp.246-256
- /
- 2018
A spatial augmented reality (SAR) system enables a virtual image to be projected onto the surface of a real-world object and the user to intuitively control the image using a tangible interface. However, occlusions frequently occur, such as a sudden change in the lighting environment or the generation of obstacles. We propose a robust object tracker based on a multithreaded system, which can track an object robustly through occlusions. Our multithreaded tracker is divided into two threads: the detection thread detects distinctive features in a frame-to-frame manner, and the tracking thread tracks features periodically using an optical-flow-based tracking method. Consequently, although the speed of the detection thread is considerably slow, we achieve real-time performance owing to the multithreaded configuration. Moreover, the proposed outlier filtering automatically updates a random sample consensus distance threshold for eliminating outliers according to environmental changes. Experimental results show that our approach tracks an object robustly in real-time in an SAR environment where there are frequent occlusions occurring from augmented projection images.
https://doi.org/10.4218/etrij.2017-0047 인용 PDF KSCI

네트워크 프로세서를 위한 다중 쓰레드 스케줄링 (Multi-thread Scheduling for the Network Processor)

임강빈;박준구;정기현;최경희
- 정보처리학회논문지C
- /
- 제11C권3호
- /
- pp.337-344
- /
- 2004
본 논문은 다중 프로세서(Multiprocessor) 기반 다중 쓰레드(Multithreaded) 구조의 네트워크 프로세서를 이용한 패킷 치리 시스템에서 패킷을 보다 고속으로 처리하기 위한 쓰레드 스케줄링 기법을 제안한다 이를 위하여 스케줄링과 관련한 인자를 실험을 통하여 얻고, 패킷 내용 및 다중 쓰레드 아키텍쳐를 표현하는 인자를 포함하도록 설계하였다. 시뮬레이터를 이용한 실험을 통하여 제안된 스케줄링 기법이 제공하는 처리율 및 부하 분산 정도가 다른 스케줄링 기법과 비교하여 효율적임을 증명하였다.
https://doi.org/10.3745/KIPSTC.2004.11C.3.337 인용 PDF KSCI

다중스레드 구조에서 함수 언어 루프의 효과적 실행 (The Efficient Execution of Functional Language Loops on the Multithreaded Architectures)

하상호
- 한국정보처리학회논문지
- /
- 제7권3호
- /
- pp.962-970
- /
- 2000
Multithreading is attractive in that it can tolerate memory latency and synchronization by effectively overlapping communication with computation. While several compiler techniques have been developed to produce multithreaded codes from functional languages programs, there still remains a lot of works to implement loops effectively. Executing lops in a style of multithreading usually causes some overheads, which can reduce severely the effect of multirheading. This paper suggests several methods in terms of architectures or compilers which can optimize loop execution by multithreading. We then simulate and analyze them for the matrix multiplication program.
PDF

A Multithreaded Implementation of HEVC Intra Prediction Algorithm for a Photovoltaic Monitoring System

Choi, Yung-Ho;Ahn, Hyung-Keun
- Transactions on Electrical and Electronic Materials
- /
- 제13권5호
- /
- pp.256-261
- /
- 2012
Recently, many photovoltaic systems (PV systems) including solar parks and PV farms have been built to prepare for the post fossil fuel era. To investigate the degradation process of the PV systems and thus, efficiently operate PV systems, there is a need to visually monitor PV systems in the range of infrared ray through the Internet. For efficient visual monitoring, this paper explores a multithreaded implementation of a recently developed HEVC standard whose compression efficiency is almost two times higher than H.264. For an efficient parallel implementation under a meshbased 64 multicore system, this work takes into account various design choices which can solve potential problems of a two-dimensional interconnects-based 64 multicore system. These problems may have not occurred in a small-scale multicore system based on a simple bus network. Through extensive evaluation, this paper shows that, for an efficient multithreaded implementation of HEVC intra prediction in a mesh-based multicore system, much effort needs to be made to optimize communications among processing cores. Thus, this work provides three design choices regarding communications, i.e., main thread core location, cache home policy, and maximum coding unit size. These design choices are shown to improve the overall parallel performance of the HEVC intra prediction algorithm by up to 42%, achieving a 7 times higher speed-up.
https://doi.org/10.4313/TEEM.2012.13.5.256 인용 PDF KSCI

Java 언어를 위한 쓰레드 모니터링 시스템 (A Thread Monitoring System for Java)

문세원;창병모
- 정보처리학회논문지A
- /
- 제13A권3호
- /
- pp.205-210
- /
- 2006
본 연구에서는 견고한 멀티 쓰레드 소프트웨어 개발을 돕기 위한 쓰레드 모니터링 시스템을 개발하였다. 이 시스템은 쓰레드 실행 과정과 동기화 과정를 시각적으로 추적, 모니터링 할 수 있다. 또한 사용자는 실행 전에 옵션 선택을 통해 관심 있는 쓰레드와 동기화만을 선택하여 이들을 중점적으로 모니터링 할 수 있으며 실행 후에는 실행된 쓰레드와 동기화의 특징을 요약한 프로파일 정보를 제공한다. 본 연구에서는 코드 인라인 기법을 기반으로 이 시스템을 구현하였으며 실험을 통한 실험 결과를 통해 그 효융성을 보인다.
https://doi.org/10.3745/KIPSTA.2006.13A.3.205 인용 PDF KSCI

Design and Implementation of a Massively Parallel Multithreaded Architecture: DAVRID

Sangho Ha;Kim, Junghwan;Park, Eunha;Yoonhee Hah;Sangyong Han;Daejoon Hwang;Kim, Heunghwan;Seungho Cho
- Journal of Electrical Engineering and information Science
- /
- 제1권2호
- /
- pp.15-26
- /
- 1996
MPAs(Massively Parallel Architectures) should address two fundamental issues for scalability: synchronization and communication latency. Dataflow architecture faces problems of excessive synchronization overhead and inefficient execution of sequential programs while they offer the ability to exploit massive parallelism inherent in programs. In contrast, MPAs based on von Neumann computational model may suffer from inefficient synchronization mechanism and communication latency. DAVRID (DAtaflow/Von Neumann RISC hybrID) is a massively parallel multithreaded architecture which takes advantages of von Neumann and dataflow models. It has good single thread performance as well as tolerates synchronization and communication latency. In this paper, we describe the DAVRID architecture in detail and evaluate its performance through simulation runs over several benchmarks.
PDF

벡타 연산을 효율적으로 수행하기 위한 다중 스레드 구조 (A Multithreaded Architecture for the Efficient Execution of Vector Computations)

윤성대;정기동
- 한국정보처리학회논문지
- /
- 제2권6호
- /
- pp.974-984
- /
- 1995
본 논문에서는 벡타연산을 효율적으로 수행하고 대단위 병렬시스템을 지원하는 다중 스레드구조, MULVEC(MULtithreaded architecture of the VEctor Computations) 을 제시한다. MULVEC은 데이타플로우 모델에 수퍼 스칼라 RISC 마이크로 프로세서를 갖는 기존의 폰 노이만 모델을 도입하였다. 그리고 동일한 스레드 세그멘트내에 벡타 연산이 반복되는 경우에 상태필드를 이용하여 동기화의 수를 감축시켰으며, 이에 의해 문맥전환 횟수, 통신량 등을 감소시켰다. 그리고 노드 수의 변화에 대한 MULVEC의 성능평가(프로그램들의 수행시간, 프로세서들의 이용율)와 *T의 성능평가(프로그램의 수행시간)를 SPARC station 20 (super scalar RISC microprocessor)에서 시뮬레이션을 하였으며, 노드의 수, 루프의 반복홋수 등에 따라 프로그램의 수행시간이 MULVEC이 *T보 다 약 1-2배 정도 빠르다는 것을 알 수 있었다.
PDF

검색결과 42건 처리시간 0.019초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)