Search | Korea Science

Development of high-speed image interpolation method using CUDA (GPU를 이용한 고속 영상 보간법 개발)

Cui, Xue-Nan;Park, Eun-Soo;Kim, Jun-Chui;Jung, Young-Han;Kim, Hak-Il
- Proceedings of the KIEE Conference
- /
- 2008.10b
- /
- pp.300-301
- /
- 2008
본 논문에서는 GPU를 이용한 고속 보간법 개발방법에 대해 제안한다. GPU는 흔히 그래픽 연산에 사용되지만 최근에는 GPGPH가 각광을 받고 있다. 특히 NVIDIA에서 발표한 CUDA를 이용하면 GPU를 쉽게 접근하여 프로세싱 할 수 있어 많은 분야에서 GPU를 활용하고 있다. 본 논문에서는 실제 CUDA를 이용하여 여러 가지 보간법에 대한 알고리즘을 구현하여 CUDA의 성능을 확인하였다. CPU에서 구현한 알고리즘과 CUDA를 이용한 알고리즘을 비교했을 때 메모리 할당 및 전송부분을 제외한 수순 프로세싱 시간을 보면 CPU에서 훨씬 좋은 성능을 나타내었고, 메모리 할당 및 전송을 고려했을 때 작은 사이즈 영상에서는 오히려 역효과가 나타났고, 대용량 영상에서는 좋은 성능을 나타냄을 확인하였다.
PDF

A Block Allocation Scheme for Rapid Mount in YAFFS (신속한 Mount 를 위한 YAFFS 에서의 블록 할당 방식)

Kim, Seok-Hyun;Lee, Jae-Heung;Oh, Jin-Ha;Min, Hong;Gu, Bon-Cheol;Yi, Sang-Ho;Cho, Yoo-Kun
- Proceedings of the Korea Information Processing Society Conference
- /
- 2007.11a
- /
- pp.753-754
- /
- 2007
YAFFS 는 리눅스에서 사용되는 NAND 플래시 전용 파일 시스템이다. YAFFS 는 파일 시스템을 mount 할 때 전체 플래시를 scan 하여 메모리 상에 파일 시스템의 디렉토리 트리 구조를 만든다. 이 작업은 전체 메모리를 scan 하기 때문에 이 작업은 많은 시간을 필요로 한다. 이 논문에서는 YAFFS 의 블록 할당 방식을 개선하여 mount 시간을 크게 줄일 수 있는 방식을 제안한다.
https://doi.org/10.3745/PKIPS.y2007m11a.753 인용 PDF

A Parallel Task Oriented Memory Manager for Dynamic Objects (동적 객체에 대한 병렬 타스크 중심의 메모리 관리기)

Kim, Eun-Jeong;Bae, Jong-Min
- The Transactions of the Korea Information Processing Society
- /
- v.4 no.5
- /
- pp.1391-1400
- /
- 1997
공유 메모리 다중 프로세서상에서 많은 동적 객체를 생성하는 언어가 실행될 때, 동적 객체에 대한 메모리 관리 알고리즘은 프로그램의 실행 속도에 큰 영향을 미친다. 본 논문에서는 이러한 환경에서 프로그램의 성능을 향상 시킬 수 있는 새로운 메모리 관리 알고리즘을 제안한다. 이를 위해 힘 영역의 할당 및 회수 작업을 병렬 타스크 중심으로 행한다. 또한 동적 객체를 병렬 타스크사이에 공유 되는 객체(shared data) 와 비공유 객체(mon-shared data)로 구분하고, 힘 영역을 공동 영역과 전용 영역으로 분리 한다. 이는 병렬 타스크가 동적으로 스케줄링되는 것을 자유롭게 하고 창조 지역성 을 높이는 효과가 있으며, 전용 영역에 대한 메모리 재사용으로 인하여 볼용 셀수집기의 수행 횟수를 줄일 수 있다.
PDF

The Design of Hybrid Memory Memcached based on SSD (SSD에 기반한 하이브리드 메모리 멤캐시드 설계)

Yoo, Sang-Hyun;Youn, Hee-Yong
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2014.07a
- /
- pp.333-335
- /
- 2014
페이스북, 트위터와 같은 클라우드 및 웹 서비스 제공회사와 클라우드 및 웹 서비스 제공자는 수많은 사용자들에 의해 발생되는 무수한 데이터를 빠르게 처리하기 위해 하드 디스크보다는 램에 저장 할 필요가 있다. 그러한 좋은 도구로서 분산메모리 객체 캐싱 소프트웨어인 멤캐시드가 있다. 멤캐시드의 성능은 저장공간의 크기에 따라 많은 차이를 보이는데, 하드웨어의 비용, 전력소비와 온도조절 등 공간의 제약을 감안했을 때, 무작정 개별 서버에 많은 RAM을 장착하거나, 서버 배열을 확장하는 것은 효율적인 방법이 아니다. 따라서 많은 양의 데이터가 메모리에 저장이 가능하도록 RAM과 SSD를 같이 확장한 SSD 기반 하이브리드 메모리를 제안한다. 하이브리드 메모리는 객체 캐시로 동작하고 페이지 단위로 할당하는 것보다 객체 단위로 자원할당을 함으로서 SSD에서 빠른 무작위 읽기를 할 수 있게 해 객체의 접근속도를 향상시켰다.
PDF

Flash Memory Management Policy To Improve Wear-Leveling using Hot-and-Cold Flag (Hot-and-Cold 플래그를 이용한 플래시 메모리 균등 사용을 위한 관리 정책)

Jung, Phil-Young;Lee, Sung-Joon;Ahn, Kwang-Seon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2003.05a
- /
- pp.479-482
- /
- 2003
비휘발성 메모리인 플래시 메모리는 가볍고, 전력 소비가 적으며, 충격에 대한 저항이 강한 특징이 있다. 그러나 플래시 메모리는 세그먼트당 삭제 횟수가 최악의 경우 10만 번 정도로 제한된다는 단점이 있어서, 만약 삭제가 특정 세그먼트에 집중된다면 전체 수명이 단축되게 된다. 하지만, 플래시 메모리의 모든 세그먼트가 골고루 사용된다면 수명을 연장시킬 수 있다. 이를 위해 저장되는 자료의 유형을 파악하여 Hot, Cold 그리고 Lukewarm 이라는 그룹별로 분리해서 관리한다. 단순한 분리는 사용 횟수의 양극화를 가져오지만, 양극화된 세그먼트에 반대 유형의 자료를 할당하고 관리하면 전체 세그먼트의 사용률을 균등하게 할 수 있다. 이를 위해, 쓰기와 클리닝 작업 외에 양극화의 패턴을 통해 유형을 결정해주는 루틴을 포함하였다.
PDF

CSR Sparse Matrix Vector Multiplication Using Zero Copy (Zero Copy를 이용한 CSR 희소행렬 연산)

Yoon, SangHyeuk;Jeon, Dayun;Park, Neungsoo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2021.05a
- /
- pp.45-47
- /
- 2021
APU(Accelerated Processing Unit)는 CPU와 GPU가 통합되어있는 프로세서이며 같은 메모리 공간을 사용한다. CPU와 GPU가 분리되어있는 기존 이종 컴퓨팅 환경에서는 GPU가 작업을 처리하기 위해 CPU에서 GPU로 메모리 복사가 이루어졌지만, APU는 같은 메모리 공간을 사용하므로 메모리 복사 없이 가상주소 할당으로 같은 물리 주소에 접근할 수 있으며 이를 Zero Copy라 한다. Zero Copy 성능을 테스트하기 위해 희소행렬 연산을 사용하였으며 기존 메모리 복사대비 크기가 큰 데이터는 약 4.67배, 크기가 작은 데이터는 약 6.27배 빨랐다.
https://doi.org/10.3745/PKIPS.y2021m05a.45 인용 PDF

Design of an Automated Testing Tool to Detect Dynamic Memory Access Errors in C Programs (C언어 기반 프로그램의 동적 메모리 접근 오류 테스트 자동화 도구 설계)

Cho, Dae-Wan;Oh, Seung-Uk;Kim, Hyeon-Soo
- Journal of KIISE:Software and Applications
- /
- v.34 no.8
- /
- pp.708-720
- /
- 2007
Memory access errors are frequently occurred in computer programs written in C programming language [1,2]. Accordingly, a number of research works have suggested a wide variety of methods to detect such errors automatically. However, they have one or more of the following problems: inability to detect all memory errors, changing the memory allocation mechanism, and excessive performance overhead. To cope with these problems, in this paper we suggest a new and automated tool to detect dynamic memory access errors in C programs.
PDF KSCI

BIM Geometry Cache Structure for Data Streaming with Large Volume (대용량 BIM 형상 데이터 스트리밍을 위한 캐쉬 구조)

Kang, Tae-Wook
- Journal of the Korea Academia-Industrial cooperation Society
- /
- v.18 no.9
- /
- pp.1-8
- /
- 2017
The purpose of this study is to propose a cache structure for processing large-volume building information modeling (BIM) geometry data,whereit is difficult to allocate physical memory. As the number of BIM orders has increased in the public sector, it is becoming more common to visualize and calculate large-volume BIM geometry data. Design and review collaboration can require a lot of time to download large-volume BIM data through the network. If the BIM data exceeds the physical free-memory limit, visualization and geometry computation cannot be possible. In order to utilize large amounts of BIM data on insufficient physical memory or a low-bandwidth network, it is advantageous to cache only the data necessary for BIM geometry rendering and calculation time. Thisstudy proposes acache structure for efficiently rendering and calculating large-volume BIM geometry data where it is difficult to allocate enough physical memory.
https://doi.org/10.5762/KAIS.2017.18.9.1 인용 PDF KSCI

Compact Field Remapping for Dynamically Allocated Structures (동적으로 할당된 구조체를 위한 압축된 필드 재배치)

Kim, Jeong-Eun;Han, Hwan-Soo
- Journal of KIISE:Software and Applications
- /
- v.32 no.10
- /
- pp.1003-1012
- /
- 2005
The most significant difference of embedded systems from general purpose systems is that embedded systems are allowed to use only limited resources including battery and memory. Especially, the number of applications increases which deal with multimedia data. In those systems with high data computations, the delay of memory access is one of the major bottlenecks hurting the system performance. As a result, many researchers have investigated various techniques to reduce the memory access cost. Most programs generally have locality in memory references. Temporal locality of references means that a resource accessed at one point will be used again in the near future. Spatial locality of references is that likelihood of using a resource gets higher if resources near it were just accessed. The latest embedded processors usually adapt cache memory to exploit these two types of localities. Processors access faster cache memory than off-chip memory, reducing the latency. In this paper we will propose the enhanced dynamic allocation technique for structure-type data in order to eliminate unused memory space and to reduce both the cache miss rate and the application execution time. The proposed approach aggregates fields from multiple records dynamically allocated and consecutively remaps them on the memory space. Experiments on Olden benchmarks show $13.9\%$ L1 cache miss rate drop and $15.9\%$ L2 cache miss drop on average, compared to the previously proposed techniques. We also find execution time reduced by $10.9\%$ on average, compared to the previous work.
PDF KSCI

Memory Management based Hybrid Transactional Memory Scheme for Efficiently Processing Transactions in Multi-core Environment (멀티코어 환경에서 효율적인 트랜잭션 처리를 위한 메모리 관리 기반 하이브리드 트랜잭셔널 메모리 기법)

Jang, Yeon-Woo;Kang, Moon-Hwan;Chang, Jae-Woo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2017.04a
- /
- pp.795-798
- /
- 2017
최근 멀티코어 프로세서가 개발됨에 따라 병렬 프로그래밍은 멀티코어를 효과적으로 활용하기 위한 기법으로 그 중요성이 높아지고 있다. 트랜잭셔널 메모리는 처리 방식에 따라 HTM, STM, HyTM으로 구분되며, 최근 HTM 및 STM 결합한 HyTM 이 활발히 연구되고 있다. 그러나 기존의 HyTM 는 HTM과 STM의 동시성 제어를 위해 블룸필터를 사용하는 반면, 블룸필터의 자체적인 긍정 오류를 해결하지 못한다. 아울러, 트랜잭션 처리를 위한 메모리 할당/해제를 기존의 락 메커니즘을 사용하여 관리한다. 따라서 멀티코어 환경에서 스레드 수가 증가할수록 트랜잭션 처리 효율이 떨어진다. 본 논문에서는 멀티코어 환경에서 효율적인 트랜잭션 처리를 위한 메모리 관리 기반 하이브리드 트랜잭셔널 메모리 기법을 제안한다. 제안하는 기법은 트랜잭션 처리에 최적화된 블룸필터를 제공함으로써, 병렬적으로 동시에 수행되는 서로 다른 환경의 트랜잭션에 대해 일관성 있는 처리를 지원한다. 아울러, CPU 캐시라인에 최적화된 메모리 기법을 통해, 메모리 할당량이 적은 트랜잭션은 로컬 캐시에 할당함으로써 트랜잭션의 빠른 처리를 지원한다.
https://doi.org/10.3745/PKIPS.y2017m04a.795 인용 PDF

Search Result 260, Processing Time 0.047 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)