Search | Korea Science

Fast Computation of DWT and JPEG2000 using GPU (GPU를 이용한 DWT 및 JPEG2000의 고속 연산)

Lee, Man-Hee;Park, In-Kyu;Won, Seok-Jin;Cho, Sung-Dae
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.44 no.6
- /
- pp.9-15
- /
- 2007
In this paper, we propose an efficient method for Processing DWT (Discrete Wavelet Transform) on GPU (Graphics Processing Unit). Since the DWT and EBCOT (embedded block coding with optimized truncation) are the most complicated submodules in JPEG2000, we design a high-performance processing framework for performing DWT using the fragment shader of GPU based on the render-to-texture (RTT) architecture. Experimental results show that the performance increases significantly, in which DWT running on modern GPU is more than 10 times faster than on modern CPU. Furthermore, by replacing the DWT part of Jasper which is the JPEG2000 reference software, the overall processing is 2$\sim$16 times faster than the original JasPer. The GPU-driven render-to-texture architecture proposed in this paper can be used in the general image and computer vision processing for high-speed processing.
PDF KSCI

A 3D Memory System Allowing Multi-Access (다중접근을 허용하는 3차원 메모리 시스템)

이형
- Journal of KIISE:Computer Systems and Theory
- /
- v.32 no.9
- /
- pp.457-464
- /
- 2005
In this paper a 3D memory system that allows 17 access types at an arbitrary position is introduced. The proposed memory system is based on two main functions: memory module assignment function and address assignment function. Based on them, the memory system supports 17 access types: 13 Lines, 3 Rectangles, and 1 Hexahedron. That is, the memory system allows simultaneous access to multiple data in any access types at an arbitrary position with a constant interval. In order to allow 17 access types the memory system consists of memory module selection circuitry, data routing circuitry for READ/WRITE, and address calculation/routing circuitry In the point of view of a developer and a programmer, the memory system proposed in this paper supports easy hardware extension according to the applications and both of them to deal with it as a logical three-dimensional away In addition, multiple data in various across types can be simultaneously accessed with a constant interval. Therefore, the memory system is suitable for building systems related to ,3D applications (e.g. volume rendering and volume clipping) and a frame buffer for multi-resolution.
PDF KSCI

Adaptive Service Mode Conversion to Minimize Buffer Space Requirement in VOD Server (주문형 비디오 서버의 버퍼 최소화를 위한 가변적 서비스 모드 변환)

Won, Yu-Jip
- Journal of KIISE:Computer Systems and Theory
- /
- v.28 no.5
- /
- pp.213-217
- /
- 2001
Excessive memory buffer requirement in continuous media playback is a serious impediment of wide spread usage of on-line multimedia service. Skewed access frequency of available video files provides an opportunity of re-using the date blocks which has been loaded by one session for later usage. We present novel algorithm which minimizes the buffer requirement in multiple sessions of multimedia playbacks. In continuous media playback originated from the disk, a certain amount of memory buffer is required to synchronize asynchronous disk. Read operation and synchronous playback operation. As aggregate playback bandwodth increases, larger amount of buffer needs to be allocated for this synchronization purpose. The focus of this work is to study the asymptotic behavior of the synchronization buffer requirement and to develop an algorithm coping with this excessive buffer requirement under bandwidth congestioon. We argue that in a large scale continuous media server, it may not be necessary to read the blocks for each session directly from the disk. The beauty of our work lies in the fact that it dynamically adapts to disk utilization of the server and finds the optimal way of servicinh the individual sessions while minimizing the overall buffer space requirement. Optimality of the proposed algorithm is shown by proof. The effectiveness and performance of the proposed scheme is examined via simulation.
PDF

Asymmetric Motion Vector-Based Side Information Generation for Efficient Distributed Video Coding (효과적인 분산 비디오 부호화를 위한 비대칭성 움직임 벡터 기반 보조정보 생성 방법)

Na, Taeyoung;Kim, Munchurl
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2010.11a
- /
- pp.129-131
- /
- 2010
분산 비디오 부호화(distributed video coding)는 분산 소스 부호화의 대표적인 응용분야로서 부호화 복잡도가 부호화기에서 복호화기로 이동되어 저전력 부호화 환경에 매우 적합하다. 본 논문에서는 분산 비디오 부호화의 성능 향상에 있어 가장 중요한 보조 정보의 효과적인 생성 방법을 제안한다. 우선 보조 정보 생성을 위한 키 프레임들 간의 블록 움직임 추정에 있어 기존 방법들이 대체적으로 가정하고 있는 선형적인 움직임 이동에 따른 잘못된 예측을 해결하기 위해 두 장 이상의 키 프레임을 사용하여 블록 움직임을 추정한 후, 선형 회귀(linear regression)를 이용하여 보조 정보 상의 블록 움직임 궤적을 추정한다. 이때 움직임 추정을 위한 키 프레임 번호를 증가하며 선입선출(FIFO)형 버퍼에 저장 및 삭제하여 동일한 보조정보에 해당하는 여러 움직임 벡터 필드와, 기존의 선형적인 움직임이 가정된 움직임 벡터 필드를 동시에 생성한다. 다음으로 보간(interpolation)하려는 보조 정보 프레임 내의 임의의 블록에 가장 가깝게 통과하는 움직임 벡터 필드를 선택하여 해당하는 블록의 최종 움직임 벡터로 선택한다. 실험결과 제안하는 보조 정보 생성 방법은 기존의 방법과 비교했을 때 비대칭성 움직임 벡터 사용만으로 평균 PSNR이 0.216dB 만큼 증가하는 것을 확인할 수 있었다.
PDF

The Development of Frame Buffer Structure for Automatic Dynamic Resolution Switching on Android Mobile Platform (안드로이드 모바일 플랫폼에서의 동적 해상도 자동전환을 위한 프레임버퍼 구조 개발)

Kim, Ju-Sung;Kwon, Oh-Chul;Lee, Chan-Gun
- Proceedings of the Korean Information Science Society Conference
- /
- 2010.06a
- /
- pp.269-270
- /
- 2010
PDF

Deblocking Filter Parallelization for HEVC Hardware Design (HEVC 하드웨어 구현을 위한 디블록킹 필터 병렬화)

Kim, Dae-Eun;Kim, Munchurl;Kim, Hyunmi
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2012.07a
- /
- pp.236-238
- /
- 2012
본 논문에서는 HEVC 코덱에서 프레임 단위로 수행되던 디블록킹 필터를 하드웨어 구현 시에 LCU 단위로 처리되는 파이프라인 구조를 적용하여 병렬적으로 수행할 수 있는 방법을 제안한다. 파이프라인 구조에서는 현재 처리되고 있는 하나의 LCU 에 대해 디블록킹 필터를 수행하기 위해서 현재 처리하고 있는 LCU 뿐만 아니라 주변의 LCU 의 화소 값 등의 정보가 필요하며 주변의 LCU 의 화소 값을 모두 저장하는 것은 불필요한 메모리소모를 야기해 HEVC 코덱의 복잡도를 증가시킬 수 있다. 또한 현재 처리되는 LCU 의 경계에 디블록킹 필터를 수행하는 경우 현재 처리되는 LCU 이전의 수정할 수 없는 LCU 의 화소 값도 수정되어야 한다. 따라서 본 논문에서는 이를 해결하기 위해 수평 버퍼 와 수직 버퍼의 개념을 도입하여 처리되는 LCU 의 왼쪽 LCU 의 오른쪽 끝 4 열의 화소와 위쪽 LCU 의 아래쪽 끝 4 행의 화소만을 저장하여 메모리를 합리적으로 사용하는 방법을 제시하고 평행이동 LCU 개념을 적용하여 수정 불가능한 화소 값들을 처리하는 방법을 제시한다. 제안된 구조에 따라 구현된 소프트웨어 상에서 기존의 참조 소프트웨어인 HM6.0 과 동일한 결과를 얻을 수 있었다.
PDF

A Digital Right Management System based on Shared Key fool for Video Data Protection (동영상 데이터 보호를 위한 공유 키 풀 기반의 DRM 시스템)

Kim Jung-Jae;Park Jae-Pyo;Jun Moon-Seog
- The KIPS Transactions:PartC
- /
- v.12C no.2 s.98
- /
- pp.183-190
- /
- 2005
In this thesis, first, we propose I-frame encryption techniques of video data for video data itself encryption and propose license agent that processing user's certification and decryption in client system automatically when user execute encrypted video data in system server. License agent runs user's certification, encryption and decryption of video data based on PID(Public Key Infrastructure) using shared key-pool when execute of video data. Also, compose duplex buffer control and propose real time decryption method using efficient buffer scheduling to reduce much playing delay times that happen processing decryption when execute of videoa data of high-capacity.
https://doi.org/10.3745/KIPSTC.2005.12C.2.183 인용 PDF KSCI

Improvement on EBA Smoothing Algorithm for Efficient Resource Utilization (효율적인 자원 이용을 위한 EBA 스무딩 알고리즘의 개선)

Lee Myoun-Jae;Park Do-Soon
- Proceedings of the Korean Information Science Society Conference
- /
- 2005.07a
- /
- pp.358-360
- /
- 2005
스무딩은 가변 비트율로 저장된 비디오 데이터를 클라이언트로 전송할 때 일련의 고정 비트율로 전송할 수 있도록 전송 계획을 세우는 것이다. 이러한 스무딩 알고리즘들 중에서 e-PCRTT 알고리즘은 전송률 변화 횟수가 주어지고 구간의 크기가 고정이어서 전송률 변화 횟수, 첨두 전송률, 버퍼 이용률 등의 평가 요소들이 증가될 수 있다. 이러한 문제점을 개선하기 위해 전송률 변화 횟수의 제한이 없고 구간의 크기가 가변적인 EBA(Enhanced Bandwidth Allocation)[9,10] 스무딩 알고리즘을 제안한 바 있다. 그러나 이 방법에서는 전송률의 증가 또는 감소를 고려하지 않고 전송률을 변화시키기 때문에 이전 구간의 전송률에 비해 급격하게 높은 전송률이 요구될 수 있으며 증가되는 전송률로 보내야 하는 프레임 개수가 많아질 수 있다. 이는 네트워크 자원의 효율적인 사용을 어렵게 할 수 있다. 따라서, 본 논문에서는 EBA 알고리즘을 개선하여 첨두 전송률과 증가되는 전송률로 보내야 되는 프레임 개수를 감소시키는 알고리즘을 제안한다. 제안된 알고리즘의 성능은 E.T 90의 비디오 소스를 가지고 EBA 알고리즘과 전송률 변화 횟수, 첨두 전송률, 증가되는 전송률로 보내야 하는 프레임 개수 등을 비교 분석하여 평가하였다.
PDF

A JPEG Input Buffer Architecture for Real-Time Applications (실시간 JPEG 입력 버퍼 아키텍처)

Im, Min-Jung
- Journal of the Institute of Electronics Engineers of Korea SD
- /
- v.39 no.2
- /
- pp.7-13
- /
- 2002
When a USB digital camera is used for PC video-conference applications, motion picture data need to be transferred to the PC through the USB port. Due to the mismatch between the data rates of the USB and the motion picture, data compression should be performed before the transmission from the USB. While many motion picture compression algorithms require large intermediate memory space, the JPEG algorithm does not need to store an entire frame for the compression. Instead, a relatively small buffer is required at the input of the JPEG compression engine to resolve the inconsistency between the orders of the inputted data and the consumed data. Data reordering can be easily implemented using a double buffering scheme, which still requires a considerable size of memory. In this paper, a novel memory management scheme is proposed to avoid the double buffering. The proposed memory architecture requires a small amount of memory and a simple address generation scheme, resulting in overall cost reduction.
PDF KSCI

A Study on RTP-based Lip Synchronization Control for Very Low Delay in Video Communication (초저지연 비디오 통신을 위한 RTP 기반 립싱크 제어 기술에 관한 연구)

Kim, Byoung-Yong;Lee, Dong-Jin;Kwon, Jae-Cheol;Sim, Dong-Gyu
- Journal of Korea Multimedia Society
- /
- v.10 no.8
- /
- pp.1039-1051
- /
- 2007
In this paper, a new lip synchronization control method is proposed to achieve very low delay in the video communication. The lip control is so much vital in video communication as delay reduction. In a general way, to control the lip synchronization, both the playtime and capture time calculated from RTP time stamp are used. RTP timestamp is created by stream sender and sent to the receiver along the stream. It is extracted from the received packet by stream receiver to calculate playtime and capture time. In this paper, we propose the method of searching most adjacent corresponding frame of the audio signal, which is assumed to be played with uniform speed. Encoding buffer of stream sender is removed to reduce the buffering delay. Besides, decoder buffer of receiver, which is used to correct the cracked packet, is resulted to process only 3 frames. These mechanisms enable us to achieve ultra low delay less than 100 ms, which is essential to video communication. Through simulations, the proposed method shows below the 100 ms delay and controlled the lip synchronization between audio and video.
PDF

Search Result 186, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)