• Title/Summary/Keyword: 다시점 영상 부호화

Search Result 92, Processing Time 0.026 seconds

Multiview Video Sequence CODEC with View Scalability (View Scalability를 고려한 다시점 동영상 코덱)

  • 임정은;손광훈
    • Journal of Broadcast Engineering
    • /
    • v.9 no.3
    • /
    • pp.236-245
    • /
    • 2004
  • A multiview sequence CODEC with view scaiability is proposed in this paper. We define a GGOP (Group of GOP) structure as a basic coding unit to efficiently code multiview sequences. 7he proposed CODEC provides flexible GGOP structures based on the number of views and baseline distances among cameras. Multiview sequences encode consists of disparity estimation/compensation, motion estimation/compensation, residual coding and rate control and generates multiview sequence bitstream. The main bitstream is the same as an MPEG-2 mono-sequence bitstream for MPEG-2 compatibility. The auxiliary bitstream contains information concerning the remaining multiview sequences except for the reference sequences. The proposed CODEC with view scalability provides that a number of view flints are selectively determined at the receiver according to the type of display modes. The proposed multiview sequence CODEC is tested with several multiview sequences to determine its flexibility. compatibility with MPEG-2 and view scaiability. In addition, we subjectively confirm that the decoded bitstreams with view scaiability can be Properly displayed by several types of display modes. including 3D monitors.

A Fast Mode Decision using Anchor Pictures for Multiview Video Coding (기준 화면을 이용한 다시점 영상 부호화의 빠른 모드 결정 방법)

  • Jung, Choong-Hyun;Shin, Kwang-Mu;Chung, Ki-Dong
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2010.06c
    • /
    • pp.530-533
    • /
    • 2010
  • 다시점 영상 부호화에서는 시점 간의 공간적 중복성을 이용하여 데이터 중복성을 제거하는 것이 중요하다. 독립적으로 부호화하는 동시 부호화 방법(simulcast)보다 부호화 효율이 더욱 향상하였지만 계산 복잡도가 증가하는 문제가 있다. 본 논문에서는 다시점 영상 부호화기의 계산 복잡도를 감소시키기 위한 빠른 모드 결정 방법을 제안한다. GOP 내의 양 끝에 위치하고 있는 기준 화면의 MAD를 계산하여 영역을 분할하고 영역 맵을 생성한다. 시점 간의 예측을 사용하는 시점의 경우 인접 시점의 기준 화면도 이용하여 영역을 분할한다. 생성된 맵은 비기준 화면의 부호화 시 적용되어 후보 모드를 조기에 판단한다. 이와 같은 방법을 적용한 후의 실험 결과, 화질의 손실이 거의 없으면서 부호화 시간은 평균 58.6% 감소하였고, 비트율은 평균 1.9% 증가하였다.

  • PDF

A Fast Mode Decision of Non-anchor Pictures in Multi-view Video Coding for 3D Applications (3D 응용을 위한 다시점 영상 부호화에서 비기준 화면의 빠른 모드결정 기법)

  • Jung, Choong-Hyun;Shin, Kwang-Mu;Park, Seong-Ho;Chung, Ki-Dong
    • Journal of Korea Multimedia Society
    • /
    • v.15 no.7
    • /
    • pp.859-869
    • /
    • 2012
  • The Multi-view Video Coding (MVC) which is exploiting disparities between views has been developed to improve the coding efficiency of multi-view video. But MVC has a problem of having high computing complexities because of disparity estimation. This paper propose a fast mode decision for non-anchor picture to reduce the computational time of MVC. The proposed method uses two phases. Anchor pictures in hierarchical B picture structure have a higher correlation with prediction mode selection of non-anchor pictures, so in the first phase, prediction mode of non-anchor pictures is selected by exploiting the macro-block regions in anchor picture. In the second phase, we select a reference direction of inter prediction mode exploiting a higher correlation among reference directions of inter prediction modes of 7 block sizes. Experimental results show that the proposed method could save average about 44% in the encoding time with negligible coding efficiency losses.

Fast Mode Decision using Global Disparity Vector for Multi-view Video Coding (다시점 영상 부호화에서 전역 변이 벡터를 이용한 고속 모드 결정)

  • Han, Dong-Hoon;Cho, Suk-Hee;Hur, Nam-Ho;Lee, Yung-Lyul
    • Journal of Broadcast Engineering
    • /
    • v.13 no.3
    • /
    • pp.328-338
    • /
    • 2008
  • Multi-view video coding (MVC) based on H.264/AVC encodes multiple views efficiently by using a prediction scheme that exploits inter-view correlation among multiple views. However, with the increase of the number of views and use of inter-view prediction among views, total encoding time will be increased in multiview video coding. In this paper, we propose a fast mode decision using both MB(Macroblock)-based region segmentation information corresponding to each view in multiple views and global disparity vector among views in order to reduce encoding time. The proposed method achieves on average 40% reduction of total encoding time with the objective video quality degradation of about 0.04 dB peak signal-to-noise ratio (PSNR) by using joint multi-view video model (JMVM) 4.0 that is the reference software of the multiview video coding standard.

Near-lossless Coding of Multiview Texture and Depth Information for Graphics Applications (그래픽스 응용을 위한 다시점 텍스처 및 깊이 정보의 근접 무손실 부호화)

  • Yoon, Seung-Uk;Ho, Yo-Sung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.46 no.1
    • /
    • pp.41-48
    • /
    • 2009
  • This Paper introduces representation and coding schemes of multiview texture and depth data for complex three-dimensional scenes. We represent input color and depth images using compressed texture and depth map pairs. The proposed X-codec encodes them further to increase compression ratio in a near-lossless way. Our system resolves two problems. First, rendering time and output visual quality depend on input image resolutions rather than scene complexity since a depth image-based rendering techniques is used. Second, the random access problem of conventional image-based rendering could be effectively solved using our image block-based compression schemes. From experimental results, the proposed approach is useful to graphics applications because it provides multiview rendering, selective decoding, and scene manipulation functionalities.

Temporal Prediction Structure for Multi-view Video Coding (다시점 비디오 부호화를 위한 시간적 예측 구조)

  • Yoon, Hyo-Sun;Kim, Mi-Young
    • Journal of Korea Multimedia Society
    • /
    • v.15 no.9
    • /
    • pp.1093-1101
    • /
    • 2012
  • Multi-view video is obtained by capturing one three-dimensional scene with many cameras at different positions. Multi-view video coding exploits inter-view correlations among pictures of neighboring views and temporal correlations among pictures of the same view. Multi-view video coding which uses many cameras requires a method to reduce the computational complexity. In this paper, we proposed an efficient prediction structure to improve performance of multi-view video coding. The proposed prediction structure exploits an average distance between the current picture and its reference pictures. The proposed prediction structure divides every GOP into several small groups to decide the maximum index of hierarchical B layer and the number of pictures of each B layer. Experimental results show that the proposed prediction structure shows good performance in image quality and bit-rates. When compared to the performance of hierarchical B pictures of Fraunhofer-HHI, the proposed prediction structure achieved 0.07~0.13 (dB) of PSNR gain and was down by 6.5(Kbps) in bitrate.

An Efficient Reference Picture Selection Method for MVC (다시점 비디오 부호화기를 위한 효율적인 참조 영상 선택 알고리즘)

  • Ryu, Seungchul;Seo, Jungdong;Kim, Donghyun;Sohn, Kwanghoon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2010.11a
    • /
    • pp.74-77
    • /
    • 2010
  • 다시점 비디오 부호화기(MVC)는 다양한 블록 크기 기반의 움직임 추정과 변이 추정을 수행한다. 또한 2개 이상의 다중 참조 영상 움직임 추정 기술을 사용한다. 이 기술들을 통해 MVC는 높은 부호화 효율을 얻을 수 있지만 실제 적용하기에는 너무 높은 부호화 복잡도가 걸림돌로 작용한다. 본 논문에서는 MVC의 부호화 복잡도를 감소시키기 위하여 효율적인 참조 영상 선택 알고리즘을 제안한다. 부호화에 사용된 참조 영상들은 인접한 블록들 간에 높은 상호 연관성을 가지므로, 부호화된 이웃 블록들의 참조 영상 정보를 기반으로 현재 블록의 참조 영상을 효율적으로 선택할 수 있다. 실험을 통해 제안된 알고리즘이 부호화 시간을 기존의 MVC에 비해 최대 73.3%, 평균 57.3% 감소시키며 부호화 효율의 감소는 무시할 만한 수준임을 확인하였다.

  • PDF

H.264 Encoding Technique of Multi-view Video expressed by Layered Depth Image (계층적 깊이 영상으로 표현된 다시점 비디오에 대한 H.264 부호화 기술)

  • Shin, Jong-Hong;Jee, Inn-Ho
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.14 no.2
    • /
    • pp.43-51
    • /
    • 2014
  • Multi-view video including depth image is necessary to develop a new compression encoding technique for storage and transmission, because of a huge amount of data. Layered depth image is an efficient representation method of multi-view video data. This method makes a data structure that is synthesis of multi-view color and depth image. This efficient method to compress new contents is suggested to use layered depth image representation and to apply for video compression encoding by using 3D warping. This paper proposed enhanced compression method using layered depth image representation and H.264/AVC video coding technology. In experimental results, we confirmed high compression performance and good quality of reconstructed image.

Multi-view Video Coding using View Interpolation (영상 보간을 이용한 다시점 비디오 부호화 방법)

  • Lee, Cheon;Oh, Kwan-Jung;Ho, Yo-Sung
    • Journal of Broadcast Engineering
    • /
    • v.12 no.2
    • /
    • pp.128-136
    • /
    • 2007
  • Since the multi-view video is a set of video sequences captured by multiple array cameras for the same three-dimensional scene, it can provide multiple viewpoint images using geometrical manipulation and intermediate view generation. Although multi-view video allows us to experience more realistic feeling with a wide range of images, the amount of data to be processed increases in proportion to the number of cameras. Therefore, we need to develop efficient coding methods. One of the possible approaches to multi-view video coding is to generate an intermediate image using view interpolation method and to use the interpolated image as an additional reference frame. The previous view interpolation method for multi-view video coding employs fixed size block matching over the pre-determined disparity search range. However, if the disparity search range is not proper, disparity error may occur. In this paper, we propose an efficient view interpolation method using initial disparity estimation, variable block-based estimation, and pixel-level estimation using adjusted search ranges. In addition, we propose a multi-view video coding method based on H.264/AVC to exploit the intermediate image. Intermediate images have been improved about $1{\sim}4dB$ using the proposed method compared to the previous view interpolation method, and the coding efficiency have been improved about 0.5 dB compared to the reference model.

Depth Image Compression based on a MPEG-4 SA-DCT for the Edge Preserving Method (MPEG-4 SA-DCT 기반의 경계 보존 방법을 이용한 깊이 영상 압축)

  • Kim, Dong-Hyun;Seo, Jung-Dong;Sohn, Kwang-Hoon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.11a
    • /
    • pp.119-122
    • /
    • 2009
  • 멀티미디어 처리 분야의 급속한 발전으로 인해 3차원 TV (3DTV)는 차세대 방송 시스템 시장에서 가장 주목을 받는 제품이 되었다. 3DTV는 사용자가 원하는 시점을 자유롭게 선택할 수 있고, 입체감을 제공하여 사용자가 마치 그 곳에 있는 듯한 효과를 줄 수 있다. 지금까지 입체 영상은 스테레오 영상을 기반으로 하나의 시점에 대한 입체 영상을 제공했지만 최근에는 다시점 영상을 이용하여 다양한 위치에서의 입체 영상을 제공하는 기술이 연구되고 있다. 다시점 영상은 사용자에게 임의 시점의 영상에 대한 시청을 가능케 하여 입체감 있는 화면을 제공할 수 있다. 입체감 있는 영상을 만들기 위해서는 다시점 영상의 시점 간 가상 시점을 생성할 수 있도록 하고 깊이 정보를 포함하고 있는 깊이 영상 (Depth Image)을 획득하여야 한다. 획득된 깊이 영상 데이터와 다시점 비디오 데이터를 동시에 전송하는 다시점 비디오 시스템이 상용화되기 위해서는 방대한 양의 데이터를 효율적으로 압축하는 다시점 비디오 부호화 기술 개발이 필수적이다. 본 논문에서는 기존의 컬러 영상의 효율적인 압축 방법을 제안하던 다시점 비디오 부호화 기술에 국한되지 않고 3차원 영상 화질을 객관적으로 높일 수 있도록 깊이 영상의 효율적 압축 기법에 대한 새로운 방법을 제안한다.

  • PDF