• Title/Summary/Keyword: Stereo Coding

Volumetric Image System for High Efficiency Video Coding (고효율 비디오코딩을 위한 입체영상시스템)

  • Kim, Sang Hyun
    • The Journal of the Korea Contents Association / v.16 no.1 / pp.515-520 / 2016
  • Volumetric image systems have recently found many applications in education, 3D movies, and medical imaging, but these applications still face several problems that need to be overcome. A volumetric display must process a large amount of visual data, so it requires a highly efficient vision system for real-time display. In a stereo system for volumetric display, motion vectors and disparity vectors from the stereoscopic sequences are transmitted together with the residual images relative to the reference images, and the stereoscopic sequences are reconstructed at the receiver. The central issue in designing an efficient volumetric image system therefore lies in selecting an appropriate stereo matching method and a robust vision system. In this paper, we propose a highly efficient vision system whose vision stage rotates and moves horizontally and which matches successive stereo images efficiently. In experiments with a volumetric image system, the proposed method achieves high efficiency with minimal error and a low computational load for volumetric display.

Multi-band Approach to Deep Learning-Based Artificial Stereo Extension

  • Jeon, Kwang Myung;Park, Su Yeon;Chun, Chan Jun;Park, Nam In;Kim, Hong Kook
    • ETRI Journal / v.39 no.3 / pp.398-405 / 2017
  • In this paper, an artificial stereo extension method that creates stereophonic sound from a mono sound source is proposed. The proposed method first trains deep neural networks (DNNs) that model the nonlinear relationship between the dominant and residual signals of the stereo channel. In the training stage, the band-wise log spectral magnitude and unwrapped phase of both the dominant and residual signals are utilized to model the nonlinearities of each sub-band through deep architecture. From that point, stereo extension is conducted by estimating the residual signal that corresponds to the input mono channel signal with the trained DNN model in a sub-band domain. The performance of the proposed method was evaluated using a log spectral distortion (LSD) measure and multiple stimuli with a hidden reference and anchor (MUSHRA) test. The results showed that the proposed method provided a lower LSD and higher MUSHRA score than conventional methods that use hidden Markov models and DNN with full-band processing.
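
The log spectral distortion measure mentioned above can be sketched as follows. This is a minimal illustration of one common LSD definition (root-mean-square difference of log power spectra per frame, averaged over frames); the exact variant used in the paper is not stated in the abstract, and the function name, the `eps` guard, and the list-of-frames input format are assumptions made for this example.

```python
import math

def log_spectral_distortion(ref_frames, est_frames, eps=1e-12):
    """Mean LSD in dB over frames; each frame is a list of spectral magnitudes."""
    per_frame = []
    for ref, est in zip(ref_frames, est_frames):
        # Squared difference of the log power spectra, bin by bin.
        # eps (an assumption) avoids log of zero for empty bins.
        sq = [
            (10.0 * math.log10((r * r + eps) / (e * e + eps))) ** 2
            for r, e in zip(ref, est)
        ]
        # Root mean square over the frequency bins of this frame.
        per_frame.append(math.sqrt(sum(sq) / len(sq)))
    # Average the per-frame distortions.
    return sum(per_frame) / len(per_frame)
```

A lower value means the estimated spectrum is closer to the reference; identical spectra give a distortion of zero.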

High efficient 3D vision system using simplification of stereo image rectification structure (스테레오 영상 교정 구조의 간략화를 이용한 고효율 3D 비젼시스템)

  • Kim, Sang Hyun
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.12 no.6 / pp.605-611 / 2019
  • 3D vision systems have recently found many applications, but their popularization still faces many problems that need to be overcome. A volumetric display must process a large amount of visual data, so it requires a highly efficient vision system for display. In a stereo system for volumetric display, disparity vectors from the stereoscopic sequences are transmitted together with the residual images relative to the reference images, and the reconstructed stereoscopic sequences are displayed at the receiver. The central issue in designing an efficient volumetric vision system therefore lies in selecting an appropriate stereo matching method and a robust vision system. In this paper, we propose a highly efficient vision system that reduces rectification error and extracts 3D data with low computational complexity. In experiments, the proposed vision system extracts 3D data efficiently, with reduced rectification error and low computational complexity.

Robust Primary-ambient Signal Decomposition Method using Principal Component Analysis with Phase Alignment (위상 정렬을 이용한 주성분 분석법의 강인한 스테레오 음원 분리 성능유지 기법)

  • Baek, Yong-Hyun;Hyun, Dong-Il;Park, Young-Cheol
    • Journal of Broadcast Engineering / v.19 no.1 / pp.64-74 / 2014
  • The primary and ambient signal decomposition of stereo sound is a key step in stereo upmixing. Principal component analysis (PCA) is one of the most widely used methods for primary-ambient signal decomposition. However, previous PCA-based decomposition algorithms assume that stereo sound sources are only amplitude-panned, without any consideration of phase difference, which causes some performance degradation for live-recorded stereo sound. In this paper, we propose a new PCA-based stereo decomposition algorithm that takes the phase difference between the channel signals into account. The proposed algorithm overcomes the limitation of the conventional signal model by using PCA with phase alignment. The phase alignment is realized with the inter-channel phase difference (IPD), which is widely used in parametric stereo coding. Moreover, Enhanced Modified PCA (EMPCA) is combined with it to solve the problems of conventional PCA caused by its dependency on the primary-to-ambient energy ratio (PAR) and the panning angle. Simulation results are presented to show the improvements of the proposed algorithm.
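
For orientation, conventional PCA-based primary-ambient decomposition (the baseline the abstract describes, not the proposed phase-aligned or EMPCA variant) can be sketched in the time domain as below. The function name is hypothetical, the signals are assumed to be zero-mean, and the whole signal is treated as a single analysis block; practical systems usually work per frame and per frequency band.

```python
import math

def pca_primary_ambient(left, right):
    """Split a stereo pair into primary and ambient parts with plain PCA."""
    # Zero-lag auto- and cross-correlations (signals assumed zero-mean).
    r_ll = sum(l * l for l in left)
    r_rr = sum(r * r for r in right)
    r_lr = sum(l * r for l, r in zip(left, right))
    # Angle of the principal eigenvector of [[r_ll, r_lr], [r_lr, r_rr]].
    theta = 0.5 * math.atan2(2.0 * r_lr, r_ll - r_rr)
    v_l, v_r = math.cos(theta), math.sin(theta)
    primary, ambient = [], []
    for l, r in zip(left, right):
        p = v_l * l + v_r * r  # projection onto the principal axis
        primary.append((p * v_l, p * v_r))
        # Whatever the principal component does not explain is ambient.
        ambient.append((l - p * v_l, r - p * v_r))
    return primary, ambient
```

For a purely amplitude-panned source the ambient part comes out as zero. When the channels differ in phase, the zero-lag cross-correlation shrinks and primary energy leaks into the ambient estimate, which is the limitation the phase alignment is meant to address.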

Stereo image compression based on error concealment for 3D television (3차원 텔레비전을 위한 에러 은닉 기반 스테레오 영상 압축)

  • Bak, Sungchul;Sim, Donggyu;Namkung, Jae-Chan;Oh, Seoung-jun
    • Journal of Broadcast Engineering / v.10 no.3 / pp.286-296 / 2005
  • This paper presents a stereo-based image compression and transmission system for realistic 3D television. In the proposed system, a disparity map is extracted from an input stereo image pair, and the extracted disparity map and one of the two input images are transmitted or stored at a local or remote site. However, correspondences cannot be determined in occlusion areas, so it is not easy to recover 3D information in such regions. In this paper, a reconstruction image compensation algorithm based on error block concealment and in-loop filtering is proposed to minimize the reconstruction error when generating the stereo image pair. The effectiveness of the proposed algorithm is shown in terms of the objective accuracy of the reconstructed image for several real stereo image pairs.
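
The occlusion problem described above can be illustrated with a small sketch that rebuilds one scanline of the second view from the transmitted view and its disparity map. The function name, the integer disparities, and the convention that a left-view pixel at `x` appears at `x - d` in the right view are assumptions for this example; the paper's concealment and in-loop filtering steps are not shown.

```python
def reconstruct_right_row(left_row, disparity_row, hole=None):
    """Forward-warp one scanline of the left view into the right view."""
    width = len(left_row)
    right_row = [hole] * width
    best_disp = [-1] * width
    for x in range(width):
        d = disparity_row[x]
        # Assumption: a left pixel at x appears at x - d in the right view.
        target = x - d
        if 0 <= target < width and d > best_disp[target]:
            # A larger disparity means a nearer object, so it wins conflicts.
            right_row[target] = left_row[x]
            best_disp[target] = d
    # Positions still equal to `hole` received no pixel (occluded regions).
    return right_row
```

Positions that remain `hole` are exactly the regions with no correspondence; a compensation step such as the block concealment named in the abstract would have to fill them.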

High-level framework for scalable 3D video coding based on HEVC (HEVC 기반 삼차원 영상의 스케일러블 전송을 위한 확장 시스템)

  • Choi, Byeongdoo;Cho, Yongjin;Park, Min Woo;Lee, Jin Young;Wey, Hocheon;Kim, Chanyul
    • Proceedings of the Korean Society of Broadcast Engineers Conference / 2013.06a / pp.182-184 / 2013
  • An HEVC-based scalable 3D video coding system is proposed. The proposed system supports scalable transmission of multiview video data with depth maps. The key technologies in this system are reference picture management, reference picture list construction, and cross-layer dependency signaling. All the proposed technologies are used in the development of a video coding system for UHD stereo displays and glasses-free 3D displays.

MPEG Surround for Multi-Channel Audio Coding-Part 2: Various Modes and Tools (다채널 오디오 코딩을 위한 MPEG Surround-2부: 다양한 모드 및 툴들)

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea / v.28 no.7 / pp.610-617 / 2009
  • An overview of various modes and tools of MPEG Surround is provided. Because the binaural mode of MPEG Surround supports virtual 5.1-channel playback based on HRTFs, it can be played via headphones and earphones on portable audio devices. MPEG Surround also supports the enhanced matrix mode, which converts stereo signals to 5.1-channel signals without side information; the 3D stereo mode, which deals with 3D-coded signals; and the low power version, which greatly reduces the computational load of the decoding process. In addition, MPEG Surround provides the arbitrary downmix gains (ADGs) tool, which is applied to artistic downmix signals; the matrix compatibility tool, which is applied to signals downmixed by conventional matrix-based methods; the residual coding tool, which can be used at high bit rates; and the GES tool, which is applied to specific sounds such as applause. Listening test results from various companies and organizations are also presented for the important modes and tools.

Efficient Data Representation of Stereo Images Using Edge-based Mesh Optimization (윤곽선 기반 메쉬 최적화를 이용한 효율적인 스테레오 영상 데이터 표현)

  • Park, Il-Kwon;Byun, Hye-Ran
    • Journal of Broadcast Engineering / v.14 no.3 / pp.322-331 / 2009
  • This paper proposes an efficient data representation of stereo images using edge-based mesh optimization. Mesh-based two-dimensional warping for stereo images depends mainly on the performance of node selection and of disparity estimation at the selected nodes. The proposed method therefore first constructs a feature map consisting of both strong edges and object boundary lines for node selection, and then generates a grid-based mesh structure from the initial nodes. The displacement of each nodal position is estimated iteratively by minimizing the prediction error between the target image and the predicted image after two-dimensional warping of the local area. In general, iterative two-dimensional warping for optimizing nodal positions has high time complexity. To overcome this problem, we assume that the input stereo images contain only horizontal disparity and that the optimal nodal position lies on an edge, including object boundary lines. The proposed iterative warping method therefore searches for the optimal nodal position only on edge lines along horizontal lines. In the experiments, we compare the proposed method with other mesh-based methods in terms of quality, measured by peak signal-to-noise ratio (PSNR) as a function of the number of nodes. The computational complexity of optimal mesh generation is also estimated. The experimental results show that the proposed method provides an efficient stereo image representation, with both fast optimal mesh generation and reduced quality deterioration despite a small number of nodes.
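
The PSNR quality measure used in the comparison above follows the standard definition, sketched here for images stored as lists of rows. The function name and the default peak value of 255 (8-bit samples) are assumptions for this example.

```python
import math

def psnr(reference, test, peak=255.0):
    """PSNR in dB between two equally sized images given as lists of rows."""
    sq_err = 0.0
    count = 0
    for ref_row, test_row in zip(reference, test):
        for a, b in zip(ref_row, test_row):
            sq_err += (a - b) ** 2
            count += 1
    mse = sq_err / count  # mean squared error over all pixels
    if mse == 0:
        return math.inf   # identical images
    return 10.0 * math.log10(peak * peak / mse)
```

Plotting this value against the number of mesh nodes gives the kind of quality-versus-nodes comparison the abstract describes.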

Stereo Image Coding Using Zerotree (제로트리 기법을 이용한 스테레오 영상 부호화)

  • Bae, Jin-Woo;Shin, Choel;Yoo, Ji-Sang
    • The Journal of Korean Institute of Communications and Information Sciences / v.26 no.12A / pp.2092-2099 / 2001
  • In three-dimensional image systems using stereoscopic images, efficient coding schemes that remove the redundancy between the left and right images are usually used. In this paper, we propose an efficient coding method that exploits the relationship between a reference image and a residual image. In the proposed algorithm, the zerotree method, which guarantees good quality at low bit rates, is used to encode the residual image. The zerotree algorithm gives good coding performance but is computationally complex, so we use the ADLS method to reduce the time needed for disparity estimation. Computer simulations show that the wavelet-based zerotree method preserves high image quality within a limited bandwidth.
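
The relationship between the reference image and the residual image can be sketched on a single scanline as follows. This example uses an exhaustive sum-of-absolute-differences (SAD) search, not the ADLS method named in the abstract, and the function names, block size, search range, and the convention that `right[x]` corresponds to `left[x + d]` are all assumptions for illustration.

```python
def estimate_disparity(left_row, right_row, x, block=4, max_disp=8):
    """Return the shift d whose left block best matches the right block at x."""
    target = right_row[x:x + block]
    best_d, best_sad = 0, float("inf")
    for d in range(max_disp + 1):
        if x + d + block > len(left_row):
            break  # candidate block would run past the end of the row
        candidate = left_row[x + d:x + d + block]
        sad = sum(abs(a - b) for a, b in zip(target, candidate))
        if sad < best_sad:
            best_d, best_sad = d, sad
    return best_d

def residual_row(left_row, right_row, block=4, max_disp=8):
    """Disparity-compensated residual of the right row, predicted from the left."""
    residual = []
    # Only full blocks are processed; trailing pixels are skipped in this sketch.
    for x in range(0, len(right_row) - block + 1, block):
        d = estimate_disparity(left_row, right_row, x, block, max_disp)
        prediction = left_row[x + d:x + d + block]
        residual.extend(r - p for r, p in zip(right_row[x:x + block], prediction))
    return residual
```

When the disparity estimate is good, the residual is close to zero almost everywhere, which is what makes it cheap to encode with a zerotree coder.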

Stereoscopic Video Coding for Subway Accident Monitoring System (지하철 사고 감시를 위한 스테레오 비디오 부호화 기법)

  • Oh, Seh-Chan;Kim, Gil-Dong;Park, Sung-Hyuk
    • Proceedings of the KIEE Conference / 2005.10b / pp.484-486 / 2005
  • Passenger safety is a primary concern of railway systems, yet dozens of people are killed every year when they fall off train platforms, which makes it an urgent issue. Recently, advances in IT have enabled the application of vision sensors, such as CCTV and stereo camera sensors, to railway environments. In this paper, we propose a stereoscopic video coding scheme for a subway accident monitoring system. The proposed scheme is designed to provide flexible video to various displays, such as those of the control center, station employees, and train drivers. We use the MPEG-2 standard to code the left-view sequence and IBMDC to predict the P- and B-type frames of the right-view sequence. IBMDC predicts the matching block by interpolating the motion-predicted and disparity-predicted macroblocks. To provide an efficient stereoscopic video service, we define both temporally and spatially scalable layers for each eye's view using the concept of spatio-temporal scalability. Based on the experimental results, we expect the proposed functionalities to play a key role in establishing a highly flexible stereoscopic video codec for ubiquitous display environments where devices and network connections are heterogeneous.
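
The idea of interpolating a motion-predicted and a disparity-predicted macroblock can be sketched as below. The abstract does not give the interpolation weights, so the equal-weight average with upward rounding is an assumption, as are the function names and the list-of-rows block format.

```python
def interpolate_predictions(motion_block, disparity_block):
    """Average a motion-compensated and a disparity-compensated block."""
    # Assumption: equal weights, with +1 so that halves round upward.
    return [
        [(m + d + 1) // 2 for m, d in zip(m_row, d_row)]
        for m_row, d_row in zip(motion_block, disparity_block)
    ]

def prediction_residual(current_block, predicted_block):
    """Residual that would be transform-coded after prediction."""
    return [
        [c - p for c, p in zip(c_row, p_row)]
        for c_row, p_row in zip(current_block, predicted_block)
    ]
```

Here `motion_block` would come from an earlier right-view frame and `disparity_block` from the left-view frame at the same time instant; combining the two lets the predictor draw on both temporal and inter-view redundancy.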
