• Title/Summary/Keyword: 2-D Coding

Search Result 564, Processing Time 0.024 seconds

A Temporal Error Concealment based on Motion Vector Recovery for H.264/AVC

  • Wu, Jun;Liu, Xingang;Yoo, Kook-Yeol
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2007.05a
    • /
    • pp.341-344
    • /
    • 2007
  • In this paper, a new temporal error concealment method for the new coding standard H.264/AVC is presented, which uses the high correlation between the motion vectors of neighboring blocks. By using the motion vector of neighboring MB of the lost MB, the MV of the lost MB are recovered. It is shown that under FMO coding method of H.264/AVC, the proposed method increases PSNR gain up to 2.85dB compared to build-in algorithm in the H.264/AVC test model and 2.59dB compared to Lagrange interpolation.

MMT based V3C data packetizing method (MMT 기반 V3C 데이터 패킷화 방안)

  • Moon, Hyeongjun;Kim, Yeonwoong;Park, Seonghwan;Nam, Kwijung;Kim, Kyuhyeon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2022.06a
    • /
    • pp.836-838
    • /
    • 2022
  • 3D Point Cloud는 3D 콘텐츠를 더욱 실감 나게 표현하기 위한 데이터 포맷이다. Point Cloud 데이터는 3차원 공간상에 존재하는 데이터로 기존의 2D 영상에 비해 거대한 용량을 가지고 있다. 최근 대용량 Point Cloud의 3D 데이터를 압축하기 위해 V-PCC(Video-based Point Cloud Compression)와 같은 다양한 방법이 제시되고 있다. 따라서 Point Cloud 데이터의 원활한 전송 및 저장을 위해서는 V-PCC와 같은 압축 기술이 요구된다. V-PCC는 Point Cloud의 데이터들을 Patch로써 뜯어내고 2D에 Projection 시켜 3D의 영상을 2D 형식으로 변환하고 2D로 변환된 Point Cloud 영상을 기존의 2D 압축 코덱을 활용하여 압축하는 기술이다. 이 V-PCC로 변환된 2D 영상은 기존 2D 영상을 전송하는 방식을 활용하여 네트워크 기반 전송이 가능하다. 본 논문에서는 V-PCC 방식으로 압축한 V3C 데이터를 방송망으로 전송 및 소비하기 위해 MPEG Media Transport(MMT) Packet을 만드는 패킷화 방안을 제안한다. 또한 Server와 Client에서 주고받은 V3C(Visual Volumetric Video Coding) 데이터의 비트스트림을 비교하여 검증한다.

  • PDF

Performance Analysis of QPSK and QDPSK Signals with Diversity Reception and Coding Techniques in Fading Plus Impulsive Noise Environments (임펄스 잡음과 페이딩이 함께 존재하는 환경에서 다이버시티 수신 기법과 부호화 기법을 채용하는 QPSK 및 QDPSK 신호의 성능 해석)

  • Leem, Kill-Yong;Cho, Sung-Joon;Lee, Jin
    • The Proceeding of the Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.5 no.4
    • /
    • pp.3-17
    • /
    • 1994
  • The error probability of QPSK and DPSK signals with diversity reception technique in m-distribution fading plus impulsive noise environments has been derived and the error probability is evaluated and compared with that in Gaussian noise environment. The error performance degrades as impulsive noise becomes strong and degree of degradation of signal performance in QDPSK signal is larger than that in QPSK signal. The diversity reception technique can improve the error performance not only in fading plus Gaussian noise environment but in fading plus impulsive noise environment. When diversity reception technique is used, the improvement of error performance attains about 10dB to 15dB in terms of CNR as compared with that in non diversity reception. Among diversity techniques the maximal ratio combining is must effective. When diversity reception and coding techniques are used together in impulsive noise plus Rayleigh fading environments, the improvement of error performance attains about 12dB to 15dB in terms of CNR as compared with that of only diversity reception technique case and the improvement of error perform- ance in RS coding attains about 2dB in terms of CNR as compared with that of BCH coding case.

  • PDF

EFFICIENT MULTIVIEW VIDEO CODING BY OBJECT SEGMENTATION

  • Boonthep, Narasak;Chiracharit, Werapon;Chamnongthai, Kosin;Ho, Yo-Sung
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.01a
    • /
    • pp.294-297
    • /
    • 2009
  • Multi-view video consists of a set of multiple video sequences from multiple viewpoints or view directions in the same scene. It contains extremely a large amount of data and some extra information to be stored or transmitted to the user. This paper presents inter-view correlations among video objects and the background to reduce the prediction complexity while achieving a high coding efficiency in multi-view video coding. Our proposed algorism is based on object-based segmentation scheme that utilizes video object information obtained from the coded base view. This set of data help us to predict disparity vectors and motion vectors in enhancement views by employing object registration, which leads to high compression and low-complexity coding scheme for enhancement views. An experimental results show that the superiority can provide an improvement of PSNR gain 2.5.3 dB compared to the simulcast.

  • PDF

3D-Distortion Based Rate Distortion Optimization for Video-Based Point Cloud Compression

  • Yihao Fu;Liquan Shen;Tianyi Chen
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.17 no.2
    • /
    • pp.435-449
    • /
    • 2023
  • The state-of-the-art video-based point cloud compression(V-PCC) has a high efficiency of compressing 3D point cloud by projecting points onto 2D images. These images are then padded and compressed by High-Efficiency Video Coding(HEVC). Pixels in padded 2D images are classified into three groups including origin pixels, padded pixels and unoccupied pixels. Origin pixels are generated from projection of 3D point cloud. Padded pixels and unoccupied pixels are generated by copying values from origin pixels during image padding. For padded pixels, they are reconstructed to 3D space during geometry reconstruction as well as origin pixels. For unoccupied pixels, they are not reconstructed. The rate distortion optimization(RDO) used in HEVC is mainly aimed at keeping the balance between video distortion and video bitrates. However, traditional RDO is unreliable for padded pixels and unoccupied pixels, which leads to significant waste of bits in geometry reconstruction. In this paper, we propose a new RDO scheme which takes 3D-Distortion into account instead of traditional video distortion for padded pixels and unoccupied pixels. Firstly, these pixels are classified based on the occupancy map. Secondly, different strategies are applied to these pixels to calculate their 3D-Distortions. Finally, the obtained 3D-Distortions replace the sum square error(SSE) during the full RDO process in intra prediction and inter prediction. The proposed method is applied to geometry frames. Experimental results show that the proposed algorithm achieves an average of 31.41% and 6.14% bitrate saving for D1 metric in Random Access setting and All Intra setting on geometry videos compared with V-PCC anchor.

Spatially Scalable Kronecker Compressive Sensing of Still Images (공간 스케일러블 Kronecker 정지영상 압축 센싱)

  • Nguyen, Canh Thuong;Jeon, Byeungwoo
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.52 no.10
    • /
    • pp.118-128
    • /
    • 2015
  • Compressive sensing (CS) has to face with two challenges of computational complexity reconstruction and low coding efficiency. As a solution, this paper presents a novel spatially scalable Kronecker two layer compressive sensing framework which facilitates reconstruction up to three spatial resolutions as well as much improved CS coding performance. We propose a dual-resolution sensing matrix based on the quincunx sampling grid which is applied to the base layer. This sensing matrix can provide a fast-preview of low resolution image at encoder side which is utilized for predictive coding. The enhancement layer is encoded as the residual measurement between the acquired measurement and predicted measurement data. The low resolution reconstruction is obtained from the base layer only while the high resolution image is jointly reconstructed using both two layers. Experimental results validate that the proposed scheme outperforms both conventional single layer and previous multi-resolution schemes especially at high bitrate like 2.0 bpp by 5.75dB and 5.05dB PSNR gain on average, respectively.

A Study on the Multiresolutional Coding Based on Spline Wavelet Transform (스플라인 웨이브렛 변환을 이용한 영상의 다해상도 부호화에 관한 연구)

  • 김인겸;정준용;유충일;이광기;박규태
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.19 no.12
    • /
    • pp.2313-2327
    • /
    • 1994
  • As the communication environment evolves, there is an increasing need for multiresolution image coding. To meet this need, the entrophy constratined vector quantizer(ECVQ) for coding of image pyramids by spline wavelet transform is introduced in this paper. This paper proposes a new scheme for image compression taking into account psychovisual feature both in the space and frequency domains : this proposed method involves two steps. First we use spline wavelet transform in order to obtain a set of biorthogonal subclasses of images ; the original image is decomposed at different scale using a pyramidal algorithm architecture. The decomposition is along the vertical and horizontal directions and maintains constant the number of pixels required the image. Second, according to Shannon's rate distortion theory, the wavelet coefficients are vectored quantized using a multi-resolution ECVQ(entropy-constrained vector quantizer) codebook. The simulation results showed that the proposed method could achieve higher quality LENA image improved by about 2.0 dB than that of the ECVQ using other wavelet at 0.5 bpp and, by about 0.5 dB at 1.0 bpp, and reduce the block effect and the edge degradation.

  • PDF

MPEG Surround for Multi-Channel Audio Coding-Part 2: Various Modes and Tools (다채널 오디오 코딩을 위한 MPEG Surround-2부: 다양한 모드 및 툴들)

  • Pang, Hee-Suk
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.7
    • /
    • pp.610-617
    • /
    • 2009
  • An overview of various modes and tools of MPEG Surround is provided Because the binaural mode of MPEG Surround supports the virtual 5.1-channel playback based on HRTFs, it can be played via headphones and earphones for portable audio devices. MPEG Surround also supports the enhanced matrix mode which converts stereo signals to 5.1-channel signals without side information, the 3D stereo mode which deals with 3D-coded signals, the low power version which greatly reduces the computational load in the decoding process. Besides, MPEG Surround provides the arbitrary downmix gains (ADGs) tool which is applied to artistic downmix signals, the matrix compatibility tool which is applied to downmix signals by conventional matrix-based methods, the residual coding tool -which can be used at high bit rates, and the GES tool which is applied to specific sound such as applause. The listening test results by various companies and organizations are also presented for important modes and tools.

A 3D Wavelet Coding Scheme for Light-weight Video Codec (경량 비디오 코덱을 위한 3D 웨이블릿 코딩 기법)

  • Lee, Seung-Won;Kim, Sung-Min;Park, Seong-Ho;Chung, Ki-Dong
    • The KIPS Transactions:PartB
    • /
    • v.11B no.2
    • /
    • pp.177-186
    • /
    • 2004
  • It is a weak point of the motion estimation technique for video compression that the predicted video encoding algorithm requires higher-order computational complexity. To reduce the computational complexity of encoding algorithms, researchers introduced techniques such as 3D-WT that don't require motion prediction. One of the weakest points of previous 3D-WT studies is that they require too much memory for encoding and too long delay for decoding. In this paper, we propose a technique called `FS (Fast playable and Scalable) 3D-WT' This technique uses a modified Haar wavelet transform algorithm and employs improved encoding algorithm for lower memory and shorter delay requirement. We have executed some tests to compare performance of FS 3D-WT and 3D-V. FS 3D-WT has exhibited the same high compression rate and the same short processing delay as 3D-V has.

Performance Analysis of MFSK Signal using Reed-Solomon / Convolutional Concatenated Coding and MRC Diversity Techniques in m-distributed Fading Environment (m-분포 페이딩 환경에서 Reed-Solomon/컨벌루션 연접 부호화 기법과 MRC 다이버시티 기법을 함께 이용하는 MFSK 신호의 성능 해석)

  • 이희덕;강희조;조성준
    • The Proceeding of the Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.5 no.2
    • /
    • pp.10-19
    • /
    • 1994
  • The error rate equation of Reed-Solomon/Convoutional concatenated coded MFSK signal transmitted over m-distributed fading channel with Additive White Gaussian Noise (AWGN) and re- ceived with Maximal Ratio Combining (MRC) diversity has been derived. The bit error probability has been evaluated using the derived equation and shown n figures as a function of signal to noise ratio, fading index and the number of diversity branches. From the results obtained, we have shown that the bit error probability of MFSK signal is improved by using coding technique in fading environment. The concatenated coding technique is found to be very effective. When concatenated coding and MRC diversity reception techniques are used together in fading environ- ment, the improvement of error performance attains about 6.6 dB in terms of SNR as compared with that of employing only concatenated coding case.

  • PDF