• Title/Summary/Keyword: Video Distortion


Motion Vector Resolution Decision Algorithm based on Neural Network for Fast VVC Encoding (고속 VVC 부호화를 위한 신경망 기반 움직임 벡터 해상도 결정 알고리즘)

  • Baek, Han-gyul;Park, Sang-hyo
    • Journal of Broadcast Engineering
    • /
    • v.26 no.5
    • /
    • pp.652-655
    • /
    • 2021
  • Among the inter prediction techniques of Versatile Video Coding (VVC), adaptive motion vector resolution (AMVR) has been adopted. However, AMVR requires various MVs to be tested for each coding unit, and each test needs a rate-distortion cost computation, which increases encoding complexity. To reduce the encoding complexity of AMVR, the optimal AMVR mode must therefore be found efficiently. In this paper, we propose a lightweight neural network-based AMVR decision algorithm based on more diverse datasets.

Image processing of pseudo rate-distortion function based on MSSSIM and KL-Divergence, using multiple video processing filters for video compression. (MSSIM 및 KL Divergence 기반 의사 율-왜곡 평가 함수와 복수개의 영상처리 필터를 이용한 동영상 전처리 방법)

  • Seok, Jinwuk;Cho, Seunghyun;Kim, Hui Yong;Choi, Jin Soo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2018.06a
    • /
    • pp.169-172
    • /
    • 2018
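  • In this paper, to reduce the compressed bit count efficiently while preserving video quality as much as possible, we apply multiple image processing filters selectively to the blocks of an image and propose an image processing evaluation function, based on MSSSIM and KL divergence, for selecting among those filters. In video compression, the trade-off between quality and bit savings differs with the features of the image, so a single-purpose image processing filter can hardly preserve quality and reduce the bit count at the same time. Accordingly, to reduce the bit count while preserving subjective quality as much as possible, we use MSSSIM (Multi-scale Structural Similarity) as the subjective quality measure and KL divergence as the bit-count measure, and propose a method for combining the two measures into a single one. We also confirmed that, when image processing filters with different characteristics are used as pre-processing filters under the proposed measure, video can be compressed so that the bit savings are retained while subjective quality is preserved as much as possible.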

Development of Projector Calibration Mapping Technology for Physically Interactive Sport Game (체감형 스포츠 게임을 위한 프로젝터 캘리브레이션 매핑 기술 개발)

  • Gil, Young-Ik;Seo, Hye-Ran;Lee, Hyeon-Ju;Ko, Il-Ju
    • Journal of Korea Game Society
    • /
    • v.19 no.5
    • /
    • pp.39-52
    • /
    • 2019
  • Recently, many studies have been conducted on interactive games that increase content immersion through visual stimuli. Among the techniques studied, multi-plane image technology is suitable for projecting video over wide views, but it is difficult to perform the calibration and mapping tasks for each view separately. In this paper, we developed a calibration mapping technique and applied it to a field archery game space built in the form of a CAVE. As a result, we maintained the resolution and confirmed that the space can be presented through video without distortion. We expect the technique to be applicable to fields that require a sense of presence, such as training content.

A Fast Inter-layer Mode Decision Method in Scalable Video Coding (공간적 스케일러블 비디오 부호화에서 계층간 모드 고속 결정 방법)

  • Lee, Bum-Shik;Hahm, Sang-Jin;Park, Chang-Seob;Park, Keun-Soo;Kim, Mun-Churl
    • Journal of Broadcast Engineering
    • /
    • v.12 no.4
    • /
    • pp.360-372
    • /
    • 2007
  • We propose a fast inter-layer mode decision method that uses the coding information of the base layer for its enhancement layer in scalable video coding (SVC), also called MPEG-4 Part 10 Advanced Video Coding Amendment 3 or H.264 Scalable Extension (SE), which is being standardized. In this method, when the motion vectors from the base layer have zero motion (0, 0) in inter-layer motion prediction, or when the integer transform coefficients of the residual between the current MB and the MB motion-compensated by the predicted motion vectors from the base layer are all zero, the block mode of the corresponding block to be encoded at the enhancement layer is determined to be the $16{\times}16$ mode. In addition, if the predicted mode of the MB to be encoded at the enhancement layer is not the $16{\times}16$ mode, rate-distortion optimization is performed only on a reduced set of candidate modes, namely those with the same or smaller partitions. The proposed method reduces encoding time by up to 72%, with negligible PSNR degradation of up to 0.25dB and a bit rate increase of up to 1.73%.
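
    The decision rule described in this abstract can be sketched as below. This is a minimal illustration, not the authors' implementation: the function name `candidate_modes`, the mode list, and the reading of "same or smaller partitioned modes" as "partition area no larger than that of the predicted mode" are all assumptions made for the example.

```python
# Hypothetical inter block modes with their partition areas in pixels.
# The mode set and the area-based ordering are assumptions for illustration.
MODE_AREAS = {"16x16": 256, "16x8": 128, "8x16": 128, "8x8": 64}


def candidate_modes(base_layer_mv, residual_coeffs, predicted_mode):
    """Return the enhancement-layer modes that still need an RD check."""
    zero_motion = tuple(base_layer_mv) == (0, 0)
    zero_residual = all(c == 0 for c in residual_coeffs)

    # Early decision: zero base-layer motion or an all-zero residual
    # selects the 16x16 mode without any further search.
    if zero_motion or zero_residual:
        return ["16x16"]

    # Otherwise, when the predicted mode is not 16x16, keep only modes
    # whose partitions are the same size or smaller.
    if predicted_mode != "16x16":
        limit = MODE_AREAS[predicted_mode]
        return [m for m, area in MODE_AREAS.items() if area <= limit]

    # Predicted mode is 16x16: the abstract states no reduction,
    # so this sketch falls back to the full mode list.
    return list(MODE_AREAS)
```

    For example, `candidate_modes((1, 0), [3, 0], "16x8")` leaves three modes for rate-distortion optimization instead of four, while a zero motion vector skips the search entirely.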

An Early Termination Algorithm of Prediction Unit (PU) Search for Fast HEVC Encoding (HEVC 고속 부호화를 위한 PU 탐색 조기 종료 기법)

  • Kim, Jae-Wook;Kim, Dong-Hyun;Kim, Jae-Gon
    • Journal of Broadcast Engineering
    • /
    • v.19 no.5
    • /
    • pp.627-630
    • /
    • 2014
  • The latest video coding standard, high efficiency video coding (HEVC), achieves high coding efficiency by employing a quadtree-based coding unit (CU) block partitioning structure that allows recursive splitting into four equally sized blocks. At each depth level, each CU is partitioned into variable-sized blocks called prediction units (PUs). However, determining the best CU partition for each coding tree unit (CTU) and the best PU mode for each CU causes a dramatic increase in computational complexity. To reduce this complexity, we propose a fast PU decision algorithm that terminates the PU search early. The proposed method skips the computation of the rate-distortion (RD) cost for certain PU modes in the current CU based on the best mode and the RD cost of the upper depth CU. Experimental results show that the proposed method reduces the computational complexity of HM12.0 to 18.1% with only a 0.2% increase in BD-rate.

Performance Evaluation of Bit Error Resilience for Pixel-domain Wyner-Ziv Video Codec with Frame Difference Residual Signal (화면 간 차이 신호에 대한 화소 영역 위너-지브 비디오 코덱의 비트 에러 내성 성능 평가)

  • Kim, Jin-Soo
    • The Journal of the Korea Contents Association
    • /
    • v.12 no.8
    • /
    • pp.20-28
    • /
    • 2012
  • Distributed video coding (DVC) is a new paradigm based on the Slepian-Wolf and Wyner-Ziv theorems. DVC offers not only flexible partitioning of complexity between the encoder and decoder, but also robustness to channel errors owing to its intrinsic joint source-channel coding. Much previous research has focused on lightweight video encoders and on improving their rate-distortion performance. In this paper, by contrast, we propose a new DVC codec that is well suited to error-prone environments. The proposed method adopts a quantiser without a dead zone and a Gray code that is symmetric around zero. Through computer simulations, the proposed method is evaluated with respect to the position of bit errors as well as the number of burst bit errors. Additionally, it is shown that the maximum and minimum transmission rates for a given application can be determined linearly from the number of bit errors.

Improved Side Information Generation using Field Coding for Wyner-Ziv Codec (Wyner-Ziv 부호화기를 위한 필드 부호화 기반 개선된 보조정보 생성)

  • Han, Chan-Hee;Jeon, Yeong-Il;Lee, Si-Woong
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.11
    • /
    • pp.10-17
    • /
    • 2009
  • Wyner-Ziv video coding is a new video compression paradigm based on the distributed source coding theory of Slepian-Wolf and Wyner-Ziv. Wyner-Ziv coding enables a light-encoder/heavy-decoder structure by shifting complex modules, including the motion estimation/compensation task, to the decoder. Instead of the encoder performing the complicated motion estimation process, the Wyner-Ziv decoder performs motion estimation to generate side information, which serves as the predicted signal of the Wyner-Ziv frame. The efficiency of side information generation strongly affects overall coding performance, since the bit rate of Wyner-Ziv coding depends directly on the side information. In this paper, an improved side information generation method using field coding is proposed. In the proposed method, top fields are coded with the existing SI generation method and bottom fields are coded with a new SI generation method that uses the information of the top fields. Simulation results show that the proposed method improves the quality of the side information and the rate-distortion performance compared to the conventional method.

Rate-Constrained Key Frame Selection Method using Iteration (반복 과정을 통한 율-제한 주요 화명 선택 기법)

  • Lee, Hun-Cheol;Kim, Seong-Dae
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.4
    • /
    • pp.388-398
    • /
    • 2002
  • Video representation through representative frames (key frames) has frequently been addressed as an efficient way of preserving the overall temporal information of a sequence with a considerably smaller amount of data. Such a compact video representation is suitable for video browsing in environments with limited storage or transmission bandwidth. In such cases, the ability to control the total number of key frames (i.e., the key frame rate) according to the storage or bandwidth capacity is an important requirement of a key frame selection method. In this paper, we present a sequential key frame selection method for the case where the number of key frames is given as a constraint. It first selects the desired number of initial key frames and determines the non-overlapping initial time intervals that each key frame represents. It then iteratively adjusts the positions of the key frames and the time intervals so as to minimize the distortion. Experimental results demonstrate the improved performance of our algorithm over existing approaches.
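
    The two-step iteration in this abstract (pick initial key frames and intervals, then alternately adjust both to lower the distortion) resembles the following sketch. The abstract does not give the distortion measure or the update rules, so everything here is an assumption: the function name `select_key_frames`, the use of one scalar feature per frame, the absolute-difference distortion, and the uniform initialization.

```python
def select_key_frames(features, k, max_iter=20):
    """Pick k key frames and the time intervals they represent.

    features: one scalar per frame (a stand-in for a real frame descriptor).
    Returns (keys, bounds); segment i covers frames bounds[i]..bounds[i+1]-1.
    Assumes len(features) >= k so that no interval is empty.
    """
    n = len(features)

    def dist(a, b):
        # Assumed distortion between frame a and frame b.
        return abs(features[a] - features[b])

    # Initial solution: k equal-length intervals, key frame in the middle.
    bounds = [i * n // k for i in range(k + 1)]
    keys = [(bounds[i] + bounds[i + 1] - 1) // 2 for i in range(k)]

    for _ in range(max_iter):
        previous = (list(bounds), list(keys))

        # Step 1: within each interval, move the key frame to the frame
        # that minimizes the total distortion of that interval.
        for i in range(k):
            seg = range(bounds[i], bounds[i + 1])
            keys[i] = min(seg, key=lambda c: sum(dist(c, f) for f in seg))

        # Step 2: move each boundary between two neighbouring key frames
        # to the split point that minimizes the distortion between them.
        for i in range(k - 1):
            lo, hi = keys[i], keys[i + 1]
            bounds[i + 1] = min(
                range(lo + 1, hi + 1),
                key=lambda b: sum(dist(lo, f) for f in range(lo + 1, b))
                + sum(dist(hi, f) for f in range(b, hi)),
            )

        if (bounds, keys) == previous:
            break  # neither key frames nor intervals changed
    return keys, bounds
```

    On the toy input `[0, 0, 0, 0, 10, 10]` with `k=2`, the boundary moves from the uniform split at frame 3 to frame 4, where the content actually changes.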

Effects of LDPCA Frame Size for Parity Bit Estimation Methods in Fast Distributed Video Decoding Scheme (고속 분산 비디오 복호화 기법에서 패리티 비트 예측방식에 대한 LDPCA 프레임 크기 효과)

  • Kim, Man-Jae;Kim, Jin-Soo
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.8
    • /
    • pp.1675-1685
    • /
    • 2012
  • Distributed video coding (DVC) plays an essential role in providing low-complexity video encoders. However, to achieve better rate-distortion performance, most DVC systems need a feedback channel for parity bit control. This gives DVC-based systems a high decoding latency, which is one of the most critical problems to overcome for a real implementation. To overcome this problem and accelerate the commercialization of DVC applications, this paper analyzes the effect of the LDPCA frame size on adaptive LDPCA frame-based parity bit request estimation. First, this paper presents the LDPCA segmentation method in the pixel domain and explains the temporal-based and spatial-based bit request estimation methods, which use the statistical characteristics between adjacent LDPCA frames. Computer simulations show that better performance and faster decoding are observed, especially when the LDPCA frame size is 3168 at QCIF resolution.

Spatial and Temporal Resolution Selection for Bit Stream Extraction in H.264 Scalable Video Coding (H.264 SVC에서 비트 스트림 추출을 위한 공간과 시간 해상도 선택 기법)

  • Kim, Nam-Yun;Hwang, Ho-Young
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.1
    • /
    • pp.102-110
    • /
    • 2010
  • H.264 SVC (Scalable Video Coding) provides the advantages of a low disk storage requirement and high scalability. However, a streaming server or a user terminal has to extract a bit stream from the SVC file. This paper proposes a bit stream extraction method that obtains the maximum PSNR value while the data bit rate does not exceed the available network bandwidth. To do this, the method obtains offline the information about the extraction points that yield the maximum PSNR value and decides the spatial/temporal resolution of the bit stream at run time. This resolution information, along with the available network bandwidth, is passed as parameters to a bit stream extractor. Through experiments with the JSVM reference software, we show that the proposed bit stream extraction method achieves a higher PSNR value.
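
    The run-time selection step in this abstract amounts to a constrained lookup over a table measured offline. The sketch below is only an illustration under assumptions: the `ExtractionPoint` record, its field names, and the function `choose_extraction_point` are hypothetical and are not part of JSVM or of the paper.

```python
from typing import NamedTuple, Optional, Sequence


class ExtractionPoint(NamedTuple):
    """One offline-measured operating point (hypothetical record layout)."""
    spatial: str          # spatial resolution label, e.g. "QCIF" or "CIF"
    frame_rate: float     # temporal resolution in frames per second
    bit_rate_kbps: float  # bit rate of the extracted sub-stream
    psnr_db: float        # PSNR measured offline for this point


def choose_extraction_point(
    points: Sequence[ExtractionPoint], bandwidth_kbps: float
) -> Optional[ExtractionPoint]:
    """Return the highest-PSNR point whose bit rate fits the bandwidth."""
    feasible = [p for p in points if p.bit_rate_kbps <= bandwidth_kbps]
    if not feasible:
        return None  # nothing fits; the caller must handle this case
    return max(feasible, key=lambda p: p.psnr_db)
```

    The spatial and temporal resolution of the returned point would then be handed to the bit stream extractor together with the available bandwidth.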