• Title/Summary/Keyword: H.264 Codec

Search Result 101, Processing Time 0.022 seconds

ROI-based Encoding using Face Detection and Tracking for mobile video telephony (얼굴 인식과 추적을 이용한 ROI 기반 영상 통화 코덱 설계 및 구현)

  • Lee, You-Sun;Kim, Chang-Hee;Na, Tae-Young;Lim, Jeong-Yeon;Joo, Young-Ho;Kim, Ki-Mun;Byun, Jae-Woan;Kim, Mun-Churl
    • Proceedings of the IEEK Conference
    • /
    • 2008.06a
    • /
    • pp.77-78
    • /
    • 2008
  • With advent of 3G mobile communication services, video telephony becomes one of the major services. However, due to a narrow channel bandwidth, the current video telephony services have not yet reached a satisfied level. In this paper, we propose an ROI (Region-Of-Interest) based improvement of visual quality for video telephony services with the H.264|MPEG-4 Part 10 (AVC: Advanced Video Coding) codec. To this end, we propose a face detection and tracking method to define ROI for the AVC codec based video telephony. Experiment results show that our proposed ROI based method allowed for improved visual quality in both objective and subjective perspectives.

  • PDF

High-Performance Architecture of 4×4/8×8 DCT and Quantization Circuit for Unified Video CODEC (통합 비디오 코덱을 위한 4×4/8×8 DCT와 양자화 회로의 고성능 구조)

  • Lee, Seon-Young;Cho, Kyeong-Soon
    • The KIPS Transactions:PartA
    • /
    • v.18A no.2
    • /
    • pp.39-44
    • /
    • 2011
  • This paper proposes the new high-performance circuit architecture of the transform and quantization for unified video CODEC. The proposed architecture can be applied to all kinds of transforms and quantizations for the video compression standards such as JPEG, MPEG-1/2/4, H.264 and VC-1. We defined the permutation matrices to reorder the transform matrix of the $8{\times}8$ DCT and partitioned the reordered $8{\times}8$ transform matrix into four $4{\times}4$ sub-matrices. The $8{\times}8$ DCT is performed by repeating the $4{\times}4$ DCT's based on the reordered and partitioned transform matrices. Since our circuit accepts the transform coefficients from the users, it can be extended very easily to cover any kind of DCT-based transforms for future standards. The multipliers in the DCT circuit are shared by the quantization circuit in order to minimize the circuit size. The quantization circuit is merged into the DCT circuit without any significant increase of circuit resources and processing time. We described the proposed DCT and quantization circuit at RTL, and verified its operation on FPGA board.

New Video Compression Method based on Low-complexity Interpolation Filter-bank (저 복잡도 보간 필터 뱅크 기반의 새로운 비디오 압축 방법)

  • Nam, Jung-Hak;Jo, Hyun-Ho;Sim, Dong-Gyu;Choi, Byeong-Doo;Cho, Dae-Sung
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.47 no.5
    • /
    • pp.165-174
    • /
    • 2010
  • The H.264/AVC standard obtained better performance than previous compression standards, but it also increased the computational complexity of CODEC simultaneously. Various techniques recently included at the KTA software developed by VCEG also were increasing its complexity. Especially adaptive interpolation filter has more complexity than two times due to development for coding efficiency. In this paper, we propose low-complexity filter bank to improve speed up of decoding and coding gain. We consists of filter bank of a fixed-simple filter for low-complexity and adaptive interpolation filter for high coding efficiency. Then we compensated using optimal filter at each macroblock-level or frame-level. Experimental results shows a similar coding efficiency compared to existing adaptive interpolation filter and decoding speed of approximately 12% of the entire decoder gained.

Performance improvement for Streaming of High Capacity Panoramic Video (대용량 파노라마 비디오 스트리밍의 성능개선)

  • Kim, Young-Back;Kim, Tae-Ho;Lee, Dae-Gyu;Kim, Jae-Joon
    • Journal of Internet Computing and Services
    • /
    • v.11 no.2
    • /
    • pp.143-153
    • /
    • 2010
  • When providing high quality panoramic video across the Internet, mobile communications, and broadcasting areas, it requires a suitable video codec that satisfies both high-compression efficiency and random access functionality. The users must have high-compression efficiency in order to enable video streaming of high-volume panoramic data. Random access allows the user to move the viewpoint and direction freely. In this paper, we propose the parallel processing scheme under cell units in order to improve the performance of streaming service for large screen panoramic video in 10Mbps bandwidths based on H.264/AVC with high compression rate. This improved algorithm divides a screen composed of cells less than $256{\times}256$ in size, encodes it, and decodes it with the cells in the present view. At this point, encoding/decoding is parallel processed by the present cell units. Also, since the cells only included in the present view are packed and transmitted, the possible processing of not extricating blocks is proven by experiment.

Adaptive Model-Based Quantization Parameter Decision for Video Rate Control (비디오 비트율 제어를 위한 적응적 모델 기반의 양자화 변수 결정 방법)

  • Kim, Seon-Ki;Ho, Yo-Sung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.32 no.4C
    • /
    • pp.411-417
    • /
    • 2007
  • The rate control is an essential component in video coding to provide better quality under given coding constraints, such as channel capacity, frame rates, etc. In general, source data cannot be described as a single distribution in a video coding, hence it can cause an exhaustive approximation problem. It drops a coding efficiency under weak channel environments, such as mobile communications. In this paper, we design a new quantization parameter decision model that is based on a rate-distortion function of generalized Gaussian distribution. In order to adaptively express various source data distribution, we decide a shape parameter by observing a ratio of samples, which have a small value. For experiment, the proposed algorithm is implemented into H.264/AVC video codec, and its performance is compared with that of MPEG-2 TM5, H.263 TMN8 rate control algorithm. As shown in simulation results, the proposed algorithm provides an improved quality rather than previous algorithms and generates the number of bits closed to the target bits.

Characteristic Analysis for Compression of Digital Hologram (디지털 홀로그램의 압축을 위한 특성 분석)

  • Kim, Jin-Kyum;Kim, Kyung-Jin;Kim, Woo-Suk;Lee, Yoon-Huck;Oh, Kwan-Jung;Kim, Jin-Woong;Kim, Dong-Wook;Seo, Young-Ho
    • Journal of Broadcast Engineering
    • /
    • v.24 no.1
    • /
    • pp.164-181
    • /
    • 2019
  • This paper introduces the analysis and development of digital holographic data codec technology to effectively compress hologram data. First, the generation method and data characteristics of the hologram standard data set provided by JPEG Pleno are introduced. We analyze energy compaction according to hologram generation method using discrete wavelet transform and discrete cosine transform. The quantization efficiency according to the hologram generation method is analyzed by applying uniform quantization and non-uniform quantization. We propose a transformation method quantization method suitable for hologram generation method through transform and quantization experiments. Finally, holograms are compressed using standard compression codecs such as JPEG, JPEG2000, AVC/H.264 and HEVC/H.265 and the results are analyzed.

Scalable Extension of HEVC for Flexible High-Quality Digital Video Content Services

  • Lee, Hahyun;Kang, Jung Won;Lee, Jinho;Choi, Jin Soo;Kim, Jinwoong;Sim, Donggyu
    • ETRI Journal
    • /
    • v.35 no.6
    • /
    • pp.990-1000
    • /
    • 2013
  • This paper describes the scalable extension of High Efficiency Video Coding (HEVC) to provide flexible high-quality digital video content services. The proposed scalable codec is designed on multi-loop decoding architecture to support inter-layer sample prediction and inter-layer motion parameter prediction. Inter-layer sample prediction is enabled by inserting the reconstructed picture of the reference layer (RL) into the decoded picture buffer of the enhancement layer (EL). To reduce the motion parameter redundancies between layers, the motion parameter of the RL is used as one of the candidates in merge mode and motion vector prediction in the EL. The proposed scalable extension can support scalabilities with minimum changes to the HEVC and provide average Bj${\o}$ntegaard delta bitrate gains of about 24% for spatial scalability and of about 21% for SNR scalability compared to simulcast coding with HEVC.

Panoramic-Stereopic Monitoring System for Tele-Operating Excavator by Using Center-viewpoint Corresponding Algorithm (중심시점 일치 알고리즘 기반 원격제어 굴삭기의 파노라믹 입체 모니터링 시스템)

  • Kim, Bong-Seok;Park, Chang-Woo;Jun, Se-Woong;Lee, Jong-Bae
    • Proceedings of the Computational Structural Engineering Institute Conference
    • /
    • 2007.04a
    • /
    • pp.74-77
    • /
    • 2007
  • In this paper, we propose a system which can display panoramic-stereopic image for tele-operating excavator. the system was implemented by using center-viewpoint corresponding algorithm and image mosaic process. We could obtain remarkable result from the survey among the ex-pert of excavator. We also design a control station with RF transfer module, H.264 codec module and suggested algorithm. Especially, our system shows a superb performance on cubic effect and presence sense of a field of excavating work.

  • PDF

Fast Edge Map Method And Edge Map Compression Using Edge Features (고속 Edge Map 생성 방법과 Edge 특성을 이용한 Edge Map 압축)

  • Kim, Do-Hyun;Kim, Yoon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2015.07a
    • /
    • pp.45-48
    • /
    • 2015
  • 오늘날 하드웨어의 발전으로 인해 영상 해상도는 FHD를 넘어 4K UHD 이상의 영상 해상도가 사용화되고 있다. 하지만 Edge Map을 만들기 위해 일반적으로 사용하는 함수들은 Convolution 함수 일종으로서 영상의 해상도가 높을수록 더 많은 Complexity를 요구한다. 또한 현재 주요 영상 압축 기술인 JPEG, H.264/AVC High efficiency video coding(HEVC)같은 기법들은 자연 영상을 중점으로 개발되어 있어 Edge map 압축에 있어 자연 영상만큼의 효율을 보여주지 못하고 있다. 본 논문은 원 영상을 Down Scaling한 뒤 이미지를 다시 원래 사이즈로 Up Scaling하여 두 영상의 차를 이용한 Edge Map을 생성하는 새로운 방법을 소개한다. 생성된 Edge Map의 특성인 Histogram 값의 분포가 0을 중심으로 Gaussian 분포를 가지는 것을 이용한 Zero Based 코덱을 제안한다. 제안된 알고리즘을 이용하여 고 해상도 영상에서도 빠르게 Edge Map을 생성하고 제안한 코덱을 통해 해당 Edge map을 압축한 결과 다른 압축 기술보다 더 뛰어난 성능을 보여주었다.

  • PDF

Linear Sub-band Decomposition based Pre-processing Algorithm for Perceptual Video Coding (지각적 동영상 부호화를 위한 선형 부 대역 분해 기반 전처리 기법)

  • Choi, Kwang Yeon;Song, Byung Cheol
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.54 no.1
    • /
    • pp.80-87
    • /
    • 2017
  • This paper proposes a pre-processing algorithm to improve perceptual video coding efficiency which decomposes an input frame via a sub-band decomposition, and suppresses only high frequency band(s) having low visual sensitivity. First, we decompose the input frame into several frequency subbands by a linear sub-band decomposition. Next, high frequency subband(s) which is rarely recognized by human visual system (HVS) is suppressed by applying relatively small gain(s). Finally, the high frequency suppressed frame is compressed by a specific video encoder. We can find from the experimental results that if comparing before-use and after-use of the proposed pre-processing prior to the encoder, no visual difference is shown. Also, the proposed algorithm achieves bit-saving of 13.12% on average in a H.264 video encoder.