• Title/Summary/Keyword: video compression.

Search Result 778, Processing Time 0.027 seconds

Evaluation of Video Codec AI-based Multiple tasks (인공지능 기반 멀티태스크를 위한 비디오 코덱의 성능평가 방법)

  • Kim, Shin;Lee, Yegi;Yoon, Kyoungro;Choo, Hyon-Gon;Lim, Hanshin;Seo, Jeongil
    • Journal of Broadcast Engineering
    • /
    • v.27 no.3
    • /
    • pp.273-282
    • /
    • 2022
  • MPEG-VCM(Video Coding for Machine) aims to standardize video codec for machines. VCM provides data sets and anchors, which provide reference data for comparison, for several machine vision tasks including object detection, object segmentation, and object tracking. The evaluation template can be used to compare compression and machine vision task performance between anchor data and various proposed video codecs. However, performance comparison is carried out separately for each machine vision task, and information related to performance evaluation of multiple machine vision tasks on a single bitstream is not provided currently. In this paper, we propose a performance evaluation method of a video codec for AI-based multi-tasks. Based on bits per pixel (BPP), which is the measure of a single bitstream size, and mean average precision(mAP), which is the accuracy measure of each task, we define three criteria for multi-task performance evaluation such as arithmetic average, weighted average, and harmonic average, and to calculate the multi-tasks performance results based on the mAP values. In addition, as the dynamic range of mAP may very different from task to task, performance results for multi-tasks are calculated and evaluated based on the normalized mAP in order to prevent a problem that would be happened because of the dynamic range.

An Orthogonal Approximate DCT for Fast Image Compression (고속 영상 압축을 위한 근사 이산 코사인 변환)

  • Kim, Seehyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.19 no.10
    • /
    • pp.2403-2408
    • /
    • 2015
  • For image data the discrete cosine transform (DCT) has comparable energy compaction capability to Karhunen-Loeve transform (KLT) which is optimal. Hence DCT has been widely accepted in various image and video compression standard such as JPEG, MPEG-2, and MPEG-4. Recently some approximate DCT's have been reported, which can be computed much faster than the original DCT because their coefficients are either zero or the power of 2. Although the level of energy compaction is slightly degraded, the approximate DCT's can be utilized in real time implementation of image or visual compression applications. In this paper, an approximate 8-point DCT which contains 17 non-zero power-of-2 coefficients and high energy compaction capability comparable to DCT is proposed. Transform coding experiments with several images show that the proposed transform outperforms the published works.

A Study on Implementation of the Fast Motion Estimation (고속 움직임 예측기 구현에 관한 연구)

  • Kim, Jin-Yean;Park, Sang-Bong;Jin, Hyun-Jun;Park, Nho-Kyung
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.27 no.1C
    • /
    • pp.69-77
    • /
    • 2002
  • Sine digital signal processing for motion pictures requires huge amount of data computation to store, manipulate and transmit, more effective data compression is necessary. Therefore, the ITU-T recommended H.26x as data compression standards for digital motion pictures. The data compression method that eliminates time redundancies by motion estimation using relationship between picture frames has been widely used. Most video conding systems employ block matching algorithm for the motion estimation and compensation, and the algorithm is based on the minimun value of cast functions. Therefore, fast search algorithm rather than full search algorithm is more effective in real time low data rates encodings such as H.26x. In this paper, motion estimation employing the Nearest-Neighbors algorithm is designed to reduce search time using FPGA, coded in VHDL, and simulated and verified using Xilink Foundation.

A Study on Applications of Wavelet Bases for Efficient Image Compression (효과적인 영상 압축을 위한 웨이브렛 기저들의 응용에 관한 연구)

  • Jee, Innho
    • Journal of IKEEE
    • /
    • v.21 no.1
    • /
    • pp.39-45
    • /
    • 2017
  • Image compression is now essential for applications such as transmission and storage in data bases. For video and digital image applications the use of long tap filters, while not providing any significant coding gain, may increase the hardware complexity. We use a wavelet transform in order to obtain a set of bi-orthogonal sub-classes of images; First, the design of short kernel symmetric analysis is presented in 1-dimensional case. Second, the original image is decomposed at different scales using a subband filter banks. Third, this paper is presented a technique for obtaining 2-dimensional bi-orthogonal filters using McClellan transform. It is shown that suggested wavelet bases is well used on wavelet transform for image compression. From performance comparison of bi-orthogonal filter, we actually use filters close to ortho-normal filters on application of wavelet bases to image analysis.

An experimental study on triaxial failure mechanical behavior of jointed specimens with different JRC

  • Tian, Wen-Ling;Yang, Sheng-Qi;Dong, Jin-Peng;Cheng, Jian-Long;Lu, Jia-wei
    • Geomechanics and Engineering
    • /
    • v.28 no.2
    • /
    • pp.181-195
    • /
    • 2022
  • Roughness and joint inclination angle are the important factors that affect the strength and deformation characteristics of jointed rock mass. In this paper, 3D printer has been employed to make molds firstly, and casting the jointed specimens with different joint roughness coefficient (JRC), and different joint inclination angle (α). Conventional triaxial compression tests were carried out on the jointed specimens, and the influence of JRC on the strength and deformation parameters was analyzed. At the same time, acoustic emission (AE) testing system has been adopted to reveal the AE characteristic of the jointed specimens in the process of triaxial compression. Finally, the morphological of the joint surface was observed by digital three-dimensional video microscopy system, and the relationship between the peak strength and JRC under different confining pressures has been discussed. The results indicate that the existence of joint results in a significant reduction in the strength of the joint specimen, JRC also has great influence on the morphology, quantity and spatial distribution characteristics of cracks. With the increase of JRC, the triaxial compressive strength increase, and the specimen will change from brittle failure to ductile failure.

Fast Intra Prediction Mode Decision based on Rough Mode Decision and Most Probable Mode in HEVC (Rough Mode Decision과 Most Probable Mode에 기반을 둔 HEVC 고속 인트라 예측 모드 결정 방법)

  • Lee, Seung-Ho;Park, Sang-Hyo;Jang, Euee Seon
    • Journal of Broadcast Engineering
    • /
    • v.19 no.2
    • /
    • pp.158-165
    • /
    • 2014
  • High Efficiency Video Coding (HEVC), the latest video coding standard, has twice of the compression efficiency compared to AVC/H.264 under the same image quality condition. To obtain the improved efficiency, however, it was adopted for many methods which need complicated calculation, and the time complexity of HEVC was increased more than that of AVC/H.264. To solve this problem, the various fast algorithms have been researched. In this paper, we propose a fast intra prediction mode decision method which uses result of Rough Mode Decision (RMD) and Most Probable Mode (MPM). The proposed method selects a best predicted mode by comparing each predicted directions which are calculated through RMD and MPM. We applied the proposed method to HM 10.0 and conducted an comparing experiment in All-Intra environment. The experiment result showed that total encoding time is reduced by about 26% on average with about a 0.8% loss of BD-rate.

A study of scene change detection in HEVC bit stream (HEVC 비트 스트림 상에서의 장면전환 검출 기법 연구)

  • Eom, Yumie;Yoo, Sung-Geun;Yoon, So-Jeong
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2014.06a
    • /
    • pp.258-261
    • /
    • 2014
  • The era of realistic broadcast with high fidelity has come after the wide-spread distribution of UHD display and the transmission of UHD experimental broadcast in CATV. However, UHD broadcast now has constraint because it requires much amount of bandwidth and data in broadcasting transmission and production system. Not only HEVC(High Efficiency Video Codec) which has more than two times higher compression rate but also cloud-based editing system would be the key to solve the problems above. Also, fast scene change detection of videos is needed to index and search UHD videos smoothly. In this paper, therefore, a method is proposed to index and search the scene change information of large volume UHD videos compressed with high-efficiency codec. Application usages of fast detection of scene change information in various UHD video environments are considered by using this algorithm.

  • PDF

Status of Profiles and Levels for JVT Video Coding Standard (JVT 동영상 국제표준 프로파일/레벨 동향)

  • 김해광;이상윤
    • Broadcasting and Media Magazine
    • /
    • v.7 no.3
    • /
    • pp.12-18
    • /
    • 2002
  • JVT is an international video coding standard that is being developed jointly by VCEG of ITU and MPEG of ISO. The standardization efforts are targeted mainly for a very high compression ratio. JVT is a general video coding technology that may be used in various application fields. JVT began to work seriously on the profiles and levels issues since Geneva meeting, January 2002. Profiles are sub sets of technical tools from the entire tools and levels limit processing power and memory resources of a decoder As of now, three profiles of Baseline, Main and X (not defined name yet) and hierarchically structured levels are defined in JVT FCD. The profiling issue is very important for the JVT s initial objective of Baseline royalty free policy. Royalty free Baseline profiling is currently under practical hurdles and this issue may impact as one of critical factors on the success of JVT standard.

Improved Side Information Generation using Field Coding for Wyner-Ziv Codec (Wyner-Ziv 부호화기를 위한 필드 부호화 기반 개선된 보조정보 생성)

  • Han, Chan-Hee;Jeon, Yeong-Il;Lee, Si-Woong
    • The Journal of the Korea Contents Association
    • /
    • v.9 no.11
    • /
    • pp.10-17
    • /
    • 2009
  • Wyner-Ziv video coding is a new video compression paradigm based on distributed source coding theory of Slepian-Wolf and Wyner-Ziv. Wyner-Ziv coding enables light-encoder/heavy-decoder structure by shifting complex modules including motion estimation/compensation task to the decoder. Instead of performing the complicated motion estimation process in the encoder, the Wyner-Ziv decoder performs the motion estimation for the generation of side information in order to make the predicted signal of the Wyner-Ziv frame. The efficiency of side information generation deeply affects the overall coding performance, since the bit-rates of the Wyner-Ziv coding is directly dependent on side information. In this paper, an improved side information generation method using field coding is proposed. In the proposed method, top fields are coded with the existing SI generation method and bottom fields are coded with new SI generation method using the information of the top fields. Simulation results show that the proposed method improves the quality of the side information and rate-distortion performance compared to the conventional method.

Initial QP Determination Algorithm using Bit Rate Model (비트율 모델을 이용한 초기 QP 결정 알고리즘)

  • Park, Sang-Hyun
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.16 no.9
    • /
    • pp.1947-1954
    • /
    • 2012
  • The first frame is encoded in intra mode which generates a larger number of bits. In addition, the first frame is used for the inter mode encoding of the following frames. Thus the initial QP for the first frame affects the first frame as well as the following frames. Traditionally, the initial QP is determined among four constant values only depending on the bpp. In the case of low bit rate video coding, the initial QP value is fixed to 40 regardless of the output bandwidth. Although this initialization scheme is simple, yet it is not accurate enough. An accurate initial QP prediction scheme should not only depends on bpp but also on the complexity of the video sequence and the output bandwidth. In the proposed scheme, we determine the initial QP according to the ratio of the first frame to the total bits allocated to a GOP. To estimate the QP of the allocated bits, Rate-QP model is used. It is shown by experimental results that the new algorithm can predict the optimal initial QP more accurately and generate the PSNR performance better than that of the existing JVT algorithm.