• 제목/요약/키워드: 2D Video

Search Result 910, Processing Time 0.036 seconds

Fast Quadtree Based Normalized Cross Correlation Method for Fractal Video Compression using FFT

  • Chaudhari, R.E.;Dhok, S.B.
    • Journal of Electrical Engineering and Technology
    • /
    • v.11 no.2
    • /
    • pp.519-528
    • /
    • 2016
  • In order to achieve fast computational speed with good visual quality of output video, we propose a frequency domain based new fractal video compression scheme. Normalized cross correlation is used to find the structural self similar domain block for the input range block. To increase the searching speed, cross correlation is implemented in the frequency domain using FFT with one computational operation for all the domain blocks instead of individual block wise calculations. The encoding time is further minimized by applying rotation and reflection DFT properties to the IFFT of zero padded range blocks. The energy of overlap small size domain blocks is pre-computed for the entire reference frame and retaining the energies of the overlapped search window portion of previous adjacent block. Quadtree decompositions are obtained by using domain block motion compensated prediction error as a threshold to control the further partitions of the block. It provides a better level of adaption to the scene contents than fixed block size approach. The result shows that, on average, the proposed method can raise the encoding speed by 48.8 % and 90 % higher than NHEXS and CPM/NCIM algorithms respectively. The compression ratio and PSNR of the proposed method is increased by 15.41 and 0.89 dB higher than that of NHEXS on average. For low bit rate videos, the proposed algorithm achieve the high compression ratio above 120 with more than 31 dB PSNR.

Clustered Segment Index Scheme for P2P VOD Service on Virtual Mesh Overlay Network (가상 메시 오버레이 네트워크상에서의 P2P VOD 서비스를 위한 클러스터 세그먼트 인덱스 기법)

  • Lim, Pheng-Un;Choi, Hwang-Kyu
    • The Transactions of The Korean Institute of Electrical Engineers
    • /
    • v.65 no.6
    • /
    • pp.1052-1059
    • /
    • 2016
  • Video-on-Demand(VoD) is one of the most popular media streaming which attracted many researchers' attention. VMesh is one of the most cited works in the field of the VoD system. VMesh is proposed to solve the problem of random seeking functionality. However, a large number of the DHT(Distributed Hash Table) searches in VMesh is sill the main problem which needs to be solved. In order to reduce the number of the DHT searches, the clustered segment index(CSI) scheme is proposed. In this scheme, the video segments are divided into clusters. The segment information of the video segments, which are clustered into the same cluster, are stored in the same clustered segment index that can be searched by using the hash key. Each peer also can request the required segments by using this clustered segment index. The experiment results show that the number of the DHT searches in the proposed scheme is less than that of VMesh even in case of peers leave and join the network or peers perform the fast forward/backward operations.

Dependent Quantization for Scalable Video Coding

  • Pranantha, Danu;Kim, Mun-Churl;Hahm, Sang-Jin;Lee, Keun-Sik;Park, Keun-Soo
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2006.11a
    • /
    • pp.127-132
    • /
    • 2006
  • Quantization in video coding plays an important role in controlling the bit-rate of compressed video bit-streams. It has been used as an important control means to adjust the amount of bit-streams to at]owed bandwidth of delivery networks and storage. Due to the dependent nature of video coding, dependent quantization has been proposed and applied for MPEG-2 video coding to better maintain the quality of reconstructed frame for given constraints of target bit-rate. Since Scalable Video Coding (SVC) being currently standardized exhibits highly dependent coding nature not only between frames but also lower and higher scalability layers where the dependent quantization can be effectively applied, in this paper, we propose a dependent quantization scheme for SVC and compare its performance in visual qualities and bit-rates with the current JSVM reference software for SVC. The proposed technique exploits the frame dependences within each GOP of SVC scalability layers to formulate dependent quantization. We utilize Lagrange optimization, which is widely accepted in R-D (rate-distortion) based optimization, and construct trellis graph to find the optimal cost path in the trellis by minimizing the R-D cost. The optimal cost path in the trellis graph is the optimal set of quantization parameters (QP) for frames within a GOP. In order to reduce the complexity, we employ pruning procedure using monotonicity property in the trellis optimization and cut the frame dependency into one GOP to decrease dependency depth. The optimal Lagrange multiplier that is used for SVC is equal to H.264/AVC which is also used in the mode prediction of the JSVM reference software. The experimental result shows that the dependent quantization outperforms the current JSVM reference software encoder which actually takes a linear increasing QP in temporal scalability layers. The superiority of the dependent quantization is achieved up to 1.25 dB increment in PSNR values and 20% bits saving for the enhancement layer of SVC.

  • PDF

Adaptive Pre-/Post-Filters for NRT-Based Stereoscopic Video Coding

  • Lee, Byung-Tak;Lee, BongHo;Choi, Haechul;Kim, Jin-Soo;Yun, Kugjin;Cheong, Won-Sik;Kim, Jae-Gon
    • ETRI Journal
    • /
    • v.34 no.5
    • /
    • pp.666-673
    • /
    • 2012
  • Non-real-time delivery of stereoscopic video has been considered as a service scenario for 3DTV to overcome the limited bandwidth in the terrestrial digital television system. A hybrid codec combining MPEG-2 and H.264/AVC has been suggested for the compression of stereoscopic video for 3DTV. In this paper, we propose a stereoscopic video coding scheme using adaptive pre-/post-filters (APPF) to improve the quality of 3D video while retaining compatibility with legacy video coding standards. The APPF are applied adaptively to blocks of various sizes determined by the macroblock coding mode and reference frame index. Experiment results show that the proposed method achieves up to 24.86% bit rate savings relative to a hybrid codec of MPEG-2 and H.264/AVC including the inter-view prediction.

A Characterisitc Analysis Study of Android based Stereoscopic 3D Technology (안드로이드기반 스테레오스코픽 3D 기술 특성분석 연구)

  • Kim, Cheong Ghil
    • Journal of Satellite, Information and Communications
    • /
    • v.8 no.2
    • /
    • pp.68-73
    • /
    • 2013
  • In recent years, the developments in 3D technologies have initiated the commercialization of 3D services on mobile devices. For this purpose, stereoscopic 3D technology is used, which enables 3D TV on mobile devices including smartphone with glasses-free 3D viewing. As a result, the issues related with stereo imaging have been spotlighted greatly. Especially, three MPEG coding standards are provided for mobile 3D services, namely H.264/AVC with and without SEI message and H.264/MVC. In this respect, this paper presents an overview of developments in stereoscopic technologies for mobile devices to gain some perspective on the changes and progress. in this paper, we verified the availability of android based stereoscipic 3D technology related with mobile 3D TV and Smartphone with special emphasis on 3D video format and 3D features by various technollogy characteristics analysis.

BoF based Action Recognition using Spatio-Temporal 2D Descriptor (시공간 2D 특징 설명자를 사용한 BOF 방식의 동작인식)

  • KIM, JinOk
    • Journal of Internet Computing and Services
    • /
    • v.16 no.3
    • /
    • pp.21-32
    • /
    • 2015
  • Since spatio-temporal local features for video representation have become an important issue of modeless bottom-up approaches in action recognition, various methods for feature extraction and description have been proposed in many papers. In particular, BoF(bag of features) has been promised coherent recognition results. The most important part for BoF is how to represent dynamic information of actions in videos. Most of existing BoF methods consider the video as a spatio-temporal volume and describe neighboring 3D interest points as complex volumetric patches. To simplify these complex 3D methods, this paper proposes a novel method that builds BoF representation as a way to learn 2D interest points directly from video data. The basic idea of proposed method is to gather feature points not only from 2D xy spatial planes of traditional frames, but from the 2D time axis called spatio-temporal frame as well. Such spatial-temporal features are able to capture dynamic information from the action videos and are well-suited to recognize human actions without need of 3D extensions for the feature descriptors. The spatio-temporal BoF approach using SIFT and SURF feature descriptors obtains good recognition rates on a well-known actions recognition dataset. Compared with more sophisticated scheme of 3D based HoG/HoF descriptors, proposed method is easier to compute and simpler to understand.

A Video Watermarking Based on Wavelet Transform Using Spread Spectrum Technique (대역확산방법을 이용한 웨이블릿 기반의 비디오 워터마킹)

  • Kim, Seung-Jin;Kim, Tae-Su;Lee, Kuhn-Il
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.5 s.305
    • /
    • pp.11-18
    • /
    • 2005
  • In this paper, we proposed a video watermarking algerian based on wavelet transform using statistical characteristic of video according to the energy distribution and the spread spectrum technique. In the proposed method, the original video is splitted by spatial difference metric and classified into the motion region and the motionless region according to the motion degree. The motion region is decomposed into 3-levels using 3D DWT and the motionless region is decomposed into 2-levels using 2D DWT The baseband of the wavelet-decomposed image is not utilized because of the image quality. So that the standard deviation of the highest subband coefficients except for the baseband is used to determine the threshold. Binary video watermarks preprocessed by the random permutation and the spread spectrum technique are embedded into selected coefficients. In computer experiments, the proposed algorithm was found to be more invisible and robust than the conventional algorithms.

Applying differential techniques for 2D/3D video conversion to the objects grouped by depth information (2D/3D 동영상 변환을 위한 그룹화된 객체별 깊이 정보의 차등 적용 기법)

  • Han, Sung-Ho;Hong, Yeong-Pyo;Lee, Sang-Hun
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.13 no.3
    • /
    • pp.1302-1309
    • /
    • 2012
  • In this paper, we propose applying differential techniques for 2D/3D video conversion to the objects grouped by depth information. One of the problems converting 2D images to 3D images using the technique tracking the motion of pixels is that objects not moving between adjacent frames do not give any depth information. This problem can be solved by applying relative height cue only to the objects which have no moving information between frames, after the process of splitting the background and objects and extracting depth information using motion vectors between objects. Using this technique all the background and object can have their own depth information. This proposed method is used to generate depth map to generate 3D images using DIBR(Depth Image Based Rendering) and verified that the objects which have no movement between frames also had depth information.

Influence of Gaming Display and Wearing Glasses on Perceived Characteristics, Presence, and Fatigue (게임 디스플레이 종류와 안경착용 여부에 따른 영상의 인지된 특성, 프레즌스 그리고 피로도의 차이)

  • Lee, Hyunji;Chung, Donghun
    • Journal of Broadcast Engineering
    • /
    • v.17 no.6
    • /
    • pp.1004-1013
    • /
    • 2012
  • 3D images and videos are required viewers to wear 3D glasses. According to the data, about half of Korean people wear glasses or contact lens and this implies 3D video viewers may have a trouble due to putting a pair of 3D glasses atop their glasses. The purpose of this study is to examine gamers' perceived characteristics, presence, and fatigue according to video gaming display (2D vs. 3D) and glasses whether wearing or not. The results show that the interaction effect of the display and wearing glasses was statistically significant in the perceived presence, and the main effect of the display was statistically significant in the perceived characteristics and fatigue.

Clinical Analysis of Video-assisted Thoracoscopic Spinal Surgery in the Thoracic or Thoracolumbar Spinal Pathologies

  • Kim, Sung-Jin;Sohn, Moon-Jun;Ryoo, Ji-Yoon;Kim, Yeon-Soo;Whang, Choong-Jin
    • Journal of Korean Neurosurgical Society
    • /
    • v.42 no.4
    • /
    • pp.293-299
    • /
    • 2007
  • Objective : Thoracoscopic spinal surgery provides minimally invasive approaches for effective vertebral decompression and reconstruction of the thoracic and thoracolumbar spine, while surgery related morbidity can be significantly lowered. This study analyzes clinical results of thoracoscopic spinal surgery performed at our institute. Methods : Twenty consecutive patients underwent video-assisted thoracosopic surgery (VATS) to treat various thoracic and thoracolumbar pathologies from April 2000 to July 2006. The lesions consisted of spinal trauma (13 cases), thoracic disc herniation (4 cases), tuberculous spondylitis (1 case), post-operative thoracolumbar kyphosis (1 case) and thoracic tumor (1 case). The level of operation included upper thoracic lesions (3 cases), midthoracic lesions (6 cases) and thoracolumbar lesions (11 cases). We classified the procedure into three groups: stand-alone thoracoscopic discectomy (3 cases), thoracoscopic fusion (11 cases) and video assisted mini-thoracotomy (6 cases). Results : Analysis on the Frankel performance scale in spinal trauma patients (13 cases), showed a total of 7 patients who had neurological impairment preoperatively : Grade D (2 cases), Grade C (2 cases), Grade B (1 case), and Grade A (2 cases). Four patients were neurologically improved postoperatively, two patients were improved from C to E, one improved from grade D to E and one improved from grade B to grade D. The preoperative Cobb's and kyphotic angle were measured in spinal trauma patients and were $18.9{\pm}4.4^{\circ}$ and $18.8{\pm}4.6^{\circ}$, respectively. Postoperatively, the angles showed statistically significant improvement, $15.1{\pm}3.7^{\circ}$ and $11.3{\pm}2.4^{\circ}$, respectively(P<0.001). Conclusion : Although VATS requires a steep learning curve, it is an effective and minimally invasive procedure which provides biomechanical stability in terms of anterior column decompression and reconstruction for anterior load bearing, and preservation of intercostal muscles and diaphragm.