• Title/Summary/Keyword: 가상비디오생성

Search Result 50, Processing Time 0.024 seconds

Super Metric: Quality Assessment Methods for Immersive Video (몰입형 비디오 품질 평가를 위한 슈퍼 메트릭)

  • Jeong, Jong-Beom;Kim, Seunghwan;Lee, Soonbin;Kim, Inae;Ryu, Eun-Seok
    • Journal of Internet Computing and Services
    • /
    • v.22 no.2
    • /
    • pp.51-58
    • /
    • 2021
  • Three degrees of freedom plus(3DoF+) and six degrees of freedom(6DoF) system, which supports a user's movements in graphical and natural scene-based virtual reality, requires multiple high-quality and high-resolution videos to provide immersive media. Previous video quality assessment methods are not appropriate for the 3DoF+ and 6DoF system assessment because different types of artifacts occur in these systems which are not shown in the traditional video compression. This paper provides the performance assessments of several quality assessment methods in 3DoF+ system. Furthermore, this paper presents a super metric, which combines multiple quality assessment methods, thereby it showed a higher correlation coefficient with the subjective quality assessment than the previous methods. Experimental results on 3DoF+ immersive video showed 0.4513 gain on correlation coefficient with subjective quality assessment compared to that of peak signal-to-noise ratio(PSNR).

인터렉티브 하이브리드 미디어 응용기술 -MPEG-4 SNHC를 중심으로-

  • 김형곤
    • Broadcasting and Media Magazine
    • /
    • v.3 no.2
    • /
    • pp.44-58
    • /
    • 1998
  • 최근의 멀티미디어 기술은 정보의 디지털화와 온라인화에 따라 가전, 컴퓨터, 통신 및 방송 기술이 융화되어 가는 추세에 있으며, 대화형의 하이브리드 멀티미디어 기술을 그 특징으로 하고있다. 하이브리드 멀티미디어는 컴퓨터 그래픽 및 미디(MIDI) 기술로 인위적으로 생성한 2D/3D그래픽 및 음향을 실제의 자연적인 영상과 소리에 추가하여 합성하므로 생성된다. MPEG-4는 이렇게 인위적으로 합성되거나 자연적인 영상 혹은 음향 정보의 디지털 하이브리드 멀티미디어 부호화를 목적으로 하며, 활성화된 혼합 미디어의 내용기반 처리, 상호, 동작 및 사용자의 쉬운 접근 등을 가능하게 한다. SNHC(Synthetic-Natural Hybrid Coding)는 기존의 수동적인 미디어의 전달뿐 아니라 실시간 처리가 가능한 인터랙티브 응용 분야까지 다루고 있으며, 통합된 시공간 부호화 기법을 사용하여 시각, 청각, 2차원, 3차원 컴퓨터 그래픽스 등 다양한 형태의 표준 AV(Aural/Visual) 객체를 처리한다. 표준화는 주로mesh-segmented 비디오 부호화, 구조물 부호화, 객체간의 동기화, AV 객체 스트림의 멀티플렉싱, 혼합 미디어 형태의 시-공간 통합화 등에서 이루어지게 되는데, 이는 궁극적으로 네트워크로 연결되는 가상 환경(Virtual Environment)에서 다수의 사용자가 서로 상호작용 할 수 있는 틀을 제공하는데 있다. 이러한 틀이 제공되면, 대화형 하이브리드 멀티미디어라는 새로운 형태의 정보를 사용함으로써 기존의 미디어로는 경험하지 못하는 다양한 응용과 서비스를 경험할 수 있을 것이다.

  • PDF

Efficient Correlation Channel Modeling for Transform Domain Wyner-Ziv Video Coding (Transform Domain Wyner-Ziv 비디오 부호를 위한 효과적인 상관 채널 모델링)

  • Oh, Ji-Eun;Jung, Chun-Sung;Kim, Dong-Yoon;Park, Hyun-Wook;Ha, Jeong-Seok
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.47 no.3
    • /
    • pp.23-31
    • /
    • 2010
  • The increasing demands on low-power, and low-complexity video encoder have been motivating extensive research activities on distributed video coding (DVC) in which the encoder compresses frames without utilizing inter-frame statistical correlation. In DVC encoder, contrary to the conventional video encoder, an error control code compresses the video frames by representing the frames in the form of syndrome bits. In the meantime, the DVC decoder generates side information which is modeled as a noisy version of the original video frames, and a decoder of the error-control code corrects the errors in the side information with the syndrome bits. The noisy observation, i.e., the side information can be understood as the output of a virtual channel corresponding to the orignal video frames, and the conditional probability of the virtual channel model is assumed to follow a Laplacian distribution. Thus, performance improvement of DVC systems depends on performances of the error-control code and the optimal reconstruction step in the DVC decoder. In turn, the performances of two constituent blocks are directly related to a better estimation of the parameter of the correlation channel. In this paper, we propose an algorithm to estimate the parameter of the correlation channel and also a low-complexity version of the proposed algorithm. In particular, the proposed algorithm minimizes squared-error of the Laplacian probability distribution and the empirical observations. Finally, we show that the conventional algorithm can be improved by adopting a confidential window. The proposed algorithm results in PSNR gain up to 1.8 dB and 1.1 dB on Mother and Foreman video sequences, respectively.

View Synthesis Error Removal for Comfortable 3D Video Systems (편안한 3차원 비디오 시스템을 위한 영상 합성 오류 제거)

  • Lee, Cheon;Ho, Yo-Sung
    • Smart Media Journal
    • /
    • v.1 no.3
    • /
    • pp.36-42
    • /
    • 2012
  • Recently, the smart applications, such as smart phone and smart TV, become a hot issue in IT consumer markets. In particular, the smart TV provides 3D video services, hence efficient coding methods for 3D video data are required. Three-dimensional (3D) video involves stereoscopic or multi-view images to provide depth experience through 3D display systems. Binocular cues are perceived by rendering proper viewpoint images obtained at slightly different view angles. Since the number of viewpoints of the multi-view video is limited, 3D display devices should generate arbitrary viewpoint images using available adjacent view images. In this paper, after we explain a view synthesis method briefly, we propose a new algorithm to compensate view synthesis errors around object boundaries. We describe a 3D warping technique exploiting the depth map for viewpoint shifting and a hole filling method using multi-view images. Then, we propose an algorithm to remove boundary noises that are generated due to mismatches of object edges in the color and depth images. The proposed method reduces annoying boundary noises near object edges by replacing erroneous textures with alternative textures from the other reference image. Using the proposed method, we can generate perceptually inproved images for 3D video systems.

  • PDF

Pattern Similarity Retrieval of Data Sequences for Video Retrieval System (비디오 검색 시스템을 위한 데이터 시퀀스 패턴 유사성 검색)

  • Lee Seok-Lyong
    • The KIPS Transactions:PartD
    • /
    • v.13D no.3 s.106
    • /
    • pp.347-356
    • /
    • 2006
  • A video stream can be represented by a sequence of data points in a multidimensional space. In this paper, we introduce a trend vector that approximates values of data points in a sequence and represents the moving trend of points in the sequence, and present a pattern similarity matching method for data sequences using the trend vector. A sequence is partitioned into multiple segments, each of which is represented by a trend vector. The query processing is based on the comparison of these vectors instead of scanning data elements of entire sequences. Using the trend vector, our method is designed to filter out irrelevant sequences from a database and to find similar sequences with respect to a query. We have performed an extensive experiment on synthetic sequences as well as video streams. Experimental results show that the precision of our method is up to 2.1 times higher and the processing time is up to 45% reduced, compared with an existing method.

3D Reenactment System of Soccer Game (3차원 축구 재연 시스템)

  • 이재호;김진우;김희정
    • Journal of Broadcast Engineering
    • /
    • v.8 no.1
    • /
    • pp.54-62
    • /
    • 2003
  • This paper presents a Soccer Game 3D Reencatment System which reenact the Important scene like getting a goal with image processing and computer graphics technologies. KBS Research Institute of Technology has developed the 3D Reenactment System of Soccer Game called ‘VPlay' to provide TV viewers with fresh images in soccer games. Vplay generates the reenactment of exciting and important soccer scenes by using computer graphics. Vplay extracts legion of players from video with color information, and then computes precise positions of players on the ground by using global motion estimation model and playground axis transformation model. The results are applied to locomotion generation module that generates the locomotion of virtual characters automatically. Using predefined motion and model library, Vplay reenacts the important scene in a quick and convenient manner Vplay was developed for live broadcasting of soccer games that demands rapid producing time and was used efficiently during past WorldCup and Asian Game.

A back tracing in dynamic programming for efficient the stereo matching (효율적인 스테레오 정합을 위한 동적계획법의 역 추적 방법)

  • Park, Jang-Ho;Choi, Hyun-Jun;Seo, Young-Ho;Kim, Dong-Wook
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2009.11a
    • /
    • pp.363-366
    • /
    • 2009
  • 변이영상은 두 스테레오 영상의 시차에 의해 발생하는 각 화소의 변위를 수록한 영상이다. 이 영상은 깊이영상을 생성하여 시점 간 가상영상을 생성하는데 사용된다. 따라서 변이영상은 다시점 비디오 서비스와 직접적인 연관이 있다. 본 논문에서는 유일성(uniqueness)제약과 순차성(ordering) 제약을 사용하여 기준영상과 참조영상 사이의 관계를 이용하여 생성한 변이 공간 영상(DSI : disparity space image)으로부터 비용 행렬을 계산하여 최적의 변이 경로를 찾아가는 다이내믹 프로그래밍을 분석 하였다. 다이내믹 프로그래밍은 정밀한 변이 맵을 얻을 수 있고, 다른 방식들에 비해 연산 속도가 빠르다는 장점을 가지고 있지만, 영상의 화소값의 변화가 없는 영역에서 이전의 경로를 계속 유지하려는 성질에 때문에 발생 하는 오류확산과 가려진 (occluded) 영역에 의한 오차로 인해 정확한 경로를 찾을 수 없는 경우가 빈번히 발생 하여 에러율이 높아지는 단점을 가지고 있다. 이러한 이론을 토대로 기존의 기법들에 비하여 정확도가 우수한 기법들을 제안하였다. 개선된 역 추적 과정을 이용하여 기존의 다이내믹 프로그래밍 기반의 스테레오 정합 기법들보다 우수성이 뛰어난 결과들을 나타내었다.

  • PDF

A Prototype Architecture of an Interactive Service System for Digital Hologram Videos (디지털 홀로그램 비디오를 위한 인터랙티브 서비스 시스템의 프로토타입 설계)

  • Seo, Young-Ho;Lee, Yoon-Hyuk;Yoo, Ji-Sang;Kim, Man-Bae;Choi, Hyun-Jun;Kim, Dong-Wook
    • Journal of Broadcast Engineering
    • /
    • v.17 no.4
    • /
    • pp.695-706
    • /
    • 2012
  • The purpose of this paper is to propose a service system for a digital hologram video, which has not been published yet. This system assumes the existing service frame for 2-dimensional or 3-dimensional image/video, which includes data acquisition, processing, transmission, reception, and reconstruction. This system also includes the function to service the digital hologram at the viewer's view point by tracking the viewer's face. For this function, the image information at the virtual view point corresponding to the viewer's view point is generated to get the corresponding hologram. Here in this paper, only a prototype that includes major functions of it is implemented, which includes camera system for data acquisition, camera calibration and image rectification, depth/intensity image enhancement, intermediate view generation, digital hologram generation, and holographic image reconstruction by both simulation and optical apparatus. The proposed prototype system was implemented and the result showed that it takes about 352ms to generate one frame of digital hologram and reconstruct the image by simulation, or 183ms to reconstruct image by optical apparatus instead of simulation.

Image Mosaic from a Video Sequence using Block Matching Method (블록매칭을 이용한 비디오 시퀀스의 이미지 모자익)

  • 이지근;정성태
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.8
    • /
    • pp.1792-1801
    • /
    • 2003
  • In these days, image mosaic is getting interest in the field of advertisement, tourism, game, medical imaging, and so on with the development of internet technology and the performance of personal computers. The main problem of mage mosaic is searching corresponding points correctly in the overlapped area between images. However, previous methods requires a lot of CPU times and data processing for finding corresponding points. And they need repeated recording with a revolution of 360 degree around objects or background. This paper presents a new image mosaic method which generates a panorama image from a video sequence recorded by a general video camera. Our method finds the corresponding points between two successive images by using a new direction oriented 3­step block matching methods. Experimental results show that the suggested method is more efficient than the methods based on existing block matching algorithm, such as full search and K­step search algorithm.

A Study for Depth-map Generation using Vanishing Point (소실점을 이용한 Depth-map 생성에 관한 연구)

  • Kim, Jong-Chan;Ban, Kyeong-Jin;Kim, Chee-Yong
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.2
    • /
    • pp.329-338
    • /
    • 2011
  • Recent augmentation reality demands more realistic multimedia data with the mixture of various media. High-technology for multimedia data which combines existing media data with various media such as audio and video dominates entire media industries. In particular, there is a growing need to serve augmentation reality, 3-dimensional contents and realtime interaction system development which are communication method and visualization tool in Internet. The existing services do not correspond to generate depth value for 3-dimensional space structure recovery which is to form solidity in existing contents. Therefore, it requires research for effective depth-map generation using 2-dimensional video. Complementing shortcomings of existing depth-map generation method using 2-dimensional video, this paper proposes an enhanced depth-map generation method that defines the depth direction in regard to loss location in a video in which none of existing algorithms has defined.