• Title/Summary/Keyword: Two-View Method


Integrating Multi-view Stereoscopic Transmission System into MPEG-21 DIA (Digital Item Adaptation)

  • Lee, Seung-Won;Kim, Man-Bae;Byun, Hye-Ran;Park, Il-Kwon
    • Journal of Broadcast Engineering / v.12 no.4 / pp.342-349 / 2007
  • In a typical multi-view system, all the view sequences acquired at the server are transmitted to the client. However, such a system demands high processing power from both the server and the client, which makes practical deployment difficult. A relatively simple way to overcome this problem is to transmit only the two view sequences requested by the client, which is enough to deliver stereoscopic video. In such a system, effective communication between the server and the client is an important aspect. Therefore, we propose an efficient multi-view system that transmits two view sequences according to the user's request. The view selection process is integrated into MPEG-21 DIA (Digital Item Adaptation) so that our system is compatible with the MPEG-21 multimedia framework. Furthermore, multi-view descriptors related to multi-view cameras and systems are newly introduced. The syntax of the descriptions and their elements is represented in XML (eXtensible Markup Language) schema. Intermediate view reconstruction (IVR) is used to reduce the discomfort caused by excessive disparity. IVR is also useful for smooth transitions between two stereoscopic view sequences. Finally, through a testbed implementation, we demonstrate the value and potential of our system.

Intermediate Image Generation of Stereo Image Using Depth Information and Block-based Matching Method (깊이정보와 블록기반매칭을 이용한 스테레오 영상의 중간영상 생성)

  • 양광원;허경무;김장기
    • Journal of Institute of Control, Robotics and Systems / v.8 no.10 / pp.874-880 / 2002
  • A number of techniques have been proposed for 3D display using the view difference between the two eyes. These methods do not convey enough realism compared with the real world. To improve realism, the displayed images have to change according to the position of the viewer. In this paper, we present an approach for generating an intermediate image between two images taken from different views by applying a new image interpolation algorithm. The interpolation algorithm is designed to cope with complex shapes. The proposed image interpolation algorithm generates an image rotated about the vertical axis by any angle from the base images. The base images, obtained from a CCD camera, have view-angle differences of $3^{\circ}$, $5.5^{\circ}$, $^{\circ}$, $22^{\circ}$, and $45^{\circ}$. The proposed intermediate image generation method uses geometric analysis of the images and depth information obtained through a block-based matching method.

Method for Supplementing Single-View Resolution of Multiview Autostereoscopic Three-Dimensional Display Using Plate Beam Splitter

  • Kim, Hyun-Woo;Cho, Myungjin;Lee, Min-Chul
    • Journal of information and communication convergence engineering / v.19 no.2 / pp.108-113 / 2021
  • Multiview autostereoscopic three-dimensional (MA3D) displays have the disadvantage that the single-view resolution decreases as the number of views increases. Furthermore, the resolution of MA3D displays remains relatively low even though the resolution of two-dimensional displays has increased recently. As a result, MA3D displays are unattractive to consumers, and their single-view resolution needs to be enhanced. In this study, we developed a method for supplementing the single-view resolution of MA3D displays using a plate beam splitter that can show two MA3D displays simultaneously. With the proposed method, the resolution of a single view can be increased, and the visual obstruction caused by the optical plate, which is a problem for MA3D displays, can be resolved. In addition, an MA3D display was optically designed and fabricated using a parallax barrier. Finally, the experimental optical results obtained with the proposed method were compared with those of a single MA3D display alone.

Using Freeze Frame and Visual Notifications in an Annotation Drawing Interface for Remote Collaboration

  • Kim, Seungwon;Billinghurst, Mark;Lee, Chilwoo;Lee, Gun
    • KSII Transactions on Internet and Information Systems (TIIS) / v.12 no.12 / pp.6034-6056 / 2018
  • This paper describes two user studies in remote collaboration between two users with a video conferencing system where a remote user can draw annotations on the live video of the local user's workspace. In these two studies, the local user had the control of the view when sharing the first-person view, but our interfaces provided instant control of the shared view to the remote users. The first study investigates methods for assisting drawing annotations. The auto-freeze method, a novel solution for drawing annotations, is compared to a prior solution (manual freeze method) and a baseline (non-freeze) condition. Results show that both local and remote users preferred the auto-freeze method, which is easy to use and allows users to quickly draw annotations. The manual-freeze method supported precise drawing, but was less preferred because of the need for manual input. The second study explores visual notification for better local user awareness. We propose two designs: the red-box and both-freeze notifications, and compare these to the baseline, no notification condition. Users preferred the less obtrusive red-box notification that improved awareness of when annotations were made by remote users, and had a significantly lower level of interruption compared to the both-freeze condition.

Quantization Parameter Selection Method For H.264-based Multi-view Video Coding (H.264 기반 다시점 비디오 부호화를 위한 양자화 계수 결정 방법)

  • Park, Pil-Kyu;Ho, Yo-Sung
    • The Journal of Korean Institute of Communications and Information Sciences / v.32 no.6C / pp.579-584 / 2007
  • Recently, various prediction structures have been proposed to exploit inter-view correlation among multi-view video sequences. In this paper, we propose a QP (quantization parameter) selection method for the B frame inserted in the first frames of each GOP (group of pictures), in which the QP of the B frame is changed adaptively to achieve uniform picture quality and an overall coding gain. Each B frame is coded with reference to two frames in its adjacent views. We calculate the QP of the B frame based on the correlation between the two reference frames, which is computed from their rate-distortion costs. By applying the proposed method to the MVC reference prediction structure, we improved the coding gain by 0.09 to 0.16 dB.

View interpolation using Bidirectional Disparity Map (Bidirectional disparity map을 이용한 view interpolation)

  • 김대현;김정훈;김상훈;서민정;홍현기;최종수
    • Proceedings of the IEEK Conference / 2001.06d / pp.65-68 / 2001
  • In this paper, we propose a method for interpolating between two images obtained from two parallel cameras. The proposed method uses a BDM (Bidirectional Disparity Map) to prevent the holes that occlusion would otherwise generate. Furthermore, we use a block-based DM (Disparity Map) to reduce the amount of computation, and an adaptive block size to minimize the error of the block-based DM.
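
A block-based disparity map of the kind mentioned in the abstract above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: it assumes rectified grayscale images from parallel cameras (so disparity is purely horizontal), uses the sum of absolute differences (SAD) as the matching cost, and uses a fixed block size. The bidirectional map and the adaptive block size are not covered, and the names `sad` and `block_disparity` are hypothetical.

```python
def sad(left, right, y0, x0, d, block):
    """Sum of absolute differences between the left block at (y0, x0)
    and the right block shifted d pixels to the left."""
    total = 0
    for y in range(y0, y0 + block):
        for x in range(x0, x0 + block):
            total += abs(left[y][x] - right[y][x - d])
    return total


def block_disparity(left, right, block=4, max_disp=8):
    """Return one disparity per non-overlapping block of the left image.

    Assumption: images are lists of rows of integers, taken by parallel
    cameras, so a left pixel at column x appears at column x - d on the right.
    """
    h, w = len(left), len(left[0])
    rows = []
    for y0 in range(0, h - block + 1, block):
        row = []
        for x0 in range(0, w - block + 1, block):
            # Only test shifts that keep the right block inside the image.
            candidates = range(0, min(max_disp, x0) + 1)
            best = min(candidates,
                       key=lambda d: sad(left, right, y0, x0, d, block))
            row.append(best)
        rows.append(row)
    return rows
```

Computing one disparity per block rather than per pixel is what reduces the computation; the cost is a coarser map, which is the error an adaptive block size would aim to limit.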


Facial Action Unit Detection with Multilayer Fused Multi-Task and Multi-Label Deep Learning Network

  • He, Jun;Li, Dongliang;Bo, Sun;Yu, Lejun
    • KSII Transactions on Internet and Information Systems (TIIS) / v.13 no.11 / pp.5546-5559 / 2019
  • Facial action units (AUs) have recently drawn increased attention because they can be used to recognize facial expressions. A variety of methods have been designed for frontal-view AU detection, but few have been able to handle multi-view face images. In this paper we propose a method for multi-view facial AU detection using a fused multilayer, multi-task, and multi-label deep learning network. The network can complete two tasks: AU detection and facial view detection. AU detection is a multi-label problem and facial view detection is a single-label problem. A residual network and multilayer fusion are applied to obtain more representative features. Our method is effective and performs well. The F1 score on FERA 2017 is 13.1% higher than the baseline. The facial view recognition accuracy is 0.991. This shows that our multi-task, multi-label model could achieve good performance on the two tasks.
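
For readers unfamiliar with the metric quoted in the abstract above, the F1 score of a multi-label detector can be computed per label and then averaged. The sketch below is a generic illustration: the abstract does not say how the score is averaged across AUs, so the macro average used here is an assumption, and the function names are hypothetical.

```python
def f1_score(y_true, y_pred):
    """F1 for one binary label: 2*TP / (2*TP + FP + FN)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    denom = 2 * tp + fp + fn
    # Convention chosen here: a label with no positives at all scores 0.
    return 0.0 if denom == 0 else 2 * tp / denom


def macro_f1(true_rows, pred_rows):
    """Average the per-label F1 over all labels (assumed macro averaging).

    Each row is one sample; each column is one label (for example, one AU).
    """
    n_labels = len(true_rows[0])
    scores = [
        f1_score([row[j] for row in true_rows], [row[j] for row in pred_rows])
        for j in range(n_labels)
    ]
    return sum(scores) / n_labels
```

For example, `macro_f1([[1, 1], [1, 0], [0, 1], [0, 0]], [[1, 1], [0, 0], [1, 1], [0, 0]])` averages a per-label F1 of 0.5 and 1.0.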

Hybrid 3DTV Systems Based on the Cross-View SHVC (양안 교차 SHVC 기반 융합형 3DTV 시스템)

  • Kang, Dong Wook;Jung, Kyeong Hoon;Kim, Jin Woo;Kim, Jong Ho
    • Journal of Broadcast Engineering / v.23 no.2 / pp.316-319 / 2018
  • When a terrestrial UHD broadcasting service and a mobile HD broadcasting service are provided using the PLP function of ATSC 3.0 and the domestic UHD broadcasting standard, a small amount of additional data can be transmitted to also provide a high-quality UHD-3D broadcasting service. The left and right images of the stereoscopic pair are input; one view image is encoded by the SHVC method, and the other view image is encoded by the SHVC method with two-view cross-referencing. However, since the two encoders share a common base layer (BL), they are equivalent to an encoder that generates one BL stream and two enhancement layer (EL) streams. On average, the proposed scheme is 16% more efficient than using a third independent HEVC encoding for the UHD-3D broadcast service. It also reduces the fluctuation of PSNR across image frames and increases the image quality of the minimum-PSNR frame by 0.6 dB.

Adaptive Spatio-Temporal Prediction for Multi-view Coding in 3D-Video (3차원 비디오 압축에서의 다시점 부호화를 위한 적응적 시공간적 예측 부호화)

  • 성우철;이영렬
    • Journal of Broadcast Engineering / v.9 no.3 / pp.214-224 / 2004
  • In this paper, adaptive spatio-temporal predictive coding based on H.264 is proposed for encoding 3D immersive media, such as 3D image processing, 3DTV, and 3D videoconferencing. First, we propose spatio-temporal predictive coding that uses same-view and inter-view images for two GOP (group of pictures) structures, IPPP and IBBP, which differ from the conventional simulcast method. Second, a 2D inter-view direct mode is proposed for efficient prediction when the proposed spatio-temporal prediction uses the IBBP structure. The 2D inter-view direct mode is applied when the temporal direct mode in a B (bi-predictive) picture of H.264 refers to an inter-view image, since the current temporal direct mode in the H.264 standard cannot be applied to inter-view images. The proposed method is compared with the conventional simulcast method in terms of PSNR (peak signal-to-noise ratio) for various 3D test video sequences. The proposed method shows better PSNR results than the conventional simulcast method.
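
The PSNR figure used for the comparison in the abstract above follows the standard definition, 10·log10(peak² / MSE). A minimal sketch, assuming 8-bit grayscale frames stored as lists of rows (the function name and the data layout are illustrative choices, not taken from the paper):

```python
import math


def psnr(reference, distorted, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized frames.

    Assumption: frames are lists of rows of numbers; peak is the maximum
    possible sample value (255 for 8-bit video).
    """
    if len(reference) != len(distorted) or any(
        len(a) != len(b) for a, b in zip(reference, distorted)
    ):
        raise ValueError("frames must have the same dimensions")
    n = sum(len(row) for row in reference)
    squared_error = sum(
        (a - b) ** 2
        for row_a, row_b in zip(reference, distorted)
        for a, b in zip(row_a, row_b)
    )
    mse = squared_error / n
    if mse == 0:
        return math.inf  # identical frames
    return 10.0 * math.log10(peak * peak / mse)
```

A sequence-level result is usually reported as the average of the per-frame values, which is how gains of a fraction of a dB between coding methods are typically compared.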

Confidence Map based Multi-view Image Generation Method from Stereoscopic Images (양안식 영상을 이용한 신뢰도 기반의 다시점 영상 생성 방법)

  • Kim, Do Young;Ho, Yo-Sung
    • Smart Media Journal / v.2 no.4 / pp.27-33 / 2013
  • A multi-view video system provides both a realistic 3D experience and free-viewpoint navigation. However, it is difficult to transmit such a large amount of data, so only two or three view images are sent and intermediate view images are generated using depth information. In this paper, we propose a high-quality multi-view image generation method from stereoscopic images. Since stereo matching does not provide accurate disparity values for all pixels, especially in occluded areas, we first propose an occlusion handling method that uses the background pixels. We also apply joint bilateral filtering to enhance the disparity map at object boundaries, since it can significantly affect the quality of the synthesized images. Finally, we generate virtual view images at intermediate view positions using a confidence map to reduce bad pixels and hole errors. Experimental results show that the proposed method performs better than the conventional method.
