• Title/Summary/Keyword: multi-View

Search Result 1,267, Processing Time 0.027 seconds

3D Coordinates Acquisition by using Multi-view X-ray Images (다시점 X선 영상을 이용한 3차원 좌표 획득)

  • Yi, Sooyeong;Rhi, Jaeyoung;Kim, Soonchul;Lee, Jeonggyu
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.19 no.10
    • /
    • pp.886-890
    • /
    • 2013
  • In this paper, a 3D coordinates acquisition method for a mechanical assembly is developed by using multiview X-ray images. The multi-view X-ray images of an object are obtained by a rotary table. From the rotation transformation, it is possible to obtain the 3D coordinates of corresponding edge points on multi-view X-ray images by triangulation. The edge detection algorithm in this paper is based on the attenuation characteristic of the X-ray. The 3D coordinates of the object points are represented on a graphic display, which is used for the inspection of a mechanical assembly.

A Survey for 3D Object Detection Algorithms from Images

  • Lee, Han-Lim;Kim, Ye-ji;Kim, Byung-Gyu
    • Journal of Multimedia Information System
    • /
    • v.9 no.3
    • /
    • pp.183-190
    • /
    • 2022
  • Image-based 3D object detection is one of the important and difficult problems in autonomous driving and robotics, and aims to find and represent the location, dimension and orientation of the object of interest. It generates three dimensional (3D) bounding boxes with only 2D images obtained from cameras, so there is no need for devices that provide accurate depth information such as LiDAR or Radar. Image-based methods can be divided into three main categories: monocular, stereo, and multi-view 3D object detection. In this paper, we investigate the recent state-of-the-art models of the above three categories. In the multi-view 3D object detection, which appeared together with the release of the new benchmark datasets, NuScenes and Waymo, we discuss the differences from the existing monocular and stereo methods. Also, we analyze their performance and discuss the advantages and disadvantages of them. Finally, we conclude the remaining challenges and a future direction in this field.

Omni-directional Visual-LiDAR SLAM for Multi-Camera System (다중 카메라 시스템을 위한 전방위 Visual-LiDAR SLAM)

  • Javed, Zeeshan;Kim, Gon-Woo
    • The Journal of Korea Robotics Society
    • /
    • v.17 no.3
    • /
    • pp.353-358
    • /
    • 2022
  • Due to the limited field of view of the pinhole camera, there is a lack of stability and accuracy in camera pose estimation applications such as visual SLAM. Nowadays, multiple-camera setups and large field of cameras are used to solve such issues. However, a multiple-camera system increases the computation complexity of the algorithm. Therefore, in multiple camera-assisted visual simultaneous localization and mapping (vSLAM) the multi-view tracking algorithm is proposed that can be used to balance the budget of the features in tracking and local mapping. The proposed algorithm is based on PanoSLAM architecture with a panoramic camera model. To avoid the scale issue 3D LiDAR is fused with omnidirectional camera setup. The depth is directly estimated from 3D LiDAR and the remaining features are triangulated from pose information. To validate the method, we collected a dataset from the outdoor environment and performed extensive experiments. The accuracy was measured by the absolute trajectory error which shows comparable robustness in various environments.

Improved Method for Depth Map Fusion in Multi View System (Multi View System 에서 Depth Map Fusion 을 위한 개선된 기법)

  • Jung, Woo-Kyung;Kim, Haekwang;Han, Jong-Ki
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2021.06a
    • /
    • pp.223-225
    • /
    • 2021
  • 실감 미디어에 대한 수요가 증가함에 따라 고품질의 실감 미디어에 대한 중요성이 증가하고 있다. 이러한 실감미디어를 제작하기 위해 사용되는 일반적인 기법 중 하나인 Multi View Stereo 는 깊이 영상 추정 및 해당 깊이 영상을 이용하여 3 차원에 point cloud 를 생성하는 fusion 과정을 거치게 된다. 본 논문에서는 다중 시점 영상의 깊이 영상을 정합하는 fusion 과정을 개선하기 위한 방법을 제안한다. 제안하는 방법에서는 깊이 영상, 색상정보를 이용하여 기준 시점의 depth map 을 이용한 fusion 과정을 거친다. 실험을 통하여 제안한 알고리즘을 이용한 결과가 기존보다 개선됨을 보인다.

  • PDF

Filtering for reducing aliasing effects on auto-multiscopic monitor (무안경식 입체 다시점 모니터의 엘리어싱 감소를 위한 필터링 기법)

  • Park, Myung-Su;Yoo, Ji-Sang
    • Journal of Broadcast Engineering
    • /
    • v.15 no.4
    • /
    • pp.591-599
    • /
    • 2010
  • In this paper, we propose a disparity-adaptive filtering method to reduce view-aliasing and inter-perspective aliasing on auto-multiscopic 3D display. View aliasing by the subsampled multi-view images could happen if the resolution of an auto-multiscopic monitor is lower than that of the original multi-view images. Furthermore, multi-view images on auto-multiscopic monitor usually cause eye strain and fatigue because of inter-perspective aliasing by discontinuity that exists between viewpoints. The greater disparity value becomes, the stronger these types of aliasing could be. Thus, we design a lowpass filter whose cut-off frequency is determined adaptively to the number of intermediate reconstructed views and their disparity's strength. Though experimental results, we show that the proposed filtering algorithm could reduce the aliasing effect very efficiently by using DSCQS (double stimulus continuous quality scale method).

Performance Analysis of 3D-HEVC Video Coding (3D-HEVC 비디오 부호화 성능 분석)

  • Park, Daemin;Choi, Haechul
    • Journal of Broadcast Engineering
    • /
    • v.19 no.5
    • /
    • pp.713-725
    • /
    • 2014
  • Multi-view and 3D video technologies for a next generation video service are widely studied. These technologies can make users feel realistic experience as supporting various views. Because acquisition and transmission of a large number of views require a high cost, main challenges for multi-view and 3D video include view synthesis, video coding, and depth coding. Recently, JCT-3V (joint collaborative team on 3D video coding extension development) has being developed a new standard for multi-view and 3D video. In this paper, major tools adopted in this standard are introduced and evaluated in terms of coding efficiency and complexity. This performance analysis would be helpful for the development of a fast 3D video encoder as well as a new 3D video coding algorithm.

A Study on H.264/AVC Video Compression Standard of Multi-view Image Expressed by Layered Depth Image (계층적 깊이 영상으로 표현된 다시점 영상에 대한 H.264/AVC 비디오 압축 표준에 관한 연구)

  • Jee, Innho
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.20 no.1
    • /
    • pp.113-120
    • /
    • 2020
  • The multi-view video is a collection of multiple videos capturing the same scene at different viewpoints. Thus, there is an advantage of providing for user oriented view pointed video. This paper is suggested that the compression performance of layered depth image structure expression has improved by using more improved method. We confirm the data size of layer depth image by encoding H.264 technology and the each performances of reconstructed images. The H.264/AVC technology has easily extended for H.264 technology of video contents. In this paper, we suggested that layered depth structure can be applied for an efficient new image contents. We show that the huge data size of multi-view video image is decreased, and the higher performance of image is provided, and there is an advantage of for stressing error restoring.

CLO (Cross Layer Optimization) Technique for Multi-view Video Streaming Service over WiBro Network (WiBro망에서의 다시점 비디오 스트리밍 서비스를 위한 계층 간 최적화 방식)

  • Son, Jung-Hyun;Cho, Ye-Jin;Suh, Doug-Young;Park, Gwang-Hoon;Kim, Kyu-Heon
    • Journal of Broadcast Engineering
    • /
    • v.13 no.5
    • /
    • pp.719-731
    • /
    • 2008
  • This paper defines QoE (Quality of Experience) for multi-view video streaming service over WiBro and proposes the CLO (Cross-Layer Optimization) algorithm can maximize this. Proposal CLO algorithm contains from physical layer to video layer. Under the time-varying wireless channel condition, the CLO technique takes view-wise and the temporal priority of the multi-view video into consideration in order to decide the transmission of frames and its FEC level. At the handover situation, it is shown through computer simulation that the optimal quality of the multi-view video can be achieved using the minimum amount of resources if the proposed CLO technique is applied.

Face Tracking for Multi-view Display System (다시점 영상 시스템을 위한 얼굴 추적)

  • Han, Chung-Shin;Jang, Se-Hoon;Bae, Jin-Woo;Yoo, Ji-Sang
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.30 no.2C
    • /
    • pp.16-24
    • /
    • 2005
  • In this paper, we proposed a face tracking algorithm for a viewpoint adaptive multi-view synthesis system. The original scene captured by a depth camera contains a texture image and 8 bit gray-scale depth map. From this original image, multi-view images can be synthesized which correspond to viewer's position by using geometrical transformation such as a rotation and a translation. The proposed face tracking technique gives a motion parallax cue by different viewpoints and view angles. In the proposed algorithm, tracking of viewer's dominant face initially established from camera by using statistical characteristics of face colors and deformable templates is done. As a result, we can provide motion parallax cue by detecting viewer's dominant face area and tracking it even under a heterogeneous background and can successfully display the synthesized sequences.

Multi-View Video Coding Using Illumination Change-Adaptive Motion Estimation and 2D Direct Mode (조명변화에 적응적인 움직임 검색 기법과 2차원 다이렉트 모드를 사용한 다시점 비디오 부호화)

  • Lee, Yung Ki;Hur, Jae Ho;Lee, Yung Lyul
    • Journal of Broadcast Engineering
    • /
    • v.10 no.3
    • /
    • pp.321-327
    • /
    • 2005
  • A MVC (Multi-view Video Coding) method, which uses both an illumination change-adaptive ME (Motion Estimation)/DC (Motion Compensation) and a 2D (Dimensional) direct mode, is proposed. Firstly, a new SAD (Sum of Absolute Difference) measure for ME/MC is proposed to compensate the Luma pixel value changes for spatio-temporal motion vector prediction. Illumination change-adaptive (ICA) ME/MC uses the new SAD to improve both MV (Motion Vector) accuracy and bit saving. Secondly, The proposed 2D direct mode that can be used in inter-view prediction is an extended version of the temporal direct mode in MPEG-4 AVC. The proposed MVC method obtains approximately 0.8dB PSNR (Peak Signal-to-Noise Ratio) increment compared with the MPEG-4 AVC simulcast coding.