• Title/Summary/Keyword: Depth segmentation

Search Result 175, Processing Time 0.031 seconds

High-Quality Depth Map Generation of Humans in Monocular Videos (단안 영상에서 인간 오브젝트의 고품질 깊이 정보 생성 방법)

  • Lee, Jungjin;Lee, Sangwoo;Park, Jongjin;Noh, Junyong
    • Journal of the Korea Computer Graphics Society
    • /
    • v.20 no.2
    • /
    • pp.1-11
    • /
    • 2014
  • The quality of 2D-to-3D conversion depends on the accuracy of the assigned depth to scene objects. Manual depth painting for given objects is labor intensive as each frame is painted. Specifically, a human is one of the most challenging objects for a high-quality conversion, as a human body is an articulated figure and has many degrees of freedom (DOF). In addition, various styles of clothes, accessories, and hair create a very complex silhouette around the 2D human object. We propose an efficient method to estimate visually pleasing depths of a human at every frame in a monocular video. First, a 3D template model is matched to a person in a monocular video with a small number of specified user correspondences. Our pose estimation with sequential joint angular constraints reproduces a various range of human motions (i.e., spine bending) by allowing the utilization of a fully skinned 3D model with a large number of joints and DOFs. The initial depth of the 2D object in the video is assigned from the matched results, and then propagated toward areas where the depth is missing to produce a complete depth map. For the effective handling of the complex silhouettes and appearances, we introduce a partial depth propagation method based on color segmentation to ensure the detail of the results. We compared the result and depth maps painted by experienced artists. The comparison shows that our method produces viable depth maps of humans in monocular videos efficiently.

Low Resolution Depth Interpolation using High Resolution Color Image (고해상도 색상 영상을 이용한 저해상도 깊이 영상 보간법)

  • Lee, Gyo-Yoon;Ho, Yo-Sung
    • Smart Media Journal
    • /
    • v.2 no.4
    • /
    • pp.60-65
    • /
    • 2013
  • In this paper, we propose a high-resolution disparity map generation method using a low-resolution time-of-flight (TOF) depth camera and color camera. The TOF depth camera is efficient since it measures the range information of objects using the infra-red (IR) signal in real-time. It also quantizes the range information and provides the depth image. However, there are some problems of the TOF depth camera, such as noise and lens distortion. Moreover, the output resolution of the TOF depth camera is too small for 3D applications. Therefore, it is essential to not only reduce the noise and distortion but also enlarge the output resolution of the TOF depth image. Our proposed method generates a depth map for a color image using the TOF camera and the color camera simultaneously. We warp the depth value at each pixel to the color image position. The color image is segmented using the mean-shift segmentation method. We define a cost function that consists of color values and segmented color values. We apply a weighted average filter whose weighting factor is defined by the random walk probability using the defined cost function of the block. Experimental results show that the proposed method generates the depth map efficiently and we can reconstruct good virtual view images.

  • PDF

Study on object detection and distance measurement functions with Kinect for windows version 2 (키넥트(Kinect) 윈도우 V2를 통한 사물감지 및 거리측정 기능에 관한 연구)

  • Niyonsaba, Eric;Jang, Jong-Wook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.21 no.6
    • /
    • pp.1237-1242
    • /
    • 2017
  • Computer vision is coming more interesting with new imaging sensors' new capabilities which enable it to understand more its surrounding environment by imitating human vision system with artificial intelligence techniques. In this paper, we made experiments with Kinect camera, a new depth sensor for object detection and distance measurement functions, most essential functions in computer vision such as for unmanned or manned vehicles, robots, drones, etc. Therefore, Kinect camera is used here to estimate the position or the location of objects in its field of view and measure the distance from them to its depth sensor in an accuracy way by checking whether that the detected object is real object or not to reduce processing time ignoring pixels which are not part of real object. Tests showed promising results with such low-cost range sensor, Kinect camera which can be used for object detection and distance measurement which are fundamental functions in computer vision applications for further processing.

An Efficient Object Extraction Scheme for Low Depth-of-Field Images (낮은 피사계 심도 영상에서 관심 물체의 효율적인 추출 방법)

  • Park Jung-Woo;Lee Jae-Ho;Kim Chang-Ick
    • Journal of Korea Multimedia Society
    • /
    • v.9 no.9
    • /
    • pp.1139-1149
    • /
    • 2006
  • This paper describes a novel and efficient algorithm, which extracts focused objects from still images with low depth-of-field (DOF). The algorithm unfolds into four modules. In the first module, a HOS map, in which the spatial distribution of the high-frequency components is represented, is obtained from an input low DOF image [1]. The second module finds OOI candidate by using characteristics of the HOS. Since it is possible to contain some holes in the region, the third module detects and fills them. In order to obtain an OOI, the last module gets rid of background pixels in the OOI candidate. The experimental results show that the proposed method is highly useful in various applications, such as image indexing for content-based retrieval from huge amounts of image database, image analysis for digital cameras, and video analysis for virtual reality, immersive video system, photo-realistic video scene generation and video indexing system.

  • PDF

A Real-Time Stereoscopic Image Conversion Method Using Motion Parallax (운동 시차를 이용한 실시간 입체 영상 변환 방법)

  • Choi, Chul-Ho;Kwon, Byong-Heon;Choi, Myung-Ryul
    • The KIPS Transactions:PartB
    • /
    • v.10B no.4
    • /
    • pp.359-366
    • /
    • 2003
  • We propose a real-time stereoscopic image conversion method that can generate stereoscopic image with different perspective depth using motion parallax from 2-D image and offer realistic 3-D effect regardless of the direction and velocity of the moving object in the 2-D image. The stereoscopic image is generated by computing the motion parallax between adjacent two 2-D images using the proposed method for motion detection, region segmentation and depth map generation. The proposed method is suitable for real-time stereoscopic conversion processing on various image formats. It has been verified the proposed method by comparing between the stereoscopic image of the proposed method and that of MTD.

Stereo Vision Based 3-D Motion Tracking for Human Animation

  • Han, Seung-Il;Kang, Rae-Won;Lee, Sang-Jun;Ju, Woo-Suk;Lee, Joan-Jae
    • Journal of Korea Multimedia Society
    • /
    • v.10 no.6
    • /
    • pp.716-725
    • /
    • 2007
  • In this paper we describe a motion tracking algorithm for 3D human animation using stereo vision system. This allows us to extract the motion data of the end effectors of human body by following the movement through segmentation process in HIS or RGB color model, and then blob analysis is used to detect robust shape. When two hands or two foots are crossed at any position and become disjointed, an adaptive algorithm is presented to recognize whether it is left or right one. And the real motion is the 3-D coordinate motion. A mono image data is a data of 2D coordinate. This data doesn't acquire distance from a camera. By stereo vision like human vision, we can acquire a data of 3D motion such as left, right motion from bottom and distance of objects from camera. This requests a depth value including x axis and y axis coordinate in mono image for transforming 3D coordinate. This depth value(z axis) is calculated by disparity of stereo vision by using only end-effectors of images. The position of the inner joints is calculated and 3D character can be visualized using inverse kinematics.

  • PDF

Study on an Image Reconstruction Algorithm for 3D Cartilage OCT Images (A Preliminary Study) (3차원 연골 광간섭 단층촬영 이미지들에 대한 영상 재구성 알고리듬 연구)

  • Ho, Dong-Su;Kim, Ee-Hwa;Kim, Yong-Min;Kim, Beop-Min
    • Progress in Medical Physics
    • /
    • v.20 no.2
    • /
    • pp.62-71
    • /
    • 2009
  • Recently, optical coherence tomography (OCT) has demonstrated considerable promise for the noninvasive assessment of biological tissues. However, OCT images difficult to analyze due to speckle noise. In this paper, we tested various image processing techniques for speckle removal of human and rabbit cartilage OCT images. Also, we distinguished the images which get with methods of image segmentation for OCT images, and found the most suitable method for segmenting an image. And, we selected image segmentation suitable for OCT before image reconstruction. OCT was a weak point to system design and image processing. It was a limit owing to measure small a distance and depth size. So, good edge matching algorithms are important for image reconstruction. This paper presents such an algorithm, the chamfer matching algorithm. It is made of background for 3D image reconstruction. The purpose of this paper is to describe good image processing techniques for speckle removal, image segmentation, and the 3D reconstruction of cartilage OCT images.

  • PDF

Analysis of the Individual Tree Growth for Urban Forest using Multi-temporal airborne LiDAR dataset (다중시기 항공 LiDAR를 활용한 도시림 개체목 수고생장분석)

  • Kim, Seoung-Yeal;Kim, Whee-Moon;Song, Won-Kyong;Choi, Young-Eun;Choi, Jae-Yong;Moon, Guen-Soo
    • Journal of the Korean Society of Environmental Restoration Technology
    • /
    • v.22 no.5
    • /
    • pp.1-12
    • /
    • 2019
  • It is important to measure the height of trees as an essential element for assessing the forest health in urban areas. Therefore, an automated method that can measure the height of individual tree as a three-dimensional forest information is needed in an extensive and dense forest. Since airborne LiDAR dataset is easy to analyze the tree height(z-coordinate) of forests, studies on individual tree height measurement could be performed as an assessment forest health. Especially in urban forests, that adversely affected by habitat fragmentation and isolation. So this study was analyzed to measure the height of individual trees for assessing the urban forests health, Furthermore to identify environmental factors that affect forest growth. The survey was conducted in the Mt. Bongseo located in Seobuk-gu. Cheonan-si(Middle Chungcheong Province). We segment the individual trees on coniferous by automatic method using the airborne LiDAR dataset of the two periods (year of 2016 and 2017) and to find out individual tree growth. Segmentation of individual trees was performed by using the watershed algorithm and the local maximum, and the tree growth was determined by the difference of the tree height according to the two periods. After we clarify the relationship between the environmental factors affecting the tree growth. The tree growth of Mt. Bongseo was about 20cm for a year, and it was analyzed to be lower than 23.9cm/year of the growth of the dominant species, Pinus rigida. This may have an adverse effect on the growth of isolated urban forests. It also determined different trees growth according to age, diameter and density class in the stock map, effective soil depth and drainage grade in the soil map. There was a statistically significant positive correlation between the distance to the road and the solar radiation as an environmental factor affecting the tree growth. Since there is less correlation, it is necessary to determine other influencing factors affecting tree growth in urban forests besides anthropogenic influences. This study is the first data for the analysis of segmentation and the growth of the individual tree, and it can be used as a scientific data of the urban forest health assessment and management.

Three-Dimensional Conversion of Two-Dimensional Movie Using Optical Flow and Normalized Cut (Optical Flow와 Normalized Cut을 이용한 2차원 동영상의 3차원 동영상 변환)

  • Jung, Jae-Hyun;Park, Gil-Bae;Kim, Joo-Hwan;Kang, Jin-Mo;Lee, Byoung-Ho
    • Korean Journal of Optics and Photonics
    • /
    • v.20 no.1
    • /
    • pp.16-22
    • /
    • 2009
  • We propose a method to convert a two-dimensional movie to a three-dimensional movie using normalized cut and optical flow. In this paper, we segment an image of a two-dimensional movie to objects first, and then estimate the depth of each object. Normalized cut is one of the image segmentation algorithms. For improving speed and accuracy of normalized cut, we used a watershed algorithm and a weight function using optical flow. We estimate the depth of objects which are segmented by improved normalized cut using optical flow. Ordinal depth is estimated by the change of the segmented object label in an occluded region which is the difference of absolute values of optical flow. For compensating ordinal depth, we generate the relational depth which is the absolute value of optical flow as motion parallax. A final depth map is determined by multiplying ordinal depth by relational depth, then dividing by average optical flow. In this research, we propose the two-dimensional/three-dimensional movie conversion method which is applicable to all three-dimensional display devices and all two-dimensional movie formats. We present experimental results using sample two-dimensional movies.

Speed-up of Document Image Binarization Method Based on Water Flow Model (Water flow model에 기반한 문서영상 이진화 방법의 속도 개선)

  • 오현화;김도훈;이재용;김두식;임길택;진성일
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.4
    • /
    • pp.75-86
    • /
    • 2004
  • This paper proposes a method to speed up the document image binarization using a water flow model. The proposed method extracts the region of interest (ROI) around characters from a document image and restricts pouring water onto a 3-dimensional terrain surface of an image only within the ROI. The amount of water to be filed into a local valley is determined automatically depending on its depth and slope. The proposed method accumulates weighted water not only on the locally lowest position but also on its neighbors. Therefore, a valley is filed enough with only one try of pouring water onto the terrain surface of the ROI. Finally, the depth of each pond is adaptively thresholded for robust character segmentation, because the depth of a pond formed at a valley varies widely according to the gray-level difference between characters and backgrounds. In our experiments on real document images, the Proposed method has attained good binarization performance as well as remarkably reduced processing time compared with that of the existing method based on a water flow model.