• Title/Summary/Keyword: Stereo cameras.

Search Result 208, Processing Time 0.026 seconds

Efficient Implementation of Candidate Region Extractor for Pedestrian Detection System with Stereo Camera based on GP-GPU (스테레오 영상 보행자 인식 시스템의 후보 영역 검출을 위한 GP-GPU 기반의 효율적 구현)

  • Jeong, Geun-Yong;Jeong, Jun-Hee;Lee, Hee-Chul;Jeon, Gwang-Gil;Cho, Joong-Hwee
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.8 no.2
    • /
    • pp.121-128
    • /
    • 2013
  • There have been various research efforts for pedestrian recognition in embedded imaging systems. However, many suffer from their heavy computational complexities. SVM classification method has been widely used for pedestrian recognition. The reduction of candidate region is crucial for low-complexity scheme. In this paper, We propose a real time HOG based pedestrian detection system on GPU which images are captured by a pair of cameras. To speed up humans on road detection, the proposed method reduces a number of detection windows with disparity-search and near-search algorithm and uses the GPU and the NVIDIA CUDA framework. This method can be achieved speedups of 20% or more compared to the recent GPU implementations. The effectiveness of our algorithm is demonstrated in terms of the processing time and the detection performance.

3D Image Capturing and 3D Content Generation for Realistic Broadcasting (실감방송을 위한 3차원 영상 촬영 및 3차원 콘텐츠 제작 기술)

  • Kang, Y.S.;Ho, Y.S.
    • Smart Media Journal
    • /
    • v.1 no.1
    • /
    • pp.10-16
    • /
    • 2012
  • Stereo and multi-view cameras have been used to capture the three-dimensional (3D) scene for 3D contents generation. Besides, depth sensors are frequently used to obtain 3D information of the captured scene in real time. In order to generate 3D contents from captured images, we need several preprocessing operations to reduce noises and distortions in the images. 3D contents are considered as the basic media for realistic broadcasting that provides photo-realistic and immersive feeling to users. In this paper, we show technical trends of 3D image capturing and contents generation, and explain some core techniques for 3D image processing for realistic 3DTV broadcasting.

  • PDF

Development of a Biped Walking Robot Actuated by a Closed-Chain Mechanism

  • Choi, Hyeung-Sik;Oh, Jung-Min;Baek, Chang-Yul;Chung, Kyung-Sik
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.209-214
    • /
    • 2003
  • We developed a new type of human-sized BWR (biped walking robot), named KUBIR1 which is driven by the closed-chain type of actuator. A new type of the closed-chain actuator for the robot is developed, which is composed of the four-bar-link mechanism driven by the ball screw which has high strength and high gear ratio. Each leg of the robot is composed of 6 D.O.F joints. For front walking, three pitch joints and one roll joint at the ankle. In addition to this, one yaw joint for direction change, and another roll joint for balancing the body are attached. Also, the robot has two D.O.F joints of each hand and three D.O.F. for eye motion. There are three actuating motors for stereo cameras for eyes. In all, a 18 degree-of-freedom robot was developed. KUBIR1 was designed to walk autonomously by adapting small 90W DC motors as the robot actuators and batteries and controllers are on-boarded. The whole weight for Kubir1 is over 90Kg, and height is 167Cm. In the paper, the performance test of KUBIR1 will be shown.

  • PDF

Cooperative recognition using multi-view images

  • Kojoh, Toshiyuki;Nagata, Tadashi;Zha, Hong-Bin
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1993.10b
    • /
    • pp.70-75
    • /
    • 1993
  • We represent a method of 3-D object recognition using multi images in this paper. The recognition process is executed as follows. Object models as prior knowledgement are generated and stored on a computer. To extract features of a recognized object, three CCD cameras are set at vertices of a regular triangle and take images of an object to be recognized. By comparing extracted features with generated models, the object is recognized. In general, it is difficult to recognize 3-D objects because there are the following problems such as how to make the correspondence to both stereo images, generate and store an object model according to a recognition process, and effectively collate information gotten from input images. We resolve these problems using the method that the collation on the basis of features independent on the viewpoint, the generation of object models as enumerating some candidate models in an early recognition level, the execution a tight cooperative process among results gained by analyzing each image. We have made experiments based on real images in which polyhedral objects are used as objects to be recognized. Some of results reveal the usefulness of the proposed method.

  • PDF

Entity Matching for Vision-Based Tracking of Construction Workers Using Epipolar Geometry (영상 내 건설인력 위치 추적을 위한 등극선 기하학 기반의 개체 매칭 기법)

  • Lee, Yong-Joo;Kim, Do-Wan;Park, Man-Woo
    • Journal of KIBIM
    • /
    • v.5 no.2
    • /
    • pp.46-54
    • /
    • 2015
  • Vision-based tracking has been proposed as a means to efficiently track a large number of construction resources operating in a congested site. In order to obtain 3D coordinates of an object, it is necessary to employ stereo-vision theories. Detecting and tracking of multiple objects require an entity matching process that finds corresponding pairs of detected entities across the two camera views. This paper proposes an efficient way of entity matching for tracking of construction workers. The proposed method basically uses epipolar geometry which represents the relationship between the two fixed cameras. Each pixel coordinate in a camera view is projected onto the other camera view as an epipolar line. The proposed method finds the matching pair of a worker entity by comparing the proximity of the all detected entities in the other view to the epipolar line. Experimental results demonstrate its suitability for automated entity matching for 3D vision-based tracking of construction workers.

Robust 3D visual tracking for moving object using pan/tilt stereo cameras (Pan/Tilt스테레오 카메라를 이용한 이동 물체의 강건한 시각추적)

  • Cho, Che-Seung;Chung, Byeong-Mook;Choi, In-Su;Nho, Sang-Hyun;Lim, Yoon-Kyu
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.22 no.9 s.174
    • /
    • pp.77-84
    • /
    • 2005
  • In most vision applications, we are frequently confronted with determining the position of object continuously. Generally, intertwined processes ire needed for target tracking, composed with tracking and control process. Each of these processes can be studied independently. In case of actual implementation we must consider the interaction between them to achieve robust performance. In this paper, the robust real time visual tracking in complex background is considered. A common approach to increase robustness of a tracking system is to use known geometric models (CAD model etc.) or to attach the marker. In case an object has arbitrary shape or it is difficult to attach the marker to object, we present a method to track the target easily as we set up the color and shape for a part of object previously. Robust detection can be achieved by integrating voting-based visual cues. Kalman filter is used to estimate the motion of moving object in 3D space, and this algorithm is tested in a pan/tilt robot system. Experimental results show that fusion of cues and motion estimation in a tracking system has a robust performance.

Recent Technologies for the Acquisition and Processing of 3D Images Based on Deep Learning (딥러닝기반 입체 영상의 획득 및 처리 기술 동향)

  • Yoon, M.S.
    • Electronics and Telecommunications Trends
    • /
    • v.35 no.5
    • /
    • pp.112-122
    • /
    • 2020
  • In 3D computer graphics, a depth map is an image that provides information related to the distance from the viewpoint to the subject's surface. Stereo sensors, depth cameras, and imaging systems using an active illumination system and a time-resolved detector can perform accurate depth measurements with their own light sources. The 3D image information obtained through the depth map is useful in 3D modeling, autonomous vehicle navigation, object recognition and remote gesture detection, resolution-enhanced medical images, aviation and defense technology, and robotics. In addition, the depth map information is important data used for extracting and restoring multi-view images, and extracting phase information required for digital hologram synthesis. This study is oriented toward a recent research trend in deep learning-based 3D data analysis methods and depth map information extraction technology using a convolutional neural network. Further, the study focuses on 3D image processing technology related to digital hologram and multi-view image extraction/reconstruction, which are becoming more popular as the computing power of hardware rapidly increases.

Visual Sensing of the Light Spot of a Laser Pointer for Robotic Applications

  • Park, Sung-Ho;Kim, Dong Uk;Do, Yongtae
    • Journal of Sensor Science and Technology
    • /
    • v.27 no.4
    • /
    • pp.216-220
    • /
    • 2018
  • In this paper, we present visual sensing techniques that can be used to teach a robot using a laser pointer. The light spot of an off-the-shelf laser pointer is detected and its movement is tracked on consecutive images of a camera. The three-dimensional position of the spot is calculated using stereo cameras. The light spot on the image is detected based on its color, brightness, and shape. The detection results in a binary image, and morphological processing steps are performed on the image to refine the detection. The movement of the laser spot is measured using two methods. The first is a simple method of specifying the region of interest (ROI) centered at the current location of the light spot and finding the spot within the ROI on the next image. It is assumed that the movement of the spot is not large on two consecutive images. The second method is using a Kalman filter, which has been widely employed in trajectory estimation problems. In our simulation study of various cases, Kalman filtering shows better results mostly. However, there is a problem of fitting the system model of the filter to the pattern of the spot movement.

Depth Map Enhancement and Up-sampling Techniques of 3D Images for the Smart Media (스마트미디어를 위한 입체 영상의 깊이맵 화질 향상 및 업샘플링 기술)

  • Jung, Jae-Il;Ho, Yo-Sung
    • Smart Media Journal
    • /
    • v.1 no.3
    • /
    • pp.22-28
    • /
    • 2012
  • As the smart media becomes more popular, the demand for high-quality 3D images and depth maps is increasing. However, performance of the current technologies to acquire depth maps is not sufficient. The depth maps from stereo matching methods have low accuracy in homogeneous regions. The depth maps from depth cameras are noisy and have low-resolution due to technical limitations. In this paper, we introduce the state-of-the-art algorithms for depth map enhancement and up-sampling from conventional methods using only depth maps to the latest algorithms referring to both depth maps and their corresponding color images. We also present depth map enhancement algorithms for hybrid camera systems in detail.

  • PDF

Development of A Vision-based Lane Detection System with Considering Sensor Configuration Aspect (센서 구성을 고려한 비전 기반 차선 감지 시스템 개발)

  • Park Jaehak;Hong Daegun;Huh Kunsoo;Park Jahnghyon;Cho Dongil
    • Transactions of the Korean Society of Automotive Engineers
    • /
    • v.13 no.4
    • /
    • pp.97-104
    • /
    • 2005
  • Vision-based lane sensing systems require accurate and robust sensing performance in lane detection. Besides, there exists trade-off between the computational burden and processor cost, which should be considered for implementing the systems in passenger cars. In this paper, a stereo vision-based lane detection system is developed with considering sensor configuration aspects. An inverse perspective mapping method is formulated based on the relative correspondence between the left and right cameras so that the 3-dimensional road geometry can be reconstructed in a robust manner. A new monitoring model for estimating the road geometry parameters is constructed to reduce the number of the measured signals. The selection of the sensor configuration and specifications is investigated by utilizing the characteristics of standard highways. Based on the sensor configurations, it is shown that appropriate sensing region on the camera image coordinate can be determined. The proposed system is implemented on a passenger car and verified experimentally.