• Title/Summary/Keyword: 3차원 객체 검출

Search Result 53, Processing Time 0.029 seconds

Deep Learning based Vehicle AR Manual for Improving User Experience (사용자 경험 향상을 위한 딥러닝 기반 차량용 AR 매뉴얼)

  • Lee, Jeong-Min;Kim, Jun-Hak;Seok, Jung-Won;Park, Jinho
    • Journal of the Korea Computer Graphics Society
    • /
    • v.28 no.3
    • /
    • pp.125-134
    • /
    • 2022
  • This paper implements an AR manual for a vehicle that can be used even in the vehicle interior space where it is difficult to apply the augmentation method of AR content, which is mainly used, and applies a deep learning model to improve the augmentation matching between real space and virtual objects. Through deep learning, the logo of the steering wheel is recognized regardless of the position, angle, and inclination, and 3D interior space coordinates are generated based on this, and the virtual button is precisely augmented on the actual vehicle parts. Based on the same learning model, the function to recognize the main warning light symbols of the vehicle is also implemented to increase the functionality and usability as an AR manual for vehicles.

Biomimetic approach object detection sensors using multiple imaging (다중 영상을 이용한 생체모방형 물체 접근 감지 센서)

  • Choi, Myoung Hoon;Kim, Min;Jeong, Jae-Hoon;Park, Won-Hyeon;Lee, Dong Heon;Byun, Gi-Sik;Kim, Gwan-Hyung
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2016.05a
    • /
    • pp.91-93
    • /
    • 2016
  • From the 2-D image extracting three-dimensional information as the latter is in the bilateral sibeop using two camera method and when using a monocular camera as a very important step generally as "stereo vision". There in today's CCTV and automatic object tracking system used in many medium much to know the site conditions or work developed more clearly by using a stereo camera that mimics the eyes of humans to maximize the efficiency of avoidance / control start and multiple jobs can do. Object tracking system of the existing 2D image will have but can not recognize the distance to the transition could not be recognized by the observer display using a parallax of a stereo image, and the object can be more effectively controlled.

  • PDF

Application of Digital Photogrammetry for The Automatic Extraction of Road Information (도로정보의 자동추출을 위한 수치사진측량기법의 적용)

  • 유환희
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.12 no.1
    • /
    • pp.89-94
    • /
    • 1994
  • A number of the latest research projects focus on the development of real-time mapping system. Typically, these devices are used to capture land-related information in digital form from airplanes or cars. The purpose of this paper is to automatically extract the road information from the digital images obtained using the so-called "GPS-Van" which has been developed by Center for Mapping at The Ohio State University, and to propose the method for the effective storage and management of the digital data. The edges of a road can be extracted from the digital image and determined real-time 3-dimensional position by digital photogrammetry. Also, the three storage level which consists of raster data level, object-oriented data level, and vector data level in the data storage and Quadtree data structure for the effective compression and search in the data management was proposed in this paper.his paper.

  • PDF

Depthmap Generation with Registration of LIDAR and Color Images with Different Field-of-View (다른 화각을 가진 라이다와 칼라 영상 정보의 정합 및 깊이맵 생성)

  • Choi, Jaehoon;Lee, Deokwoo
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.21 no.6
    • /
    • pp.28-34
    • /
    • 2020
  • This paper proposes an approach to the fusion of two heterogeneous sensors with two different fields-of-view (FOV): LIDAR and an RGB camera. Registration between data captured by LIDAR and an RGB camera provided the fusion results. Registration was completed once a depthmap corresponding to a 2-dimensional RGB image was generated. For this fusion, RPLIDAR-A3 (manufactured by Slamtec) and a general digital camera were used to acquire depth and image data, respectively. LIDAR sensor provided distance information between the sensor and objects in a scene nearby the sensor, and an RGB camera provided a 2-dimensional image with color information. Fusion of 2D image and depth information enabled us to achieve better performance with applications of object detection and tracking. For instance, automatic driver assistance systems, robotics or other systems that require visual information processing might find the work in this paper useful. Since the LIDAR only provides depth value, processing and generation of a depthmap that corresponds to an RGB image is recommended. To validate the proposed approach, experimental results are provided.

Development of Motion Recognition Platform Using Smart-Phone Tracking and Color Communication (스마트 폰 추적 및 색상 통신을 이용한 동작인식 플랫폼 개발)

  • Oh, Byung-Hun
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.17 no.5
    • /
    • pp.143-150
    • /
    • 2017
  • In this paper, we propose a novel motion recognition platform using smart-phone tracking and color communication. The interface requires only a camera and a personal smart-phone to provide a motion control interface rather than expensive equipment. The platform recognizes the user's gestures by the tracking 3D distance and the rotation angle of the smart-phone, which acts essentially as a motion controller in the user's hand. Also, a color coded communication method using RGB color combinations is included within the interface. Users can conveniently send or receive any text data through this function, and the data can be transferred continuously even while the user is performing gestures. We present the result that implementation of viable contents based on the proposed motion recognition platform.

An Implementation of a Feature Extraction Hardware Accelerator based on Memory Usage Improvement SURF Algorithm (메모리 사용률을 개선한 SURF 알고리즘 특징점 추출기의 하드웨어 가속기 설계)

  • Jung, Chang-min;Kwak, Jae-chang;Lee, Kwang-yeob
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2013.10a
    • /
    • pp.77-80
    • /
    • 2013
  • SURF algorithm is an algorithm to extract feature points and to generate descriptors from input images. It is robust to change of environment such as scale, rotation, illumination and view points. Because of these features, it is used for many image processing applications such as object recognition, constructing panorama pictures and 3D image restoration. But there is disadvantage for real time operation because many recognition algorithms such as SURF algorithm requires a lot of calculations. In this paper, we propose a design of feature extractor and descriptor generator based on SURF for high memory efficiency. The proposed design reduced a memory access and memory usage to operate in real time.

  • PDF

Automatic Liver Segmentation Method on MR Images using Normalized Gradient Magnitude Image (MR 영상에서 정규화된 기울기 크기 영상을 이용한 자동 간 분할 기법)

  • Lee, Jeong-Jin;Kim, Kyoung-Won;Lee, Ho
    • Journal of Korea Multimedia Society
    • /
    • v.13 no.11
    • /
    • pp.1698-1705
    • /
    • 2010
  • In this paper, we propose a fast liver segmentation method from magnetic resonance(MR) images. Our method efficiently divides a MR image into a set of discrete objects, and boundaries based on the normalized gradient magnitude information. Then, the objects belonging to the liver are detected by using 2D seeded region growing with seed points, which are extracted from the segmented liver region of the slice immediately above or below the current slice. Finally, rolling ball algorithm, and connected component analysis minimizes false positive error near the liver boundaries. Our method was validated by twenty data sets and the results were compared with the manually segmented result. The average volumetric overlap error was 5.2%, and average absolute volumetric measurement error was 1.9%. The average processing time for segmenting one data set was about three seconds. Our method could be used for computer-aided liver diagnosis, which requires a fast and accurate segmentation of liver.

Volume Measurement Method for Object on Pixel Area Basis through Depth Image (깊이 영상을 통한 화소 단위 물체 부피 측정 방법)

  • Ji-hwan Kim;Soon-kak Kwon
    • Journal of Korea Society of Industrial Information Systems
    • /
    • v.29 no.1
    • /
    • pp.125-133
    • /
    • 2024
  • In this paper, we propose a volume measurement method for an object based on depth image. The object volume is measured by calculating the object height and width in actual units through the depth image. The object area is detected through differences between the captured and background depth images. The volume of the 2×2 pixel area, formed by four adjacent pixels using the depth information associated with each pixel, is measured. The object volume is measured as the sum of the volumes for whole 2×2 areas in the object area. In simulation results, the average measurement error for the object volume is 2.1% when the distance from the camera is 60cm.

Data Augmentation for Tomato Detection and Pose Estimation (토마토 위치 및 자세 추정을 위한 데이터 증대기법)

  • Jang, Minho;Hwang, Youngbae
    • Journal of Broadcast Engineering
    • /
    • v.27 no.1
    • /
    • pp.44-55
    • /
    • 2022
  • In order to automatically provide information on fruits in agricultural related broadcasting contents, instance image segmentation of target fruits is required. In addition, the information on the 3D pose of the corresponding fruit may be meaningfully used. This paper represents research that provides information about tomatoes in video content. A large amount of data is required to learn the instance segmentation, but it is difficult to obtain sufficient training data. Therefore, the training data is generated through a data augmentation technique based on a small amount of real images. Compared to the result using only the real images, it is shown that the detection performance is improved as a result of learning through the synthesized image created by separating the foreground and background. As a result of learning augmented images using images created using conventional image pre-processing techniques, it was shown that higher performance was obtained than synthetic images in which foreground and background were separated. To estimate the pose from the result of object detection, a point cloud was obtained using an RGB-D camera. Then, cylinder fitting based on least square minimization is performed, and the tomato pose is estimated through the axial direction of the cylinder. We show that the results of detection, instance image segmentation, and cylinder fitting of a target object effectively through various experiments.

Interval Hough Transform For Prominent Line Detection (배경선 추출을 위한 구간 허프 변환)

  • Choi, Jin-Mo;Kim, Changick
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.11
    • /
    • pp.1288-1296
    • /
    • 2013
  • The prominent line at the singe image is the important fact for understanding spatial structure or estimating aesthetic scoring. According to this thesis, the abstraction of the background line helps analyzing vanishing point, reconstitution of 3 dimensions, and determining of image sloppiness. It also makes easy to calculate the rule of thirds. This thesis is composed of section hough transform mapping, prioritizing of the prominent line, and selection of the prominent line. These technologies are departmentalized to be applied abstraction of traffic lane, analyzing of building structure, abstraction of vanishing point, and abstraction of straight line documentation. This gives the choice that users are able to compose technology by considering characteristic of objects and luminous environment. This thesis also can be applied to abstract circle. The interval hough transform is able to select the number of prominent line which users want to abstract. It can analyze important prominent line numbers at the image and then abstract the lines, too. Results of prominent lines by experiments would be show at this thesis.