• Title/Summary/Keyword: RGB-D images

Search Result 109, Processing Time 0.025 seconds

Optimal Combination of Component Images for Segmentation of Color Codes (칼라 코드의 영역 분할을 위한 성분 영상들의 최적 조합)

  • Kwon B. H;Yoo H-J.;Kim T. W.;Kim K D.
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.42 no.1
    • /
    • pp.33-42
    • /
    • 2005
  • Identifying color codes needs precise color information of their constituents, and is far from trivial because colors usually suffer severe distortions throughout the entire procedures from printing to acquiring image data. To accomplish accurate identification of colors, we need a reliable segmentation method to separate different color regions from each other, which would enable us to process the whole pixels in the region of a color statistically, instead of a subset of pixels in the region. Color image segmentation can be accomplished by performing edge detection on component image(s). In this paper, we separately detected edges on component images from RGB, HSI, and YIQ color models, and performed mathematical analyses and experiments to find out a pair of component images that provided the best edge image when combined. The best result was obtained by combining Y- and R-component edge images.

360 RGBD Image Synthesis from a Sparse Set of Images with Narrow Field-of-View (소수의 협소화각 RGBD 영상으로부터 360 RGBD 영상 합성)

  • Kim, Soojie;Park, In Kyu
    • Journal of Broadcast Engineering
    • /
    • v.27 no.4
    • /
    • pp.487-498
    • /
    • 2022
  • Depth map is an image that contains distance information in 3D space on a 2D plane and is used in various 3D vision tasks. Many existing depth estimation studies mainly use narrow FoV images, in which a significant portion of the entire scene is lost. In this paper, we propose a technique for generating 360° omnidirectional RGBD images from a sparse set of narrow FoV images. The proposed generative adversarial network based image generation model estimates the relative FoV for the entire panoramic image from a small number of non-overlapping images and produces a 360° RGB and depth image simultaneously. In addition, it shows improved performance by configuring a network reflecting the spherical characteristics of the 360° image.

3D Image Processing for Recognition and Size Estimation of the Fruit of Plum(Japanese Apricot) (3D 영상을 활용한 매실 인식 및 크기 추정)

  • Jang, Eun-Chae;Park, Seong-Jin;Park, Woo-Jun;Bae, Yeonghwan;Kim, Hyuck-Joo
    • The Journal of the Korea Contents Association
    • /
    • v.21 no.2
    • /
    • pp.130-139
    • /
    • 2021
  • In this study, size of the fruit of Japanese apricot (plum) was estimated through a plum recognition and size estimation program using 3D images in order to control the Eurytoma maslovskii that causes the most damage to plum in a timely manner. In 2018, night shooting was carried out using a Kinect 2.0 Camera. For night shooting in 2019, a RealSense Depth Camera D415 was used. Based on the acquired images, a plum recognition and estimation program consisting of four stages of image preprocessing, sizeable plum extraction, RGB and depth image matching and plum size estimation was implemented using MATLAB R2018a. The results obtained by running the program on 10 images produced an average plum recognition error rate of 61.9%, an average plum recognition error rate of 0.5% and an average size measurement error rate of 3.6%. The continued development of these plum recognition and size estimation programs is expected to enable accurate fruit size monitoring in the future and the development of timely control systems for Eurytoma maslovskii.

Color assessment of resin composite by using cellphone images compared with a spectrophotometer

  • Rafaella Mariana Fontes de Braganca;Rafael Ratto Moraes ;Andre Luis Faria-e-Silva
    • Restorative Dentistry and Endodontics
    • /
    • v.46 no.2
    • /
    • pp.23.1-23.11
    • /
    • 2021
  • Objectives: This study assessed the reliability of digital color measurements using images of resin composite specimens captured with a cellphone. Materials and Methods: The reference color of cylindrical specimens built-up with the use of resin composite (shades A1, A2, A3, and A4) was measured with a portable spectrophotometer (CIELab). Images of the specimens were obtained individually or pairwise (compared shades in the same photograph) under standardized parameters. The color of the specimens was measured in the images using RGB system and converted to CIELab system using image processing software. Whiteness index (WID) and color differences (ΔE00) were calculated for each color measurement method. For the cellphone, the ΔE00 was calculated between the pairs of shades in separate images and in the same image. Data were analyzed using 2-way repeated-measures analysis of variance (α = 0.05). Linear regression models were used to predict the reference ΔE00 values of those calculated using color measured in the images. Results: Images captured with the cellphone resulted in different WID values from the spectrophotometer only for shades A3 and A4. No difference to the reference ΔE00 was observed when individual images were used. In general, a similar ranking of ΔE00 among resin composite shades was observed for all methods. Stronger correlation coefficients with the reference ΔE00 were observed using individual than pairwise images. Conclusions: This study showed that the use of cellphone images to measure the color difference seems to be a feasible alternative providing outcomes similar to those obtained with the spectrophotometer.

Transformation of Stereoscopic Images for 3D Perception Improvement (입체영상의 3D 증강을 위한 입체영상 변환)

  • Gil, Jong-In;Kim, Manbae
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2012.07a
    • /
    • pp.325-327
    • /
    • 2012
  • 최근 국내외 디지털 가전 업체들은 다양한 3D 기술을 앞세워 가정 내에서도 편하게 즐길 수 잇도록 다양한 3DTV를 출시하고 있다. 이러한 3DTV에서 입체영상을 시청하기 위해서는 입체콘텐츠가 제작되어 전송되어야 한다[1]. 이러한 입체 콘텐츠는 RGB 영상과 깊이맵을 이용하여 생성할 수 있는데, 이때 깊이맵은 사용자의 용도에 따라 다양한 형태로 변환될 수 있다. 최근엔 이러한 깊이맵과 3D 영상의 컬러를 변환하여 지각 깊이감을 개선하는 영상처리 기술에 대한 관심이 높아지고 있다. 이에 따라, 본 논문에서는 기존의 컬러 변환을 통한 2D 영상의 지각 깊이감 개선을 입체영상에 적용하여, 3D 지각 입체감을 동시에 향상시키는 방법을 제안한다. 이를 위해 대조 변환 및 배경 다크닝 방법을 제안하고, 실험을 통해 제안 방법이 상기 목적을 얻을 수 있는 것을 검증하였다.

  • PDF

Data Augmentation for Tomato Detection and Pose Estimation (토마토 위치 및 자세 추정을 위한 데이터 증대기법)

  • Jang, Minho;Hwang, Youngbae
    • Journal of Broadcast Engineering
    • /
    • v.27 no.1
    • /
    • pp.44-55
    • /
    • 2022
  • In order to automatically provide information on fruits in agricultural related broadcasting contents, instance image segmentation of target fruits is required. In addition, the information on the 3D pose of the corresponding fruit may be meaningfully used. This paper represents research that provides information about tomatoes in video content. A large amount of data is required to learn the instance segmentation, but it is difficult to obtain sufficient training data. Therefore, the training data is generated through a data augmentation technique based on a small amount of real images. Compared to the result using only the real images, it is shown that the detection performance is improved as a result of learning through the synthesized image created by separating the foreground and background. As a result of learning augmented images using images created using conventional image pre-processing techniques, it was shown that higher performance was obtained than synthetic images in which foreground and background were separated. To estimate the pose from the result of object detection, a point cloud was obtained using an RGB-D camera. Then, cylinder fitting based on least square minimization is performed, and the tomato pose is estimated through the axial direction of the cylinder. We show that the results of detection, instance image segmentation, and cylinder fitting of a target object effectively through various experiments.

Development of a Single-Arm Robotic System for Unloading Boxes in Cargo Truck (간선화물의 상자 하차를 위한 외팔 로봇 시스템 개발)

  • Jung, Eui-Jung;Park, Sungho;Kang, Jin Kyu;Son, So Eun;Cho, Gun Rae;Lee, Youngho
    • The Journal of Korea Robotics Society
    • /
    • v.17 no.4
    • /
    • pp.417-424
    • /
    • 2022
  • In this paper, the developed trunk cargo unloading automation system is introduced, and the RGB-D sensor-based box loading situation recognition method and unloading plan applied to this system are suggested. First of all, it is necessary to recognize the position of the box in a truck. To do this, we first apply CNN-based YOLO, which can recognize objects in RGB images in real-time. Then, the normal vector of the center of the box is obtained using the depth image to reduce misrecognition in parts other than the box, and the inner wall of the truck in an image is removed. And a method of classifying the layers of the boxes according to the distance using the recognized depth information of the boxes is suggested. Given the coordinates of the boxes on the nearest layer, a method of generating the optimal path to take out the boxes the fastest using this information is introduced. In addition, kinematic analysis is performed to move the conveyor to the position of the box to be taken out of the truck, and kinematic analysis is also performed to control the robot arm that takes out the boxes. Finally, the effectiveness of the developed system and algorithm through a test bed is proved.

Development of a Reliable Real-time 3D Reconstruction System for Tiny Defects on Steel Surfaces (금속 표면 미세 결함에 대한 신뢰성 있는 실시간 3차원 형상 추출 시스템 개발)

  • Jang, Yu Jin;Lee, Joo Seob
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.19 no.12
    • /
    • pp.1061-1066
    • /
    • 2013
  • In the steel industry, the detection of tiny defects including its 3D characteristics on steel surfaces is very important from the point of view of quality control. A multi-spectral photometric stereo method is an attractive scheme because the shape of the defect can be obtained based on the images which are acquired at the same time by using a multi-channel camera. Moreover, the calculation time required for this scheme can be greatly reduced for real-time application with the aid of a GPU (Graphic Processing Unit). Although a more reliable shape reconstruction of defects can be possible when the numbers of available images are increased, it is not an easy task to construct a camera system which has more than 3 channels in the visible light range. In this paper, a new 6-channel camera system, which can distinguish the vertical/horizontal linearly polarized lights of RGB light sources, was developed by adopting two 3-CCD cameras and two polarized lenses based on the fact that the polarized light is preserved on the steel surface. The photometric stereo scheme with 6 images was accelerated by using a GPU, and the performance of the proposed system was validated through experiments.

Development and Performance Evaluation of an Image Detection System for Efficient 4D Images (효율적인 4D 영상을 위한 영상 검출 시스템 개발 및 성능평가)

  • Cho, Kyoung-Woo;Liu, Ze-Qi;Jeon, Min-Ho;Oh, Chang-Heon
    • Journal of Advanced Navigation Technology
    • /
    • v.17 no.6
    • /
    • pp.792-797
    • /
    • 2013
  • 4D film is just a film that made by adding some physical effects to 3D film or general film. In order to provide physical effects to the audience, the data that make the physical effect must be added to each frames. In this paper, we proposed a video detection system that can efficiently provide physical effects by assessing the present situation such as explosion scene, snowing scene. The proposed video detection system contains an algorithm for fire detection by using R color and $C_r$ value, and also an algorithm for snow detection by using RGB color model. The system constitutes in a MCU that from 8051 family. In the performance evaluations, the result shows that 91% of detection rate in case of fire and 25% of false detection rate in case of snow. Also the system is capable of providing physical effects automatically.

Predicting Unseen Object Pose with an Adaptive Depth Estimator (적응형 깊이 추정기를 이용한 미지 물체의 자세 예측)

  • Sungho, Song;Incheol, Kim
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.11 no.12
    • /
    • pp.509-516
    • /
    • 2022
  • Accurate pose prediction of objects in 3D space is an important visual recognition technique widely used in many applications such as scene understanding in both indoor and outdoor environments, robotic object manipulation, autonomous driving, and augmented reality. Most previous works for object pose estimation have the limitation that they require an exact 3D CAD model for each object. Unlike such previous works, this paper proposes a novel neural network model that can predict the poses of unknown objects based on only their RGB color images without the corresponding 3D CAD models. The proposed model can obtain depth maps required for unknown object pose prediction by using an adaptive depth estimator, AdaBins,. In this paper, we evaluate the usefulness and the performance of the proposed model through experiments using benchmark datasets.