• Title/Summary/Keyword: RGB-D images

Search Result 109, Processing Time 0.023 seconds

Pose Recognition of Soccer Players for Three Dimensional Animation (방송 축구 영상으로부터 3차원 애니메이션 변환을 위한 축구 선수 동작 인식)

  • 장원철;남시욱;김재희
    • Proceedings of the IEEK Conference
    • /
    • 2000.11d
    • /
    • pp.33-36
    • /
    • 2000
  • To create a more realistic soccer game derived from TV images, we are developing an image synthesis system that generates 3D image sequence from TV images. We propose the method for the team and the pose recognition of players in TV images. The representation includes camera calibration method, team recognition method and pose recognition method. To find the location of a player on the field, a field model is constructed and a player's field position is transformed by 4-feature points. To recognize the team information of players, we compute RGB mean values and standard deviations of a player in TV images. Finally, to recognize pose of a player, this system computes the velocity and the ratio of player(height/width). Experimental results are included to evaluate the performance of the team and the pose recognition.

  • PDF

Face Detection Method based Fusion RetinaNet using RGB-D Image (RGB-D 영상을 이용한 Fusion RetinaNet 기반 얼굴 검출 방법)

  • Nam, Eun-Jeong;Nam, Chung-Hyeon;Jang, Kyung-Sik
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.26 no.4
    • /
    • pp.519-525
    • /
    • 2022
  • The face detection task of detecting a person's face in an image is used as a preprocess or core process in various image processing-based applications. The neural network models, which have recently been performing well with the development of deep learning, are dependent on 2D images, so if noise occurs in the image, such as poor camera quality or pool focus of the face, the face may not be detected properly. In this paper, we propose a face detection method that uses depth information together to reduce the dependence of 2D images. The proposed model was trained after generating and preprocessing depth information in advance using face detection dataset, and as a result, it was confirmed that the FRN model was 89.16%, which was about 1.2% better than the RetinaNet model, which showed 87.95%.

Test of Fault Detection to Solar-Light Module Using UAV Based Thermal Infrared Camera (UAV 기반 열적외선 카메라를 이용한 태양광 모듈 고장진단 실험)

  • LEE, Geun-Sang;LEE, Jong-Jo
    • Journal of the Korean Association of Geographic Information Studies
    • /
    • v.19 no.4
    • /
    • pp.106-117
    • /
    • 2016
  • Recently, solar power plants have spread widely as part of the transition to greater environmental protection and renewable energy. Therefore, regular solar plant inspection is necessary to efficiently manage solar-light modules. This study implemented a test that can detect solar-light module faults using an UAV based thermal infrared camera and GIS spatial analysis. First, images were taken using fixed UAV and an RGB camera, then orthomosaic images were created using Pix4D SW. We constructed solar-light module layers from the orthomosaic images and inputted the module layer code. Rubber covers were installed in the solar-light module to detect solar-light module faults. The mean temperature of each solar-light module can be calculated using the Zonalmean function based on temperature information from the UAV thermal camera and solar-light module layer. Finally, locations of solar-light modules of more than $37^{\circ}C$ and those with rubber covers can be extracted automatically using GIS spatial analysis and analyzed specifically using the solar-light module's identifying code.

Robust Estimation of Hand Poses Based on Learning (학습을 이용한 손 자세의 강인한 추정)

  • Kim, Sul-Ho;Jang, Seok-Woo;Kim, Gye-Young
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.23 no.12
    • /
    • pp.1528-1534
    • /
    • 2019
  • Recently, due to the popularization of 3D depth cameras, new researches and opportunities have been made in research conducted on RGB images, but estimation of human hand pose is still classified as one of the difficult topics. In this paper, we propose a robust estimation method of human hand pose from various input 3D depth images using a learning algorithm. The proposed approach first generates a skeleton-based hand model and then aligns the generated hand model with three-dimensional point cloud data. Then, using a random forest-based learning algorithm, the hand pose is strongly estimated from the aligned hand model. Experimental results in this paper show that the proposed hierarchical approach makes robust and fast estimation of human hand posture from input depth images captured in various indoor and outdoor environments.

Dense RGB-D Map-Based Human Tracking and Activity Recognition using Skin Joints Features and Self-Organizing Map

  • Farooq, Adnan;Jalal, Ahmad;Kamal, Shaharyar
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.9 no.5
    • /
    • pp.1856-1869
    • /
    • 2015
  • This paper addresses the issues of 3D human activity detection, tracking and recognition from RGB-D video sequences using a feature structured framework. During human tracking and activity recognition, initially, dense depth images are captured using depth camera. In order to track human silhouettes, we considered spatial/temporal continuity, constraints of human motion information and compute centroids of each activity based on chain coding mechanism and centroids point extraction. In body skin joints features, we estimate human body skin color to identify human body parts (i.e., head, hands, and feet) likely to extract joint points information. These joints points are further processed as feature extraction process including distance position features and centroid distance features. Lastly, self-organized maps are used to recognize different activities. Experimental results demonstrate that the proposed method is reliable and efficient in recognizing human poses at different realistic scenes. The proposed system should be applicable to different consumer application systems such as healthcare system, video surveillance system and indoor monitoring systems which track and recognize different activities of multiple users.

Color Image Quantization Using Local Region Block in RGB Space (RGB 공간상의 국부 영역 블럭을 이용한 칼라 영상 양자화)

  • 박양우;이응주;김기석;정인갑;하영호
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1995.06a
    • /
    • pp.83-86
    • /
    • 1995
  • Many image display devices allow only a limited number of colors to be simultaneously displayed. In displaying of natural color image using color palette, it is necessary to construct an optimal color palette and map each pixel of the original image to a color palette with fast. In this paper, we proposed the clustering algorithm using local region block centered one color cluster in the prequantized 3-D histogram. Cluster pairs which have the least distortion error are merged by considering distortion measure. The clustering process is continued until to obtain the desired number of colors. Same as the clustering process, original color image is mapped to palette color via a local region block centering around prequantized original color value. The proposed algorithm incorporated with a spatial activity weighting value which is smoothing region. The method produces high quality display images and considerably reduces computation time.

Color image quantization considering distortion measure of local region block on RGB space (RGB 공간상의 국부 영역 블록의 왜곡척도를 고려한 칼라 영상 양자화)

  • 박양우;이응주;김경만;엄태억;하영호
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.21 no.4
    • /
    • pp.848-854
    • /
    • 1996
  • Many image display devices allow only a limited number of colors to be simultaneously displayed. in disphaying of natural color image using color palette, it is necessary to construct an optimal color palette and the optimal mapping of each pixed of the original image to a color from the palette. In this paper, we proposed the clustering algorithm using local region block centered one color cluster in the prequantized 3-D histogram. Cluster pairs which have the least distortion error are merged by considering distortion measure. The clustering process is continued until to obtain the desired number of colors. The same as the clustering process, original color value. The proposed algorithm incroporated with a spatial activity weighting value which is reflected sensitivity of HVS quantization errors in smoothing region. This method produces high quality display images and considerably reduces computation time.

  • PDF

2D to 3D Anaglyph Image Conversion using Linear Curve in HTML5 (HTML5에서 직선의 기울기를 이용한 2D to 3D 입체 이미지 변환)

  • Park, Young Soo
    • Journal of Digital Convergence
    • /
    • v.12 no.12
    • /
    • pp.521-528
    • /
    • 2014
  • In this paper, we propose the method of converting 2D image to 3D image using linear curves in HTML5. We use only one image without any other information about depth map for creating 3D images. So we filter the original image to extract RGB colors for left and right eyes. After selecting the ready-made control point of linear curves to set up depth values, users can set up the depth values and modify them. Based on the depth values that the end users select, we reflect them. Anaglyph 3D is automatically made with the whole and partial depth information. As all of this work has been designed and implemented in Web environment using HTML5, it is very easy and convenient and end users can create any 3D image that they want to make.

3D Position Tracking for Moving objects using Stereo CCD Cameras (스테레오 CCD 카메라를 이용한 이동체의 실시간 3차원 위치추적)

  • Kwon, Hyuk-Jong;Bae, Sang-Keun;Kim, Byung-Guk
    • Spatial Information Research
    • /
    • v.13 no.2 s.33
    • /
    • pp.129-138
    • /
    • 2005
  • In this paper, a 3D position tracking algorithm for a moving objects using a stereo CCD cameras was proposed. This paper purposed the method to extract the coordinates of the moving objects. That is improve the operating and data processing efficiency. We were applied the relative orientation far the stereo CCD cameras and image coordinates extraction in the left and right images after the moving object segmentation. Also, it is decided on 3D position far moving objects using an acquired image coordinates in the left and right images. We were used independent relative orientation to decide the relative location and attitude of the stereo CCD cameras and RGB pixel values to segment the moving objects. To calculate the coordinates of the moving objects by space intersection. And, We conducted the experiment the system and compared the accuracy of the results.

  • PDF

Region Extraction of License Plates in Noise Environment Using YUV Color Space Convert (YUV컬러 공간변환에 의한 잡음환경의 차량번호판 영역추출)

  • Kim Jae-Nam;Choi Tae-Il;Kim Byung-Ki
    • The KIPS Transactions:PartD
    • /
    • v.13D no.1 s.104
    • /
    • pp.125-132
    • /
    • 2006
  • The existing recognition system of license plates cannot get the satisfactory result in noise environments. The purpose of this paper is to propose an algorithm that can recognize the region of license plates accurately in a noise environment. The algorithm is formulated by reorganizing the U- and V-channels of YUV color space as YUV is insensitive to light and carries less data than RGB color information. The region of license plates has been extracted by the geometric characteristics, sizes, and places of labeling images. The proposed algorithm was found to improve the process of extracting the region of license plates in various noise environments.