• Title/Summary/Keyword: Binary Depth Image

Search Result 21, Processing Time 0.025 seconds

Surface Curvature Based 3D Pace Image Recognition Using Depth Weighted Hausdorff Distance (표면 곡률을 이용하여 깊이 가중치 Hausdorff 거리를 적용한 3차원 얼굴 영상 인식)

  • Lee Yeung hak;Shim Jae chang
    • Journal of Korea Multimedia Society
    • /
    • v.8 no.1
    • /
    • pp.34-45
    • /
    • 2005
  • In this paper, a novel implementation of a person verification system based on depth-weighted Hausdorff distance (DWHD) using the surface curvature of the face is proposed. The definition of Hausdorff distance is a measure of the correspondence of two point sets. The approach works by finding the nose tip that has a protrusion shape on the face. In feature recognition of 3D face image, one has to take into consideration the orientated frontal posture to normalize after extracting face area from original image. The binary images are extracted by using the threshold values for the curvature value of surface for the person which has differential depth and surface characteristic information. The proposed DWHD measure for comparing two pixel sets were used, because it is simple and robust. In the experimental results, the minimum curvature which has low pixel distribution achieves recognition rate of 98% among the proposed methods.

  • PDF

Spatial-temporal texture features for 3D human activity recognition using laser-based RGB-D videos

  • Ming, Yue;Wang, Guangchao;Hong, Xiaopeng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.11 no.3
    • /
    • pp.1595-1613
    • /
    • 2017
  • The IR camera and laser-based IR projector provide an effective solution for real-time collection of moving targets in RGB-D videos. Different from the traditional RGB videos, the captured depth videos are not affected by the illumination variation. In this paper, we propose a novel feature extraction framework to describe human activities based on the above optical video capturing method, namely spatial-temporal texture features for 3D human activity recognition. Spatial-temporal texture feature with depth information is insensitive to illumination and occlusions, and efficient for fine-motion description. The framework of our proposed algorithm begins with video acquisition based on laser projection, video preprocessing with visual background extraction and obtains spatial-temporal key images. Then, the texture features encoded from key images are used to generate discriminative features for human activity information. The experimental results based on the different databases and practical scenarios demonstrate the effectiveness of our proposed algorithm for the large-scale data sets.

Application of Image Processing Method to Evaluate Ultimate Strain of Rebar (철근의 한계상태변형률 평가를 위한 이미지 프로세싱의 적용)

  • Kim, Seong-Do;Jung, Chi-Young;Woo, Tae-Ryeon;Cheung, Jin-Hwan
    • Journal of the Korea institute for structural maintenance and inspection
    • /
    • v.20 no.3
    • /
    • pp.111-121
    • /
    • 2016
  • In this study, measurements were conducted by image processing to do an in-depth evaluation of strain of rebar in a uniaxial tension test. The distribution of strain and the necking region were evaluated. The image processing is used to analyze the color information of a colored image, so that the parts consistent with desired targets can be distinguished from the other parts. After this process, the image was converted to a binary one. Centroids of each target region are obtained in the binary images. After repeating such process on the images from starting point to the finishing point of the test, elongation between targets is calculated based on the centroid of each target. The tensile test were conducted on grade 60 #7(D22) and #9(D29) rebars fabricated in accordance with ASTM A615 standards. Strain results from image processing were compared to the results from a conventional strain gauge, in order to see the validity of the image processing. With the image processing, the measuring was possible in not only the initial elastic region but also the necking region of more than 0.5(50%) strain. The image processing can remove the measuring limits as long as the targets can be video recorded. It also can measure strain at various spots because the targets can easily be attached and detached. Thus it is concluded that the image processing helps overcome limits in strain measuring and will be used in various ways.

Extraction of an Effective Saliency Map for Stereoscopic Images using Texture Information and Color Contrast (색상 대비와 텍스처 정보를 이용한 효과적인 스테레오 영상 중요도 맵 추출)

  • Kim, Seong-Hyun;Kang, Hang-Bong
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.9
    • /
    • pp.1008-1018
    • /
    • 2015
  • In this paper, we propose a method that constructs a saliency map in which important regions are accurately specified and the colors of the regions are less influenced by the similar surrounding colors. Our method utilizes LBP(Local Binary Pattern) histogram information to compare and analyze texture information of surrounding regions in order to reduce the effect of color information. We extract the saliency of stereoscopic images by integrating a 2D saliency map with depth information of stereoscopic images. We then measure the distance between two different sizes of the LBP histograms that are generated from pixels. The distance we measure is texture difference between the surrounding regions. We then assign a saliency value according to the distance in LBP histogram. To evaluate our experimental results, we measure the F-measure compared to ground-truth by thresholding a saliency map at 0.8. The average F-Measure is 0.65 and our experimental results show improved performance in comparison with existing other saliency map extraction methods.

Stereoscopic Video Services for Terrestrial DMB (지상파 DMB를 위한 스테레오스코픽 영상 서비스)

  • Kim, Yong-Han
    • Journal of Broadcast Engineering
    • /
    • v.14 no.1
    • /
    • pp.85-88
    • /
    • 2009
  • Recently "DMB Video-Associated Stereoscopic Data Services" standard has been published by TTA. The standard enables DMB broadcasters to provide 3D or stereoscopic interactive data services based on MPEG-4 BIFS (Binary Format for Scenes). The purpose is to entice viewers to utilize DMB interactive data services more often by providing realistic and protrusive image objects overlaid on top of the main video in the background. This paper provides the background, technical analysis, and in-depth considerations for the standard. Also the results of standard verification are provided including the results of interoperability test with the existing terrestrial DMB receivers.

Stereo Matching Using Analog Neural Network (아날로그 신경 회로망을 이용한 스테레오 정합)

  • 도경훈;이준재;조석제;이왕국;하영호
    • Journal of the Korean Institute of Telematics and Electronics B
    • /
    • v.30B no.6
    • /
    • pp.59-66
    • /
    • 1993
  • Stereo vision is useful in obtaining three dimensional depth information from two images taken from different view points. Neural network modeling for stereo matching, the key step in stereo vision, is defined by an energy function satisfying with three constraints proposed by Marr and Poggio. Stereo matching is then carried out through the network to find minimum energy corresponding to the optimized solution of the problem. An algorithm for stereo matching using an analog neural network is presented here. The network can reduce errors in initial state an early iteration steps by adoption of continuous sigmoid function in stead of binary state. The experimental results show good matching performance for sparse random dot stereogram and real image.

  • PDF

Performance Evaluation of Underwater Acoustic Communication in Frequency Selective Shallow Water (주파수 선택적인 천해해역에서 수중음향통신 성능해석)

  • Park, Kyu-Chil;Park, Jihyun;Lee, Seung Wook;Jung, Jin Woo;Shin, Jungchae;Yoon, Jong Rak
    • The Journal of the Acoustical Society of Korea
    • /
    • v.32 no.2
    • /
    • pp.95-103
    • /
    • 2013
  • An underwater acoustic (UWA) communication in shallow water is strongly affected by the water surface and the seabed acoustical properties. Every reflected signal to receiver experiences a time-variant scattering in sea surface roughness and a grazing-angle-dependent reflection loss in bottom. Consequently, the performance of UWA communication systems is degraded, and high-speed digital communication is disrupted. If there is a dominant signal path such as a direct path, the received signal is modeled statistically as Rice fading but if not, it is modeled as Rayleigh fading. However, it has been known to be very difficult to reproduce the statistical estimation by real experimental evaluation in the sea. To give an insight for this scattering and grazing-angle-dependent bottom reflection loss effect in UWA communication, authors conduct experiments to quantify these effects. The image is transmitted using binary frequency shift keying (BFSK) modulation. The quality of the received image is shown to be affected by water surface scattering and grazing-angle-dependent bottom reflection loss. The analysis is based on the transmitter to receiver range and the receiver depth dependent image quality and bit error rate (BER). The results show that the received image quality is highly dependent on the transmitter-receiver range and receiver depth which characterizes the channel coherence bandwidth.

An Efficient Median Filter Algorithm for Floating-point Images (부동소수점 형식 이미지를 위한 효율적인 중간값 필터 알고리즘)

  • Kim, Jin Wook
    • Journal of IKEEE
    • /
    • v.26 no.2
    • /
    • pp.240-248
    • /
    • 2022
  • Floating-point images that express pixel information as real numbers are used in HDR images. There have been various researches on efficient median filter algorithms, but most of them are applicable to 8-bit depth images and there are only a few number of algorithms applicable to floating-point images, including Gil and Werman's algorithm. In this paper, we propose a median filter algorithm that works efficiently on floating-point images by improving Kim's algorithm, which improved Gil and Werman's algorithm. Experimental results show that the execution time is improved by about 10% compared to the Kim's algorithm by reducing the redundant work for the repetitively used binary search tree and applying the inverted index.

Recognition of Vehicle Number Plate Using Color Decomposition Method and Back Propagation Neural Network (색 분해법과 역전파 신경 회로망을 이용한 차량 번호판 인식)

  • 이재수;김수인;서춘원
    • Journal of the Korean Institute of Telematics and Electronics T
    • /
    • v.35T no.3
    • /
    • pp.46-52
    • /
    • 1998
  • In this paper, after inputting the computer with the attached number plate on the vehicle, using it, the color decomposition method and back propagation neural network proposed the extractable method of the vehicle number plate at high speed. This method separated R, G, B signal form input moving vehicle image to computer through video camera, then after transform this R, G, B signal into input image data of the computer by using color depth of vehicle number plate and store up binary value in the memory frame buffer. After adapting character's recognition algorithm, also improving this, by adapting back propagation neural network makes the vehicle number plate recognition system. Also minimalizing the similar color's confusion, adapting horizontal and vertical extracting algorithm by using the vehicle's rectangular architecture shows the extract and character's recognition of the vehicle number plate at high speed.

  • PDF

Computer Vision Approach for Phenotypic Characterization of Horticultural Crops (컴퓨터 비전을 활용한 토마토, 파프리카, 멜론 및 오이 작물의 표현형 특성화)

  • Seungri Yoon;Minju Shin;Jin Hyun Kim;Ho Jeong Jeong;Junyoung Park;Tae In Ahn
    • Journal of Bio-Environment Control
    • /
    • v.33 no.1
    • /
    • pp.63-70
    • /
    • 2024
  • This study explored computer vision methods using the OpenCV open-source library to characterize the phenotypes of various horticultural crops. In the case of tomatoes, image color was examined to assess ripeness, while support vector machine (SVM) and histogram of oriented gradients (HOG) methods effectively identified ripe tomatoes. For sweet pepper, we visualized the color distribution and used the Gaussian mixture model for clustering to analyze its post-harvest color characteristics. For the quality assessment of netted melons, the LAB (lightness, a, b) color space, binary images, and depth mapping were used to measure the net patterns of the melon. In addition, a combination of depth and color data proved successful in identifying flowers of different sizes and distances in cucumber greenhouses. This study highlights the effectiveness of these computer vision strategies in monitoring the growth and development, ripening, and quality assessment of fruits and vegetables. For broader applications in agriculture, future researchers and developers should enhance these techniques with plant physiological indicators to promote their adoption in both research and practical agricultural settings.