• Title/Summary/Keyword: single-image detection

Search Result 357, Processing Time 0.023 seconds

Algorithm for improving the position of vanishing point using multiple images and homography matrix (다중 영상과 호모그래피 행렬을 이용한 소실점 위치 향상 알고리즘)

  • Lee, Chang-Hyung;Choi, Hyung-Il
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.1
    • /
    • pp.477-483
    • /
    • 2019
  • In this paper, we propose vanishing-point position-improvement algorithms by using multiple images and a homography matrix. Vanishing points can be detected from a single image, but the positions of detected vanishing points can be improved if we adjust their positions by using information from multiple images. More accurate indoor space information detection is possible through vanishing points with improved positional accuracy. To adjust a position, we take three images and detect the information, detect the homography matrix between the walls of the images, and convert the vanishing point positions using the detected homography. Finally, we find an optimal position among the converted vanishing points and improve the vanishing point position. The experimental results compared an existing algorithm and the proposed algorithm. With the proposed algorithm, we confirmed that the error angle to the vanishing point position was reduced by about 1.62%, and more accurate vanishing point detection was possible. In addition, we can confirm that the layout detected by using improved vanishing points through the proposed algorithm is more accurate than the result from the existing algorithm.

Design of YOLO-based Removable System for Pet Monitoring (반려동물 모니터링을 위한 YOLO 기반의 이동식 시스템 설계)

  • Lee, Min-Hye;Kang, Jun-Young;Lim, Soon-Ja
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.1
    • /
    • pp.22-27
    • /
    • 2020
  • Recently, as the number of households raising pets increases due to the increase of single households, there is a need for a system for monitoring the status or behavior of pets. There are regional limitations in the monitoring of pets using domestic CCTVs, which requires a large number of CCTVs or restricts the behavior of pets. In this paper, we propose a mobile system for detecting and tracking cats using deep learning to solve the regional limitations of pet monitoring. We use YOLO (You Look Only Once), an object detection neural network model, to learn the characteristics of pets and apply them to Raspberry Pi to track objects detected in an image. We have designed a mobile monitoring system that connects Raspberry Pi and a laptop via wireless LAN and can check the movement and condition of cats in real time.

Daily adaptive proton therapy: Feasibility study of detection of tumor variations based on tomographic imaging of prompt gamma emission from proton-boron fusion reaction

  • Choi, Min-Geon;Law, Martin;Djeng, Shin-Kien;Kim, Moo-Sub;Shin, Han-Back;Choe, Bo-Young;Yoon, Do-Kun;Suh, Tae Suk
    • Nuclear Engineering and Technology
    • /
    • v.54 no.8
    • /
    • pp.3006-3016
    • /
    • 2022
  • In this study, the images of specific prompt gamma (PG)-rays of 719 keV emitted from proton-boron reactions were analyzed using single-photon emission computed tomography (SPECT). Quantitative evaluation of the images verified the detection of anatomical changes in tumors, one of the important factors in daily adaptive proton therapy (DAPT) and verified the possibility of application of the PG-ray images to DAPT. Six scenarios were considered based on various sizes and locations compared to the reference virtual tumor to observe the anatomical alterations in the virtual tumor. Subsequently, PG-rays SPECT images were acquired using the modified ordered subset expectation-maximization algorithm, and these were evaluated using quantitative analysis methods. The results confirmed that the pixel range and location of the highest value of the normalized pixel in the PG-rays SPECT image profile changed according to the size and location of the virtual tumor. Moreover, the alterations in the virtual tumor size and location in the PG-rays SPECT images were similar to the true size and location alterations set in the phantom. Based on the above results, the tumor anatomical alterations in DAPT could be adequately detected and verified through SPECT imaging using the 719 keV PG-rays acquired during treatment.

Machine Parts(O-Ring) Defect Detection Using Adaptive Binarization and Convex Hull Method Based on Deep Learning (적응형 이진화와 컨벡스 헐 기법을 적용한 심층학습 기반 기계부품(오링) 불량 판별)

  • Kim, Hyun-Tae;Seong, Eun-San
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.12
    • /
    • pp.1853-1858
    • /
    • 2021
  • O-rings fill the gaps between mechanical parts. Until now, the sorting of defective products has been performed visually and manually, so classification errors often occur. Therefore, a camera-based defect classification system without human intervention is required. However, a binarization process is required to separate the required region from the background in the camera input image. In this paper, an adaptive binarization technique that considers the surrounding pixel values is applied to solve the problem that single-threshold binarization is difficult to apply due to factors such as changes in ambient lighting or reflections. In addition, the convex hull technique is also applied to compensate for the missing pixel part. And the learning model to be applied to the separated region applies the residual error-based deep learning neural network model, which is advantageous when the defective characteristic is non-linear. It is suggested that the proposed system through experiments can be applied to the automation of O-ring defect detection.

Development for Analysis Service of Crowd Density in CCTV Video using YOLOv4 (YOLOv4를 이용한 CCTV 영상 내 군중 밀집도 분석 서비스 개발)

  • Seung-Yeon Hwang;Jeong-Joon Kim
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.24 no.3
    • /
    • pp.177-182
    • /
    • 2024
  • In this paper, the purpose of this paper is to predict and prevent the risk of crowd concentration in advance for possible future crowd accidents based on the Itaewon crush accident in Korea on October 29, 2022. In the case of a single CCTV, the administrator can determine the current situation in real time, but since the screen cannot be seen throughout the day, objects are detected using YOLOv4, which learns images taken with CCTV angle, and safety accidents due to crowd concentration are prevented by notification when the number of clusters exceeds. The reason for using the YOLO v4 model is that it improves with higher accuracy and faster speed than the previous YOLO model, making object detection techniques easier. This service will go through the process of testing with CCTV image data registered on the AI-Hub site. Currently, CCTVs have increased exponentially in Korea, and if they are applied to actual CCTVs, it is expected that various accidents, including accidents caused by crowd concentration in the future, can be prevented.

A Novel Character Segmentation Method for Text Images Captured by Cameras

  • Lue, Hsin-Te;Wen, Ming-Gang;Cheng, Hsu-Yung;Fan, Kuo-Chin;Lin, Chih-Wei;Yu, Chih-Chang
    • ETRI Journal
    • /
    • v.32 no.5
    • /
    • pp.729-739
    • /
    • 2010
  • Due to the rapid development of mobile devices equipped with cameras, instant translation of any text seen in any context is possible. Mobile devices can serve as a translation tool by recognizing the texts presented in the captured scenes. Images captured by cameras will embed more external or unwanted effects which need not to be considered in traditional optical character recognition (OCR). In this paper, we segment a text image captured by mobile devices into individual single characters to facilitate OCR kernel processing. Before proceeding with character segmentation, text detection and text line construction need to be performed in advance. A novel character segmentation method which integrates touched character filters is employed on text images captured by cameras. In addition, periphery features are extracted from the segmented images of touched characters and fed as inputs to support vector machines to calculate the confident values. In our experiment, the accuracy rate of the proposed character segmentation system is 94.90%, which demonstrates the effectiveness of the proposed method.

Active Contour Model for Boundary Detection of Multiple Objects (복수 객체의 윤곽 검출 방법에 대한 능동윤곽모델)

  • Jang, Jong-Whan
    • The KIPS Transactions:PartB
    • /
    • v.17B no.5
    • /
    • pp.375-380
    • /
    • 2010
  • Most of previous algorithms of object boundary extraction have been studied for extracting the boundary of single object. However, multiple objects are much common in the real image. The proposed algorithm of extracting the boundary of each of multiple objects has two steps. In the first step, we propose the fast method using the outer and inner products; the initial contour including multiple objects is split and connected and each of new contours includes only one object. In the second step, an improved active contour model is studied to extract the boundary of each object included each of contours. Experimental results with various test images have shown that our algorithm produces much better results than the previous algorithms.

Terrain Aided Inertial Navigation for Precise Planetary Landing (정밀 행성 착륙을 위한 지형 보조 관성 항법 연구)

  • Jeong, Bo-Young;Choi, Yoon-Hyuk;Jo, Su-Jang;Bang, Hyo-Choong
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.38 no.7
    • /
    • pp.673-683
    • /
    • 2010
  • This study investigates Terrain Aided Inertial Navigation(TAIN) which consists of Inertial Navigation System (INS) with the optical sensor for precise planetary landing. Image processing is conducted to extract the feature points between measured terrain data and on-board implemented terrain information. The navigation algorithm with Iterated Extended Kalman Filter(IEKF) can compensate for the navigation error, and provide precise navigation information compared to single INS. Simulation results are used to demonstrate the feasibility of integration to accomplish precise planetary landing. The proposed navigation approach can be implemented to the whole system coupled with guidance and control laws.

Semiautomated Analysis of Data from an Imaging Sonar for Fish Counting, Sizing, and Tracking in a Post-Processing Application

  • Kang, Myoung-Hee
    • Fisheries and Aquatic Sciences
    • /
    • v.14 no.3
    • /
    • pp.218-225
    • /
    • 2011
  • Dual frequency identification sonar (DIDSON) is an imaging sonar that has been used for numerous fisheries investigations in a diverse range of freshwater and marine environments. The main purpose of DIDSON is fish counting, fish sizing, and fish behavioral studies. DIDSON records video-quality data, so processing power for handling the vast amount of data with high speed is a priority. Therefore, a semiautomated analysis of DIDSON data for fish counting, sizing, and fish behavior in Echoview (fisheries acoustic data analysis software) was accomplished using testing data collected on the Rakaia River, New Zealand. Using this data, the methods and algorithms for background noise subtraction, image smoothing, target (fish) detection, and conversion to single targets were precisely illustrated. Verification by visualization identified the resulting targets. As a result, not only fish counts but also fish sizing information such as length, thickness, perimeter, compactness, and orientation were obtained. The alpha-beta fish tracking algorithm was employed to extract the speed, change in depth, and the distributed depth relating to fish behavior. Tail-beat pattern was depicted using the maximum intensity of all beams. This methodology can be used as a template and applied to data from BlueView two-dimensional imaging sonar.

Learning Algorithm for Multiple Distribution Data using Haar-like Feature and Decision Tree (다중 분포 학습 모델을 위한 Haar-like Feature와 Decision Tree를 이용한 학습 알고리즘)

  • Kwak, Ju-Hyun;Woen, Il-Young;Lee, Chang-Hoon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.2 no.1
    • /
    • pp.43-48
    • /
    • 2013
  • Adaboost is widely used for Haar-like feature boosting algorithm in Face Detection. It shows very effective performance on single distribution model. But when detecting front and side face images at same time, Adaboost shows it's limitation on multiple distribution data because it uses linear combination of basic classifier. This paper suggest the HDCT, modified decision tree algorithm for Haar-like features. We still tested the performance of HDCT compared with Adaboost on multiple distributed image recognition.