• Title/Summary/Keyword: object detection system

Search Result 1,079, Processing Time 0.028 seconds

Development of visitor counter system for disaster situations and marketing based on real-time object recognition technology (재난상황과 마케팅을 위한 실시간 객체인식 기술기반 출입자 카운터시스템 개발)

  • Kim, Young-gwon;Jeong, Jae-hoon;Kim, Jae-hyeon;Kang, Myeung-jin;Kang, Min-sung;Ju, Hui-je;Jang, Woo-hyun;Yun, Tae-jin
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.01a
    • /
    • pp.187-188
    • /
    • 2021
  • 최근 COVID19 상황에서 생활 속 거리두기가 강조되면서 관광지나 다중이용시설 등의 이용객 수와 밀집도를 파악하는 것이 중요해지고 있다. 따라서, CCTV 영상을 활용하여 저렴한 비용으로 다중이용시설의 출입자수에 대한 정보를 실시간으로 모니터링할 수 있는 시스템이 필요하다. 이를 위해 본 논문에서는 딥러닝 실시간 객체인식기술을 활용한 출입자의 수와 동선을 측정하여 출입자에 대한 통계정보를 웹브라우저를 통해 제공하는 시스템을 개발하였다. 실시간 객체인식기술인 YOLOv4와 YOLOv4-tiny 알고리즘을 Nvidia사의 Jetson AGX Xavier 와 데스크톱PC에 적용하여 각 알고리즘의 FPS와 객체 인식률을 비교 분석 하여 알고리즘을 적용하였다.

  • PDF

Two-Stage Deep Learning Based Algorithm for Cosmetic Object Recognition (화장품 물체 인식을 위한 Two-Stage 딥러닝 기반 알고리즘)

  • Jongmin Kim;Daeho Seo
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.46 no.4
    • /
    • pp.101-106
    • /
    • 2023
  • With the recent surge in YouTube usage, there has been a proliferation of user-generated videos where individuals evaluate cosmetics. Consequently, many companies are increasingly utilizing evaluation videos for their product marketing and market research. However, a notable drawback is the manual classification of these product review videos incurring significant costs and time. Therefore, this paper proposes a deep learning-based cosmetics search algorithm to automate this task. The algorithm consists of two networks: One for detecting candidates in images using shape features such as circles, rectangles, etc and Another for filtering and categorizing these candidates. The reason for choosing a Two-Stage architecture over One-Stage is that, in videos containing background scenes, it is more robust to first detect cosmetic candidates before classifying them as specific objects. Although Two-Stage structures are generally known to outperform One-Stage structures in terms of model architecture, this study opts for Two-Stage to address issues related to the acquisition of training and validation data that arise when using One-Stage. Acquiring data for the algorithm that detects cosmetic candidates based on shape and the algorithm that classifies candidates into specific objects is cost-effective, ensuring the overall robustness of the algorithm.

Automatic Reading System for On-off Type DNA Chip

  • Ryu, Mun-Ho;Kim, Jong-Dae;Kim, Jong-Won
    • Journal of Information Processing Systems
    • /
    • v.2 no.3 s.4
    • /
    • pp.189-193
    • /
    • 2006
  • In this study we propose an automatic reading system for diagnostic DNA chips. We define a general specification for an automatic reading system and propose a possible implementation method. The proposed system performs the whole reading process automatically without any user intervention, covering image acquisition, image analysis, and report generation. We applied the system for the automatic report generation of a commercialized DNA chip for cervical cancer detection. The fluorescence image of the hybridization result was acquired with a $GenePix^{TM}$ scanner using its library running in HTML pages. The processing of the acquired image and the report generation were executed by a component object module programmed with Microsoft Visual C++ 6.0. To generate the report document, we made an HWP 2002 document template with marker strings that were supposed to be searched and replaced with the corresponding information such as patient information and diagnosis results. The proposed system generates the report document by reading the template and changing the marker strings with the resultant contents. The system is expected to facilitate the usage of a diagnostic DNA chip for mass screening by the automation of a conventional manual reading process, shortening its processing time, and quantifying the reading criteria.

Real time instruction classification system

  • Sang-Hoon Lee;Dong-Jin Kwon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.16 no.3
    • /
    • pp.212-220
    • /
    • 2024
  • A recently the advancement of society, AI technology has made significant strides, especially in the fields of computer vision and voice recognition. This study introduces a system that leverages these technologies to recognize users through a camera and relay commands within a vehicle based on voice commands. The system uses the YOLO (You Only Look Once) machine learning algorithm, widely used for object and entity recognition, to identify specific users. For voice command recognition, a machine learning model based on spectrogram voice analysis is employed to identify specific commands. This design aims to enhance security and convenience by preventing unauthorized access to vehicles and IoT devices by anyone other than registered users. We converts camera input data into YOLO system inputs to determine if it is a person, Additionally, it collects voice data through a microphone embedded in the device or computer, converting it into time-domain spectrogram data to be used as input for the voice recognition machine learning system. The input camera image data and voice data undergo inference tasks through pre-trained models, enabling the recognition of simple commands within a limited space based on the inference results. This study demonstrates the feasibility of constructing a device management system within a confined space that enhances security and user convenience through a simple real-time system model. Finally our work aims to provide practical solutions in various application fields, such as smart homes and autonomous vehicles.

Illumination Environment Adaptive Real-time Video Surveillance System for Security of Important Area (중요지역 보안을 위한 조명환경 적응형 실시간 영상 감시 시스템)

  • An, Sung-Jin;Lee, Kwan-Hee;Kwon, Goo-Rak;Kim, Nam-Hyung;Ko, Sung-Jea
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.44 no.2 s.314
    • /
    • pp.116-125
    • /
    • 2007
  • In this paper, we propose a illumination environment adaptive real-time surveillance system for security of important area such as military bases, prisons, and strategic infra structures. The proposed system recognizes movement of objects on the bright environments as well as in dark illumination. The procedure of proposed system may be summarized as follows. First, the system discriminates between bright and dark with input image distribution. Then, if the input image is dark, the system has a pre-processing. The Multi-scale Retinex Color Restoration(MSRCR) is processed to enhance the contrast of image captured in dark environments. Secondly, the enhanced input image is subtracted with the revised background image. And then, we take a morphology image processing to obtain objects correctly. Finally, each bounding box enclosing each objects are tracked. The center point of each bounding box obtained by the proposed algorithm provides more accurate tracking information. Experimental results show that the proposed system provides good performance even though an object moves very fast and the background is quite dark.

Alternative Tracing Method for Moving Object Using Reference Template in Real-time Image - Focusing on Parking Management System (참조 템플릿 기반 실시간 이동체 영상을 이용한 대안적 탐지 방안 - 주차관리시스템을 대상으로)

  • Joo, Yong Jin;Kang, Lee Seul;Hahm, Chang Hahk
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.32 no.5
    • /
    • pp.495-503
    • /
    • 2014
  • As the number of vehicles has been sharply increases, the significance of safety and effective operation issues in the parking lot is being emphasized, which takes a part of the transportation system. Recently, there have been several studies for the parking management by detecting moving object, however, recognizing numbers of fast-moving vehicles simultaneously in the picture is still a challenging problem. The parking lot in public area, or large-sized buildings has clear parking section, whereas the sensor system is configured to monitor a plurality of parking spaces. Therefore, by considering those parking lots, we suggested to develop the real-time parking availability information system by applying the real-time image processing techniques. with the help of template matching. Following the study, we wanted to provide the alternative method for parking management system through the reference template makers by recognizing movements of parked vehicles with the size and shape, regardless of direct detecting of driving movements. In addition, we evaluated the applicability and performances of the information system, presented in this study, and implemented a prototype system to simulate the parking statuses of each floor. In fat, it was possible to manage and analyze statistics about the total number of parking spaces and the number of vehicles parked through real-time video flames. We expected that the result of the study will be advanced, following the user-friendliness and cost reduction in operating parking management system and giving information by efficient analysis of parking situation.

Laser Tracking Analysis of Space Debris using SOLT System at Mt. Gamak (감악산 SOLT 시스템을 이용한 우주잔해물 레이저추적 성능분석)

  • Lim, Hyung-Chul;Park, Jong-Uk;Kim, Dong-Jin;Seong, Kipyung;Ka, Neung-Hyun
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.43 no.9
    • /
    • pp.830-837
    • /
    • 2015
  • Space debris has been a major issue recently for the space-active nations because its growing population is expected to increase the collision risk with operational satellites. Radar and electro-optical system has been used for space debris surveillance, which may cause unnecessary anti-collision manoeuvers due to their low tracking accuracy. So an additional tracking system is required to improve the predicted orbit accuracy and then to jude the anti-collision maneouvers more efficiently. The laser tracking system has been considered as an alternative to decrease these unnecessary manoeuvers. Korea Astronomy and Space Science Institute has been developing a space object laser tracking system which is capable of laser tracking for satellites with retro-reflectors and for space debris using high power laser, and satellite imaging using adaptive optics. In this study, the tracking capability is analyzed for space debris using high power laser based on link budget, false alarm probability and signal detection probability.

A study on the design of an efficient hardware and software mixed-mode image processing system for detecting patient movement (환자움직임 감지를 위한 효율적인 하드웨어 및 소프트웨어 혼성 모드 영상처리시스템설계에 관한 연구)

  • Seungmin Jung;Euisung Jung;Myeonghwan Kim
    • Journal of Internet Computing and Services
    • /
    • v.25 no.1
    • /
    • pp.29-37
    • /
    • 2024
  • In this paper, we propose an efficient image processing system to detect and track the movement of specific objects such as patients. The proposed system extracts the outline area of an object from a binarized difference image by applying a thinning algorithm that enables more precise detection compared to previous algorithms and is advantageous for mixed-mode design. The binarization and thinning steps, which require a lot of computation, are designed based on RTL (Register Transfer Level) and replaced with optimized hardware blocks through logic circuit synthesis. The designed binarization and thinning block was synthesized into a logic circuit using the standard 180n CMOS library and its operation was verified through simulation. To compare software-based performance, performance analysis of binary and thinning operations was also performed by applying sample images with 640 × 360 resolution in a 32-bit FPGA embedded system environment. As a result of verification, it was confirmed that the mixed-mode design can improve the processing speed by 93.8% in the binary and thinning stages compared to the previous software-only processing speed. The proposed mixed-mode system for object recognition is expected to be able to efficiently monitor patient movements even in an edge computing environment where artificial intelligence networks are not applied.

Proposal for Deep Learning based Character Recognition System by Virtual Data Generation (가상 데이터 생성을 통한 딥러닝 기반 문자인식 시스템 제안)

  • Lee, Seungju;Park, Gooman
    • Journal of Broadcast Engineering
    • /
    • v.25 no.2
    • /
    • pp.275-278
    • /
    • 2020
  • In this paper, we proposed a deep learning based character recognition system through virtual data generation. In order to secure the learning data that takes the largest weight in supervised learning, virtual data was created. Also, after creating virtual data, data generalization was performed to cope with various data by using augmentation parameter. Finally, the learning data composition generated data by assigning various values to augmentation parameter and font parameter. Test data for measuring the character recognition performance was constructed by cropping the text area from the actual image data. The test data was augmented considering the image distortion that may occur in real environment. Deep learning algorithm uses YOLO v3 which performs detection in real time. Inference result outputs the final detection result through post-processing.

Detecting and Counting People system based on Vision Sensor (비전 센서 기반의 사람 검출 및 계수 시스템)

  • Park, Ho-Sik
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.6 no.1
    • /
    • pp.1-5
    • /
    • 2013
  • The number of pedestrians is considered essential information which can be used to control a person who makes a entrance or a exit into a building. The number of pedestrians, also, can be used to help to manage pedestrian traffic and the volume of pedestrian flow within the building. Due to the fact there is incorrect detection by occluded, shadows, and illumination, however, difficulty can arise in existing system which is for detection and counts of a person who makes a entrance or a exit into a building. In this paper, it is minimized that the change of illumination and the effect of shadow through the transmitted image from camera which is created and processed with great adaptability. The accuracy of the calculations can be increase as well by using Kalman Filter and Mean-Shift Algorithm in order to avoid overlapped counts. As a result of the test, it is proved that the count method that shows the accuracy of 95.4% should be effective for detection and counts.