• Title/Summary/Keyword: Real Time Object Detection

Search Result 535, Processing Time 0.025 seconds

Shipping Container Load State and Accident Risk Detection Techniques Based Deep Learning (딥러닝 기반 컨테이너 적재 정렬 상태 및 사고 위험도 검출 기법)

  • Yeon, Jeong Hum;Seo, Yong Uk;Kim, Sang Woo;Oh, Se Yeong;Jeong, Jun Ho;Park, Jin Hyo;Kim, Sung-Hee;Youn, Joosang
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.11 no.11
    • /
    • pp.411-418
    • /
    • 2022
  • Incorrectly loaded containers can easily knock down by strong winds. Container collapse accidents can lead to material damage and paralysis of the port system. In this paper, We propose a deep learning-based container loading state and accident risk detection technique. Using Darknet-based YOLO, the container load status identifies in real-time through corner casting on the top and bottom of the container, and the risk of accidents notifies the manager. We present criteria for classifying container alignment states and select efficient learning algorithms based on inference speed, classification accuracy, detection accuracy, and FPS in real embedded devices in the same environment. The study found that YOLOv4 had a weaker inference speed and performance of FPS than YOLOv3, but showed strong performance in classification accuracy and detection accuracy.

Deep Learning Image Processing Technology for Vehicle Occupancy Detection (차량탑승인원 탐지를 위한 딥러닝 영상처리 기술 연구)

  • Jang, SungJin;Jang, JongWook
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.25 no.8
    • /
    • pp.1026-1031
    • /
    • 2021
  • With the development of global automotive technology and the expansion of market size, demand for vehicles is increasing, which is leading to a decrease in the number of passengers on the road and an increase in the number of vehicles on the road. This causes traffic jams, and in order to solve these problems, the number of illegal vehicles continues to increase. Various technologies are being studied to crack down on these illegal activities. Previously developed systems use trigger equipment to recognize vehicles and photograph vehicles using infrared cameras to detect the number of passengers on board. In this paper, we propose a vehicle occupant detection system with deep learning model techniques without exploiting existing system-applied trigger equipment. The proposed technique proposes a system to detect vehicles by establishing triggers within images and to apply deep learning object recognition models to detect real-time boarding personnel.

Character Detection and Recognition of Steel Materials in Construction Drawings using YOLOv4-based Small Object Detection Techniques (YOLOv4 기반의 소형 물체탐지기법을 이용한 건설도면 내 철강 자재 문자 검출 및 인식기법)

  • Sim, Ji-Woo;Woo, Hee-Jo;Kim, Yoonhwan;Kim, Eung-Tae
    • Journal of Broadcast Engineering
    • /
    • v.27 no.3
    • /
    • pp.391-401
    • /
    • 2022
  • As deep learning-based object detection and recognition research have been developed recently, the scope of application to industry and real life is expanding. But deep learning-based systems in the construction system are still much less studied. Calculating materials in the construction system is still manual, so it is a reality that transactions of wrong volumn calculation are generated due to a lot of time required and difficulty in accurate accumulation. A fast and accurate automatic drawing recognition system is required to solve this problem. Therefore, we propose an AI-based automatic drawing recognition accumulation system that detects and recognizes steel materials in construction drawings. To accurately detect steel materials in construction drawings, we propose data augmentation techniques and spatial attention modules for improving small object detection performance based on YOLOv4. The detected steel material area is recognized by text, and the number of steel materials is integrated based on the predicted characters. Experimental results show that the proposed method increases the accuracy and precision by 1.8% and 16%, respectively, compared with the conventional YOLOv4. As for the proposed method, Precision performance was 0.938. The recall was 1. Average Precision AP0.5 was 99.4% and AP0.5:0.95 was 67%. Accuracy for character recognition obtained 99.9.% by configuring and learning a suitable dataset that contains fonts used in construction drawings compared to the 75.6% using the existing dataset. The average time required per image was 0.013 seconds in the detection, 0.65 seconds in character recognition, and 0.16 seconds in the accumulation, resulting in 0.84 seconds.

Performance Evaluation of Object Detection Deep Learning Model for Paralichthys olivaceus Disease Symptoms Classification (넙치 질병 증상 분류를 위한 객체 탐지 딥러닝 모델 성능 평가)

  • Kyung won Cho;Ran Baik;Jong Ho Jeong;Chan Jin Kim;Han Suk Choi;Seok Won Jung;Hvun Seung Son
    • Smart Media Journal
    • /
    • v.12 no.10
    • /
    • pp.71-84
    • /
    • 2023
  • Paralichthys olivaceus accounts for a large proportion, accounting for more than half of Korea's aquaculture industry. However, about 25-30% of the total breeding volume throughout the year occurs due to diseases, which has a very bad impact on the economic feasibility of fish farms. For the economic growth of Paralichthys olivaceus farms, it is necessary to quickly and accurately diagnose disease symptoms by automating the diagnosis of Paralichthys olivaceus diseases. In this study, we create training data using innovative data collection methods, refining data algorithms, and techniques for partitioning dataset, and compare the Paralichthys olivaceus disease symptom detection performance of four object detection deep learning models(such as YOLOv8, Swin, Vitdet, MvitV2). The experimental findings indicate that the YOLOv8 model demonstrates superiority in terms of average detection rate (mAP) and Estimated Time of Arrival (ETA). If the performance of the AI model proposed in this study is verified, Paralichthys olivaceus farms can diagnose disease symptoms in real time, and it is expected that the productivity of the farm will be greatly improved by rapid preventive measures according to the diagnosis results.

Boundary estimation in electrical impedance tomography with multi-layer neural networks.

  • Kim, J.H.;Jeon, H.J.;Choi, B.Y.;Kim, M.C.;Kim, S.;Kim, K.Y.
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2003.10a
    • /
    • pp.553-558
    • /
    • 2003
  • The boundary estimation problem is used to estimate the shape of organic depend on the phase of the cardiac cycle or interested in the detection of the location and size of anomalies with resistivity values different from the background tissues such as nuclear reactor. And we can use the method to solve the optimal solution such as modified Newton raphson, kalman filter, extended kalman filter, etc. But, this method consumes much time and is sensitive to the initial value and noise in the estimation of the unknown shape. In the paper, we propose that multi-layer neural networks estimate the boundary of the unknown object using Fourier coefficient. This method can be used at the real time estimation and have strong characteristics at the noise and initial value. It uses voltage change; difference the homogeneous voltage to the non-homogeneous voltage, and change of Fourier coefficient change to train multi-layer neural network. After train, we can have real time estimation using this method.

  • PDF

Real time instruction classification system

  • Sang-Hoon Lee;Dong-Jin Kwon
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.16 no.3
    • /
    • pp.212-220
    • /
    • 2024
  • A recently the advancement of society, AI technology has made significant strides, especially in the fields of computer vision and voice recognition. This study introduces a system that leverages these technologies to recognize users through a camera and relay commands within a vehicle based on voice commands. The system uses the YOLO (You Only Look Once) machine learning algorithm, widely used for object and entity recognition, to identify specific users. For voice command recognition, a machine learning model based on spectrogram voice analysis is employed to identify specific commands. This design aims to enhance security and convenience by preventing unauthorized access to vehicles and IoT devices by anyone other than registered users. We converts camera input data into YOLO system inputs to determine if it is a person, Additionally, it collects voice data through a microphone embedded in the device or computer, converting it into time-domain spectrogram data to be used as input for the voice recognition machine learning system. The input camera image data and voice data undergo inference tasks through pre-trained models, enabling the recognition of simple commands within a limited space based on the inference results. This study demonstrates the feasibility of constructing a device management system within a confined space that enhances security and user convenience through a simple real-time system model. Finally our work aims to provide practical solutions in various application fields, such as smart homes and autonomous vehicles.

A Study on the Moving Object Tracking Algorithm of Static Camera and Active Camera in Environment (고정카메라 및 능동카메라 환경에서 이동물체 추적 알고리즘에 관한 연구)

  • 남기환;배철수
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.7 no.2
    • /
    • pp.344-352
    • /
    • 2003
  • An effective algorithm for implementation of which detects moving object from image sequences. predicts the direction of it. and drives the camera in real time is proposed. In static camera, for robust motion detection from a dynamic background scene, the proposed algorithm performs statistical modeling of moving objects and background, and trains the statistical modeling of moving objects and background, and trains the statistical feature of background with the initial parts of sequence which have no moving objects. Active camera moving objects are segmented by following procedure, an improved order adaptive lattice structured linear predictor is used. The proposed algorithm shows robust object tracking results in the environment of static or active camera. It can be used for the unmanned surveillance system, traffic monitoring system, and autonomous vehicle.

A study of effective contents construction for AR based English learning (AR기반 영어학습을 위한 효과적 콘텐츠 구성 방향에 대한 연구)

  • Kim, Young-Seop;Jeon, Soo-Jin;Lim, Sang-Min
    • Journal of The Institute of Information and Telecommunication Facilities Engineering
    • /
    • v.10 no.4
    • /
    • pp.143-147
    • /
    • 2011
  • The system using augmented reality can save the time and cost. It is verified in various fields under the possibility of a technology by solving unrealistic feeling in the virtual space. Therefore, augmented reality has a variety of the potential to be used. Generally, multimodal senses such as visual/auditory/tactile feed back are well known as a method for enhancing the immersion in case of interaction with virtual object. By adapting tangible object we can provide touch sensation to users. a 3D model of the same scale overlays the whole area of the tangible object; thus, the marker area is invisible. This contributes to enhancing immersive and natural images to users. Finally, multimodal feedback also creates better immersion. In this paper, sound feedback is considered. By further improving immersion learning augmented reality for children with the initial step learning content is presented. Augmented reality is in the intermediate stages between future world and real world as well as its adaptability is estimated more than virtual reality.

  • PDF

A Study on the Distance Error Correction of Maritime Object Detection System (해상물체탐지시스템 거리오차 보정에 관한 연구)

  • Byung-Sun Kang;Chang-Hyun Jung
    • Journal of the Korean Society of Marine Environment & Safety
    • /
    • v.29 no.2
    • /
    • pp.139-146
    • /
    • 2023
  • Maritime object detection systems, which detects small maritime obstacles such as fish farm buoys and visualizes distance and direction, is equipped with a 3-axis gimbal to compensate for errors caused by hull motion, but there is a limit to distance error corrections necessitated by the vertical movement of the camera and the maritime object due to wave motions. Therefore, in this study, the distance error of maritime object detection systems caused by the movement of the water surface according to the external environment is analyzed and corrected using average filter and moving average filter. Random numbers following a Gaussian standard normal distribution were added to or subtracted from the image coordinates to reproduce the rise or fall of the buoy under irregular waves. The distance calculated according to the change of image coordinates, the predicted distance through the average filter and the moving average filter, and the actual distance measured by laser distance meter were compared. In phases 1 and 2, the error rate increased to a maximum of 98.5% due to the changes of image coordinates due to irregular waves, but the error rate decreased to 16.3% with the moving average filter. This error correction capability was better than with the average filter, but there was a limit due to failure to respond to the distance change. Therefore, it is considered that use of the moving average filter to correct the distance error of the maritime object detection system will enhance responses to the real-time distance change and greatly improve the error rate.

Process fault diagnostics using the integrated graph model

  • Yoon, Yeo-Hong;Nam, Dong-Soo;Jeong, Chang-Wook;Yoon, En-Sup
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1991.10b
    • /
    • pp.1705-1711
    • /
    • 1991
  • On-line fault detection and diagnosis has an increasing interest in a chemical process industry, especially for a process control and automation. The chemical process needs an intelligent operation-aided workstation which can do such tasks as process monitoring, fault detection, fault diagnosis and action guidance in semiautomatic mode. These tasks can increase the performance of a process operation and give merits in economics, safety and reliability. Aiming these tasks, series of researches have been done in our lab. Main results from these researches are building appropriate knowledge representation models and a diagnosis mechanism for fault detection and diagnosis in a chemical process. The knowledge representation schemes developed in our previous research, the symptom tree model and the fault-consequence digraph, showed the effectiveness and the usefulness in a real-time application, of the process diagnosis, especially in large and complex plants. However in our previous approach, the diagnosis speed is its demerit in spite of its merits of high resolution, mainly due to using two knowledge models complementarily. In our current study, new knowledge representation scheme is developed which integrates the previous two knowledge models, the symptom tree and the fault-consequence digraph, into one. This new model is constructed using a material balance, energy balance, momentum balance and equipment constraints. Controller related constraints are included in this new model, which possesses merits of the two previous models. This new integrated model will be tested and verified by the real-time application in a BTX process or a crude unit process. The reliability and flexibility will be greatly enhanced compared to the previous model in spite of the low diagnosis speed. Nexpert Object for the expert system shell and SUN4 workstation for the hardware platform are used. TCP/IP for a communication protocol and interfacing to a dynamic simulator, SPEEDUP, for a dynamic data generation are being studied.

  • PDF