• Title/Summary/Keyword: 객체 탐지 알고리즘

Search Result 168, Processing Time 0.026 seconds

Perceptual Generative Adversarial Network for Single Image De-Snowing (단일 영상에서 눈송이 제거를 위한 지각적 GAN)

  • Wan, Weiguo;Lee, Hyo Jong
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.8 no.10
    • /
    • pp.403-410
    • /
    • 2019
  • Image de-snowing aims at eliminating the negative influence by snow particles and improving scene understanding in images. In this paper, a perceptual generative adversarial network based a single image snow removal method is proposed. The residual U-Net is designed as a generator to generate the snow free image. In order to handle various sizes of snow particles, the inception module with different filter kernels is adopted to extract multiple resolution features of the input snow image. Except the adversarial loss, the perceptual loss and total variation loss are employed to improve the quality of the resulted image. Experimental results indicate that our method can obtain excellent performance both on synthetic and realistic snow images in terms of visual observation and commonly used visual quality indices.

A Robust Real-Time License Plate Recognition System Using Anchor-Free Method and Convolutional Neural Network

  • Kim, Dae-Hoon;Kim, Do-Hyeon;Lee, Dong-Hoon;Kim, Yoon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.4
    • /
    • pp.19-26
    • /
    • 2022
  • With the recent development of intelligent transportation systems, car license plate recognition systems are being used in various fields. Such systems need to guarantee real-time performance to recognize the license plate of a driving car. Also, they should keep a high recognition rate even in problematic situations such as small license plates in low-resolution and unclear image due to distortion. In this paper, we propose a real-time car license plate recognition system that improved processing speed using object detection algorithm based on anchor-free method and text recognition algorithm based on Convolutional Neural Network(CNN). In addition, we used Spatial Transformer Network to increase the recognition rate on the low resolution or distorted images. We confirm that the proposed system is faster than previously existing car license plate recognition systems and maintains a high recognition rate in a variety of environment and quality images because the proposed system's recognition rate is 93.769% and the processing speed per image is about 0.006 seconds.

Development of deep learning network based low-quality image enhancement techniques for improving foreign object detection performance (이물 객체 탐지 성능 개선을 위한 딥러닝 네트워크 기반 저품질 영상 개선 기법 개발)

  • Ki-Yeol Eom;Byeong-Seok Min
    • Journal of Internet Computing and Services
    • /
    • v.25 no.1
    • /
    • pp.99-107
    • /
    • 2024
  • Along with economic growth and industrial development, there is an increasing demand for various electronic components and device production of semiconductor, SMT component, and electrical battery products. However, these products may contain foreign substances coming from manufacturing process such as iron, aluminum, plastic and so on, which could lead to serious problems or malfunctioning of the product, and fire on the electric vehicle. To solve these problems, it is necessary to determine whether there are foreign materials inside the product, and may tests have been done by means of non-destructive testing methodology such as ultrasound ot X-ray. Nevertheless, there are technical challenges and limitation in acquiring X-ray images and determining the presence of foreign materials. In particular Small-sized or low-density foreign materials may not be visible even when X-ray equipment is used, and noise can also make it difficult to detect foreign objects. Moreover, in order to meet the manufacturing speed requirement, the x-ray acquisition time should be reduced, which can result in the very low signal- to-noise ratio(SNR) lowering the foreign material detection accuracy. Therefore, in this paper, we propose a five-step approach to overcome the limitations of low resolution, which make it challenging to detect foreign substances. Firstly, global contrast of X-ray images are increased through histogram stretching methodology. Second, to strengthen the high frequency signal and local contrast, we applied local contrast enhancement technique. Third, to improve the edge clearness, Unsharp masking is applied to enhance edges, making objects more visible. Forth, the super-resolution method of the Residual Dense Block (RDB) is used for noise reduction and image enhancement. Last, the Yolov5 algorithm is employed to train and detect foreign objects after learning. Using the proposed method in this study, experimental results show an improvement of more than 10% in performance metrics such as precision compared to low-density images.

High Resolution InSAR Phase Simulation using DSM in Urban Areas (도심지역 DSM을 이용한 고해상도 InSAR 위상 시뮬레이션)

  • Yoon, Geun-Won;Kim, Sang-Wan;Lee, Yong-Woong;Lee, Dong-Cheon;Won, Joong-Sun
    • Korean Journal of Remote Sensing
    • /
    • v.27 no.2
    • /
    • pp.181-190
    • /
    • 2011
  • Since the radar satellite missions such as TerraSAR-X and COSMO-SkyMed were launched in 2007, the spatial resolution of spaceborne SAR(Synthetic Aperture Radar) images reaches about 1 meter at spotlight mode. In 2011, the first Korean SAR satellite, KOMPSAT-5, will be launched, operating at X-band with the highest spatial resolution of 1 m as well. The improved spatial resolution of state-of-the-art SAR sensor suggests expanding InSAR(Interferometric SAR) analysis in urban monitoring. By the way, the shadow and layover phenomena are more prominent in urban areas due to building structure because of inherent side-looking geometry of SAR system. Up to date the most conventional algorithms do not consider the return signals at the frontage of building during InSAR phase and SAR intensity simulation. In this study the new algorithm introducing multi-scattering in layover region is proposed for phase and intensity simulation, which is utilized a precise LIDAR DSM(Digital Surface Model) in urban areas. The InSAR phases simulated by the proposed method are compared with TerraSAR-X spotlight data. As a result, both InSAR phases are well matched, even in layover areas. This study will be applied to urban monitoring using high resolution SAR data, in terms of change detection and displacement monitoring at the scale of building unit.

Method for determining flood risk in construction sites using artificial neural network techniques (인공 신경망 기법을 활용한 건설 현장 침수 위험 판정 방법)

  • Im Jang Hyuk;Cho Hye Rin
    • Proceedings of the Korea Water Resources Association Conference
    • /
    • 2023.05a
    • /
    • pp.344-344
    • /
    • 2023
  • 최근 기후변화에 따라 극한 강우로 전 세계적으로 국지적 홍수 피해가 증가하고 있다. 또한 극한 강우 발생시 다양한 건설 현장의 상황에 따라 침수 취약성이 나타나 인적 물적 피해로 이어질 수 있다. 특히, 시공에 따른 현장 지형 변화에 대해 실시간으로 침수 예측이 불가하여 위험 판단이 어려운 실정이며, 극한 강우 발생에 대비하기 위해 강우 정보 획득 및 분석을 효율화하여 강우예측 정확성을 높일 필요가 있다. 이러한 필요성에 따라 본 연구에서는 건설 현장의 침수 피해를 최소화하기 위해 침수 위험을 판정하고 예측하는 방법을 제시하고자 한다. 본 연구의 침수 위험 판정 방법은 건설 현장에서 실시간 지형변화 정보 확보와 침수 위험 판정의 정확도를 높이기 위한 침수심 분석에 인공 신경망 기법을 활용하였다. 또한, 침수판정 알고리즘은 지형, 강우 분석 모듈과 침수판정 모듈로 구성하였다. 지형 분석 모듈은 건설 현장이 시공진행에 따른 지형 데이터의 변화를 고려하기 위해 실시간 영상 정보의 객체 탐지를 구분하는 인공 신경망 기법을 적용해 지형 분석 모듈을 구축하였다. 강우 분석 모듈은 다양한 강우 정보를 취합할 수 있는 서버를 구축하여 강우 임베딩 정보를 실시간으로 분석하도록 고안하여 정확도를 높였다. 이러한 자료를 바탕으로 강우-유출해석에 의한 침수심 값과 실측값, 침수 지표를 활용하여 인공 신경망 기법으로 침수 위험을 판정하도록 제시하였다. 본 연구를 통해 건설 현장에서 지형 상태의 지속적인 변화와 강우데이터의 정확도 향상에 대응할 수 있는 침수 위험 판정이 가능하고 인적 물적 피해 최소화를 기대할 수 있다. 향후, 본 연구에서 제시된 방법은 건설 현장에서 분석 시스템과 실측 모니터링에 의해 검증되어야 할 것이며, 건설 현장 외에도 스마트 도시 및 지하 공간에서 확대하여 적용할 수 있을 것으로 판단된다.

  • PDF

A Comparison of Image Classification System for Building Waste Data based on Deep Learning (딥러닝기반 건축폐기물 이미지 분류 시스템 비교)

  • Jae-Kyung Sung;Mincheol Yang;Kyungnam Moon;Yong-Guk Kim
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.23 no.3
    • /
    • pp.199-206
    • /
    • 2023
  • This study utilizes deep learning algorithms to automatically classify construction waste into three categories: wood waste, plastic waste, and concrete waste. Two models, VGG-16 and ViT (Vision Transformer), which are convolutional neural network image classification algorithms and NLP-based models that sequence images, respectively, were compared for their performance in classifying construction waste. Image data for construction waste was collected by crawling images from search engines worldwide, and 3,000 images, with 1,000 images for each category, were obtained by excluding images that were difficult to distinguish with the naked eye or that were duplicated and would interfere with the experiment. In addition, to improve the accuracy of the models, data augmentation was performed during training with a total of 30,000 images. Despite the unstructured nature of the collected image data, the experimental results showed that VGG-16 achieved an accuracy of 91.5%, and ViT achieved an accuracy of 92.7%. This seems to suggest the possibility of practical application in actual construction waste data management work. If object detection techniques or semantic segmentation techniques are utilized based on this study, more precise classification will be possible even within a single image, resulting in more accurate waste classification

An Approach Using LSTM Model to Forecasting Customer Congestion Based on Indoor Human Tracking (실내 사람 위치 추적 기반 LSTM 모델을 이용한 고객 혼잡 예측 연구)

  • Hee-ju Chae;Kyeong-heon Kwak;Da-yeon Lee;Eunkyung Kim
    • Journal of the Korea Society for Simulation
    • /
    • v.32 no.3
    • /
    • pp.43-53
    • /
    • 2023
  • In this detailed and comprehensive study, our primary focus has been placed on accurately gauging the number of visitors and their real-time locations in commercial spaces. Particularly, in a real cafe, using security cameras, we have developed a system that can offer live updates on available seating and predict future congestion levels. By employing YOLO, a real-time object detection and tracking algorithm, the number of visitors and their respective locations in real-time are also monitored. This information is then used to update a cafe's indoor map, thereby enabling users to easily identify available seating. Moreover, we developed a model that predicts the congestion of a cafe in real time. The sophisticated model, designed to learn visitor count and movement patterns over diverse time intervals, is based on Long Short Term Memory (LSTM) to address the vanishing gradient problem and Sequence-to-Sequence (Seq2Seq) for processing data with temporal relationships. This innovative system has the potential to significantly improve cafe management efficiency and customer satisfaction by delivering reliable predictions of cafe congestion to all users. Our groundbreaking research not only demonstrates the effectiveness and utility of indoor location tracking technology implemented through security cameras but also proposes potential applications in other commercial spaces.

Detecting Vehicles That Are Illegally Driving on Road Shoulders Using Faster R-CNN (Faster R-CNN을 이용한 갓길 차로 위반 차량 검출)

  • Go, MyungJin;Park, Minju;Yeo, Jiho
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.21 no.1
    • /
    • pp.105-122
    • /
    • 2022
  • According to the statistics about the fatal crashes that have occurred on the expressways for the last 5 years, those who died on the shoulders of the road has been as 3 times high as the others who died on the expressways. It suggests that the crashes on the shoulders of the road should be fatal, and that it would be important to prevent the traffic crashes by cracking down on the vehicles intruding the shoulders of the road. Therefore, this study proposed a method to detect a vehicle that violates the shoulder lane by using the Faster R-CNN. The vehicle was detected based on the Faster R-CNN, and an additional reading module was configured to determine whether there was a shoulder violation. For experiments and evaluations, GTAV, a simulation game that can reproduce situations similar to the real world, was used. 1,800 images of training data and 800 evaluation data were processed and generated, and the performance according to the change of the threshold value was measured in ZFNet and VGG16. As a result, the detection rate of ZFNet was 99.2% based on Threshold 0.8 and VGG16 93.9% based on Threshold 0.7, and the average detection speed for each model was 0.0468 seconds for ZFNet and 0.16 seconds for VGG16, so the detection rate of ZFNet was about 7% higher. The speed was also confirmed to be about 3.4 times faster. These results show that even in a relatively uncomplicated network, it is possible to detect a vehicle that violates the shoulder lane at a high speed without pre-processing the input image. It suggests that this algorithm can be used to detect violations of designated lanes if sufficient training datasets based on actual video data are obtained.