• Title/Summary/Keyword: a neural-net

Search Result 672, Processing Time 0.03 seconds

Classification of Clothing Using Googlenet Deep Learning and IoT based on Artificial Intelligence (인공지능 기반 구글넷 딥러닝과 IoT를 이용한 의류 분류)

  • Noh, Sun-Kuk
    • Smart Media Journal
    • /
    • v.9 no.3
    • /
    • pp.41-45
    • /
    • 2020
  • Recently, artificial intelligence (AI) and the Internet of things (IoT), which are represented by machine learning and deep learning among IT technologies related to the Fourth Industrial Revolution, are applied to our real life in various fields through various researches. In this paper, IoT and AI using object recognition technology are applied to classify clothing. For this purpose, the image dataset was taken using webcam and raspberry pi, and GoogLeNet, a convolutional neural network artificial intelligence network, was applied to transfer the photographed image data. The clothing image dataset was classified into two categories (shirtwaist, trousers): 900 clean images, 900 loss images, and total 1800 images. The classification measurement results showed that the accuracy of the clean clothing image was about 97.78%. In conclusion, the study confirmed the applicability of other objects using artificial intelligence networks on the Internet of Things based platform through the measurement results and the supplementation of more image data in the future.

DEVELOPMENT OF GREEN'S FUNCTION APPROACH CONSIDERING TEMPERATURE-DEPENDENT MATERIAL PROPERTIES AND ITS APPLICATION

  • Ko, Han-Ok;Jhung, Myung Jo;Choi, Jae-Boong
    • Nuclear Engineering and Technology
    • /
    • v.46 no.1
    • /
    • pp.101-108
    • /
    • 2014
  • About 40% of reactors in the world are being operated beyond design life or are approaching the end of their life cycle. During long-term operation, various degradation mechanisms occur. Fatigue caused by alternating operational stresses in terms of temperature or pressure change is an important damage mechanism in continued operation of nuclear power plants. To monitor the fatigue damage of components, Fatigue Monitoring System (FMS) has been installed. Most FMSs have used Green's Function Approach (GFA) to calculate the thermal stresses rapidly. However, if temperature-dependent material properties are used in a detailed FEM, there is a maximum peak stress discrepancy between a conventional GFA and a detailed FEM because constant material properties are used in a conventional method. Therefore, if a conventional method is used in the fatigue evaluation, thermal stresses for various operating cycles may be calculated incorrectly and it may lead to an unreliable estimation. So, in this paper, the modified GFA which can consider temperature-dependent material properties is proposed by using an artificial neural network and weight factor. To verify the proposed method, thermal stresses by the new method are compared with those by FEM. Finally, pros and cons of the new method as well as technical findings from the assessment are discussed.

Abnormal state diagnosis model tolerant to noise in plant data

  • Shin, Ji Hyeon;Kim, Jae Min;Lee, Seung Jun
    • Nuclear Engineering and Technology
    • /
    • v.53 no.4
    • /
    • pp.1181-1188
    • /
    • 2021
  • When abnormal events occur in a nuclear power plant, operators must conduct appropriate abnormal operating procedures. It is burdensome though for operators to choose the appropriate procedure considering the numerous main plant parameters and hundreds of alarms that should be judged in a short time. Recently, various research has applied deep-learning algorithms to support this problem by classifying each abnormal condition with high accuracy. Most of these models are trained with simulator data because of a lack of plant data for abnormal states, and as such, developed models may not have tolerance for plant data in actual situations. In this study, two approaches are investigated for a deep-learning model trained with simulator data to overcome the performance degradation caused by noise in actual plant data. First, a preprocessing method using several filters was employed to smooth the test data noise, and second, a data augmentation method was applied to increase the acceptability of the untrained data. Results of this study confirm that the combination of these two approaches can enable high model performance even in the presence of noisy data as in real plants.

Point-level deep learning approach for 3D acoustic source localization

  • Lee, Soo Young;Chang, Jiho;Lee, Seungchul
    • Smart Structures and Systems
    • /
    • v.29 no.6
    • /
    • pp.777-783
    • /
    • 2022
  • Even though several deep learning-based methods have been applied in the field of acoustic source localization, the previous works have only been conducted using the two-dimensional representation of the beamforming maps, particularly with the planar array system. While the acoustic sources are more required to be localized in a spherical microphone array system considering that we live and hear in the 3D world, the conventional 2D equirectangular map of the spherical beamforming map is highly vulnerable to the distortion that occurs when the 3D map is projected to the 2D space. In this study, a 3D deep learning approach is proposed to fulfill accurate source localization via distortion-free 3D representation. A target function is first proposed to obtain 3D source distribution maps that can represent multiple sources' positional and strength information. While the proposed target map expands the source localization task into a point-wise prediction task, a PointNet-based deep neural network is developed to precisely estimate the multiple sources' positions and strength information. While the proposed model's localization performance is evaluated, it is shown that the proposed method can achieve improved localization results from both quantitative and qualitative perspectives.

Deep learning framework for bovine iris segmentation

  • Heemoon Yoon;Mira Park;Hayoung Lee;Jisoon An;Taehyun Lee;Sang-Hee Lee
    • Journal of Animal Science and Technology
    • /
    • v.66 no.1
    • /
    • pp.167-177
    • /
    • 2024
  • Iris segmentation is an initial step for identifying the biometrics of animals when establishing a traceability system for livestock. In this study, we propose a deep learning framework for pixel-wise segmentation of bovine iris with a minimized use of annotation labels utilizing the BovineAAEyes80 public dataset. The proposed image segmentation framework encompasses data collection, data preparation, data augmentation selection, training of 15 deep neural network (DNN) models with varying encoder backbones and segmentation decoder DNNs, and evaluation of the models using multiple metrics and graphical segmentation results. This framework aims to provide comprehensive and in-depth information on each model's training and testing outcomes to optimize bovine iris segmentation performance. In the experiment, U-Net with a VGG16 backbone was identified as the optimal combination of encoder and decoder models for the dataset, achieving an accuracy and dice coefficient score of 99.50% and 98.35%, respectively. Notably, the selected model accurately segmented even corrupted images without proper annotation data. This study contributes to the advancement of iris segmentation and the establishment of a reliable DNN training framework.

Rotation and Size Invariant Fingerprint Recognition Using The Neural Net (회전과 크기변화에 무관한 신경망을 이용한 지문 인식)

  • Lee, Nam-Il;U, Yong-Tae;Lee, Jeong-Hwan
    • The Transactions of the Korea Information Processing Society
    • /
    • v.1 no.2
    • /
    • pp.215-224
    • /
    • 1994
  • In this paper, the rotation and size invariant fingerprint recognition using the neural network EART (Extended Adaptive Resonance Theory) is studied ($515{\times}512$) gray level fingerprint images are converted into the binary thinned images based on the adaptive threshold and a thinning algorithm. From these binary thinned images, we extract the ending points and the bifurcation points, which are the most useful critical feature points in the fingerprint images, using the $3{\times}3$ MASK. And we convert the number of these critical points and the interior angles of convex polygon composed of the bifurcation points into the 40*10 critical using the weighted code which is invariant of rotation and size as the input of EART. This system produces very good and efficient results for the rotation and size variations without the restoration of the binary thinned fingerprints.

  • PDF

Image-based Soft Drink Type Classification and Dietary Assessment System Using Deep Convolutional Neural Network with Transfer Learning

  • Rubaiya Hafiz;Mohammad Reduanul Haque;Aniruddha Rakshit;Amina khatun;Mohammad Shorif Uddin
    • International Journal of Computer Science & Network Security
    • /
    • v.24 no.2
    • /
    • pp.158-168
    • /
    • 2024
  • There is hardly any person in modern times who has not taken soft drinks instead of drinking water. The rate of people taking soft drinks being surprisingly high, researchers around the world have cautioned from time to time that these drinks lead to weight gain, raise the risk of non-communicable diseases and so on. Therefore, in this work an image-based tool is developed to monitor the nutritional information of soft drinks by using deep convolutional neural network with transfer learning. At first, visual saliency, mean shift segmentation, thresholding and noise reduction technique, collectively known as 'pre-processing' are adopted to extract the location of drinks region. After removing backgrounds and segment out only the desired area from image, we impose Discrete Wavelength Transform (DWT) based resolution enhancement technique is applied to improve the quality of image. After that, transfer learning model is employed for the classification of drinks. Finally, nutrition value of each drink is estimated using Bag-of-Feature (BoF) based classification and Euclidean distance-based ratio calculation technique. To achieve this, a dataset is built with ten most consumed soft drinks in Bangladesh. These images were collected from imageNet dataset as well as internet and proposed method confirms that it has the ability to detect and recognize different types of drinks with an accuracy of 98.51%.

A Study on the Feasibility of Self-Organizing Net for the Pattern Recognition (패턴인식을 위한 자율조직망의 적용가능성에 관한 연구)

  • 정은호;김진구
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.16 no.5
    • /
    • pp.403-412
    • /
    • 1991
  • This paper proposes a type of self organizing neural network which recognizes arbitrary symbols as well as numerical or alphabetic characters. The proposed algorithm autonomically organizes and classifies similar patterns on the basis of the distribution types of characteristics in the input images. Thus it can be appliced for the recognition of arbitrary images when it is difficult to establish a learning rule. It performs a stale recognition process with in the limit of the memory capacity. The cheme was applied and tested to 50 different image patterns with increased noise level up to 44%(SNR 2dB). The implementation results demonstrate that the proposed algorithm successfully recognizes the image patterns changed due to the various noise levels and thus proves excellent antinoise characteristics.

  • PDF

One-step deep learning-based method for pixel-level detection of fine cracks in steel girder images

  • Li, Zhihang;Huang, Mengqi;Ji, Pengxuan;Zhu, Huamei;Zhang, Qianbing
    • Smart Structures and Systems
    • /
    • v.29 no.1
    • /
    • pp.153-166
    • /
    • 2022
  • Identifying fine cracks in steel bridge facilities is a challenging task of structural health monitoring (SHM). This study proposed an end-to-end crack image segmentation framework based on a one-step Convolutional Neural Network (CNN) for pixel-level object recognition with high accuracy. To particularly address the challenges arising from small object detection in complex background, efforts were made in loss function selection aiming at sample imbalance and module modification in order to improve the generalization ability on complicated images. Specifically, loss functions were compared among alternatives including the Binary Cross Entropy (BCE), Focal, Tversky and Dice loss, with the last three specialized for biased sample distribution. Structural modifications with dilated convolution, Spatial Pyramid Pooling (SPP) and Feature Pyramid Network (FPN) were also performed to form a new backbone termed CrackDet. Models of various loss functions and feature extraction modules were trained on crack images and tested on full-scale images collected on steel box girders. The CNN model incorporated the classic U-Net as its backbone, and Dice loss as its loss function achieved the highest mean Intersection-over-Union (mIoU) of 0.7571 on full-scale pictures. In contrast, the best performance on cropped crack images was achieved by integrating CrackDet with Dice loss at a mIoU of 0.7670.

Enhanced 3D Residual Network for Human Fall Detection in Video Surveillance

  • Li, Suyuan;Song, Xin;Cao, Jing;Xu, Siyang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.16 no.12
    • /
    • pp.3991-4007
    • /
    • 2022
  • In the public healthcare, a computational system that can automatically and efficiently detect and classify falls from a video sequence has significant potential. With the advancement of deep learning, which can extract temporal and spatial information, has become more widespread. However, traditional 3D CNNs that usually adopt shallow networks cannot obtain higher recognition accuracy than deeper networks. Additionally, some experiences of neural network show that the problem of gradient explosions occurs with increasing the network layers. As a result, an enhanced three-dimensional ResNet-based method for fall detection (3D-ERes-FD) is proposed to directly extract spatio-temporal features to address these issues. In our method, a 50-layer 3D residual network is used to deepen the network for improving fall recognition accuracy. Furthermore, enhanced residual units with four convolutional layers are developed to efficiently reduce the number of parameters and increase the depth of the network. According to the experimental results, the proposed method outperformed several state-of-the-art methods.