• Title/Summary/Keyword: Image Learning

Defect Classification of Cross-section of Additive Manufacturing Using Image-Labeling (이미지 라벨링을 이용한 적층제조 단면의 결함 분류)

  • Lee, Jeong-Seong;Choi, Byung-Joo;Lee, Moon-Gu;Kim, Jung-Sub;Lee, Sang-Won;Jeon, Yong-Ho
    • Journal of the Korean Society of Manufacturing Process Engineers / v.19 no.7 / pp.7-15 / 2020
  • Recently, the fourth industrial revolution has been presented as a new paradigm, and additive manufacturing (AM) has become one of its most important topics. For this reason, process monitoring of each cross-sectional layer in metal additive manufacturing is important. In particular, deep learning can train a machine to analyze, optimize, and repair defects. In this paper, image classification is proposed by learning images of defects in metal cross sections using a convolutional neural network (CNN) image labeling algorithm. Defects were classified into three categories: crack, porosity, and hole. To overcome the lack of data, the amount of learning data was increased using a data augmentation algorithm. This algorithm can transform one image into 180 images, increasing the learning accuracy. The numbers of training and validation images were 25,920 (80%) and 6,480 (20%), respectively. An optimized combination of fully connected layers, optimizer, and loss function yielded a model accuracy of 99.7% and a success rate of 97.8% on 180 test images. In conclusion, image labeling was performed successfully, and it is expected to be applied to automated AM process inspection and repair systems in the future.
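
The counts reported in this abstract can be cross-checked with a few lines of arithmetic. The sketch below uses only the figures quoted above. The number of original images and the number of correct test predictions are inferences from those figures, not values stated in the abstract.

```python
# Counts reported in the abstract.
n_train = 25_920
n_val = 6_480
images_per_original = 180  # one source image becomes 180 augmented images

total = n_train + n_val                           # 32,400 augmented images
implied_originals = total // images_per_original  # inferred, not a reported figure

train_share = n_train / total  # fraction used for training
val_share = n_val / total      # fraction used for validation

# A 97.8% success rate on 180 test images is consistent with
# this many correct predictions (again an inference).
n_test = 180
implied_correct = round(0.978 * n_test)

print(total, implied_originals, train_share, val_share, implied_correct)
```

Under these figures the 80/20 split is exact, and the augmented set would correspond to 180 original images if every original was expanded by the same factor.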

Semantic Indoor Image Segmentation using Spatial Class Simplification (공간 클래스 단순화를 이용한 의미론적 실내 영상 분할)

  • Kim, Jung-hwan;Choi, Hyung-il
    • Journal of Internet Computing and Services / v.20 no.3 / pp.33-41 / 2019
  • In this paper, we propose a method that learns redesigned classes, consisting of background and object, for semantic segmentation of indoor scene images. Semantic image segmentation is a technique that divides an image into meaningful parts, such as walls and beds, at the pixel level. Previous work on semantic image segmentation has proposed methods that learn various object classes through neural networks, but their accuracy has been criticized as insufficient given their long learning times. However, for the problem of separating objects from backgrounds, there is no need to learn many object classes. We therefore concentrate on separating objects from backgrounds and propose a method that learns after class simplification. The accuracy of the proposed learning method is about 5 to 12% higher than that of existing methods. In addition, the learning time is reduced by about 14 to 60 minutes when the classes are configured differently in the same environment, which shows that the object-background separation problem can be learned efficiently.
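
Class simplification can be pictured as a remapping of a fine-grained label map to two classes before training. The following is a minimal sketch under stated assumptions: the class IDs and the choice of which classes count as background are hypothetical and are not taken from the paper.

```python
import numpy as np

# Hypothetical fine-grained label IDs for an indoor scene (not from the paper).
WALL, FLOOR, CEILING, BED, CHAIR, TABLE = range(6)
BACKGROUND_IDS = [WALL, FLOOR, CEILING]  # assumed background classes


def simplify_classes(label_map, background_ids=BACKGROUND_IDS):
    """Collapse a fine-grained label map to 0 = background, 1 = object."""
    is_background = np.isin(label_map, background_ids)
    return np.where(is_background, 0, 1).astype(np.uint8)


labels = np.array([[WALL, WALL, BED],
                   [FLOOR, CHAIR, TABLE]])
simplified = simplify_classes(labels)
print(simplified)
```

A segmentation network trained on `simplified` only has to predict two classes per pixel, which is the kind of reduction the abstract credits for the shorter learning time.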

Bolt-Loosening Detection using Vision-Based Deep Learning Algorithm and Image Processing Method (영상기반 딥러닝 및 이미지 프로세싱 기법을 이용한 볼트풀림 손상 검출)

  • Lee, So-Young;Huynh, Thanh-Canh;Park, Jae-Hyung;Kim, Jeong-Tae
    • Journal of the Computational Structural Engineering Institute of Korea / v.32 no.4 / pp.265-272 / 2019
  • In this paper, a vision-based deep learning algorithm and an image processing method are proposed to detect bolt-loosening in steel connections. To achieve this objective, the following approaches are implemented. First, a bolt-loosening detection method is designed that combines a regional convolutional neural network (RCNN)-based deep learning algorithm with a Hough line transform (HLT)-based image processing algorithm. The RCNN-based deep learning algorithm is developed to identify and crop bolts in a connection image. The HLT-based image processing algorithm is designed to estimate the bolt angles from the cropped bolt images. Then, the proposed vision-based method is evaluated for bolt-loosening detection in a lab-scale girder connection. The accuracies of the RCNN-based bolt detector and the HLT-based bolt angle estimator are examined with respect to various perspective distortions.
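
The Hough line transform step can be illustrated with a small voting accumulator. This is a generic sketch of the technique in NumPy, not the authors' implementation. The synthetic edge points, the 1-degree angular resolution, and the `max_rho` bound are assumptions chosen for illustration.

```python
import numpy as np


def strongest_line(edge_points, max_rho):
    """Return (theta_deg, rho) of the line that collects the most votes.

    A line is parameterised as rho = x*cos(theta) + y*sin(theta), where theta
    is the direction of the line's normal, sampled in 1-degree steps in [0, 180).
    """
    thetas = np.deg2rad(np.arange(180))
    accumulator = np.zeros((180, 2 * max_rho + 1), dtype=int)
    for x, y in edge_points:
        # Each point votes once per theta, for the rho it would imply.
        rhos = np.round(x * np.cos(thetas) + y * np.sin(thetas)).astype(int)
        accumulator[np.arange(180), rhos + max_rho] += 1
    theta_idx, rho_idx = np.unravel_index(accumulator.argmax(), accumulator.shape)
    return int(theta_idx), int(rho_idx - max_rho)


# Synthetic "bolt edge": 100 points on the horizontal line y = 5.
points = [(x, 5) for x in range(100)]
theta_deg, rho = strongest_line(points, max_rho=150)
print(theta_deg, rho)
```

Because theta describes the normal, a horizontal edge is reported with theta of 90 degrees. Comparing such edge angles between a reference image and a later image is one plausible way to turn detected lines into a rotation estimate.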

Evaluation of Machine Learning Methods to Reduce Stripe Artifacts in the Phase Contrast Image due to Line-Integration Process (선적분에 의한 위상차 영상의 줄무늬 아티팩트 감소를 위한 기계학습법에 대한 평가)

  • Kim, Myungkeun;Oh, Ohsung;Lee, Seho;Lee, Seung Wook
    • Journal of the Korean Society of Radiology / v.14 no.7 / pp.937-946 / 2020
  • The grating interferometer provides a differential phase contrast image of a phase object, produced by refraction of the wavefront by the object, and this image needs to be converted to a phase contrast image. The line-integration process that obtains the phase contrast image from the differential phase contrast image accumulates noise and generates stripe artifacts. In the line-integrated phase contrast image, the noise and distortion of the stripe artifacts increase along the integration direction. In this study, we configured and compared several machine learning methods to reduce the artifacts. The machine learning methods were applied to simulated numerical phantoms as well as to experimental data from X-ray and neutron grating interferometers for comparison. As a result, the combination of wavelet preprocessing and a machine learning method (WCNN) was shown to be the most effective.
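
The accumulation of noise under line integration can be reproduced numerically. The sketch below is an illustration under simple assumptions (a flat phase object and independent unit-variance Gaussian noise per pixel). It does not model the interferometer or the methods compared in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical setup: a flat phase object, so the true differential signal is
# zero and every measured pixel is pure noise with standard deviation 1.
n_rows, n_cols = 2000, 256
differential = rng.normal(0.0, 1.0, size=(n_rows, n_cols))

# Line integration: cumulative sum along each row (the integration direction).
integrated = np.cumsum(differential, axis=1)

# Noise level across rows at each position along the integration direction.
noise_std = integrated.std(axis=0)
print(noise_std[0], noise_std[-1])
```

With independent noise, the standard deviation after k integration steps grows roughly as the square root of k. Each row also drifts independently of its neighbours, which is why the error shows up as stripes along the integration direction.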

Text Augmentation Using Hierarchy-based Word Replacement

  • Kim, Museong;Kim, Namgyu
    • Journal of the Korea Society of Computer and Information / v.26 no.1 / pp.57-67 / 2021
  • Recently, multi-modal deep learning techniques that combine heterogeneous data for deep learning analysis have been widely used. In particular, studies on text-to-image synthesis, which automatically generates images from text, are being actively conducted. Deep learning for image synthesis requires a vast amount of data consisting of pairs of images and text describing each image. Therefore, various data augmentation techniques have been devised to generate a large amount of data from a small amount. A number of text augmentation techniques based on synonym replacement have been proposed so far. However, these techniques share a common limitation: replacing a noun with a synonym may produce text that does not match the content of the image. In this study, we propose a text augmentation method that replaces nouns using word hierarchy information. Additionally, we performed experiments using MSCOCO data to evaluate the performance of the proposed methodology.
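
Hierarchy-based replacement can be sketched with a toy noun hierarchy. The example assumes that a noun is replaced by its parent (a more general term), so the caption stays true to the image. The abstract does not state the exact replacement rule, and the `HYPERNYM` table and `augment_caption` helper are hypothetical.

```python
# Toy noun hierarchy (child -> parent); hypothetical, not taken from the paper.
HYPERNYM = {
    "puppy": "dog",
    "dog": "animal",
    "kitten": "cat",
    "cat": "animal",
    "sedan": "car",
    "car": "vehicle",
}


def augment_caption(caption, hierarchy=HYPERNYM):
    """Return variants of `caption`, each with one noun replaced by its parent."""
    words = caption.split()
    variants = []
    for i, word in enumerate(words):
        parent = hierarchy.get(word)
        if parent is not None:
            variants.append(" ".join(words[:i] + [parent] + words[i + 1:]))
    return variants


variants = augment_caption("a puppy sits near a sedan")
print(variants)
```

A real pipeline would need part-of-speech tagging and a full lexical hierarchy rather than a hand-written dictionary. The point of the sketch is only that moving up the hierarchy generalises a caption without contradicting the image.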

Research Trend of the Remote Sensing Image Analysis Using Deep Learning (딥러닝을 이용한 원격탐사 영상분석 연구동향)

  • Kim, Hyungwoo;Kim, Minho;Lee, Yangwon
    • Korean Journal of Remote Sensing / v.38 no.5_3 / pp.819-834 / 2022
  • Artificial Intelligence (AI) techniques have been effectively used for image classification, object detection, and image segmentation. Along with the recent advancement of computing power, deep learning models can build deeper and thicker networks and achieve better performance by creating more appropriate feature maps based on effective activation functions and optimizer algorithms. This review paper examined technical and academic trends of Convolutional Neural Network (CNN) and Transformer models that are emerging techniques in remote sensing and suggested their utilization strategies and development directions. A timely supply of satellite images and real-time processing for deep learning to cope with disaster monitoring will be required for future work. In addition, a big data platform dedicated to satellite images should be developed and integrated with drone and Closed-circuit Television (CCTV) images.

Character Recognition Algorithm in Low-Quality Legacy Contents Based on Alternative End-to-End Learning (대안적 통째학습 기반 저품질 레거시 콘텐츠에서의 문자 인식 알고리즘)

  • Lee, Sung-Jin;Yun, Jun-Seok;Park, Seon-hoo;Yoo, Seok Bong
    • Journal of the Korea Institute of Information and Communication Engineering / v.25 no.11 / pp.1486-1494 / 2021
  • Character recognition is a technology required on various platforms, such as smart parking and text-to-speech, and many studies are being conducted to improve its performance. However, when low-quality images are used for character recognition, the resolution of the training images differs from that of the test images, resulting in poor accuracy. To solve this problem, this paper designs an end-to-end learning neural network that combines image super-resolution and character recognition, so that the character recognition model is robust to data of varying quality, and implements an alternative end-to-end learning algorithm to train the whole network. The alternative end-to-end learning and a recognition performance test were conducted using license plate images, chosen from among various text images, and the performance test verified the effectiveness of the proposed algorithm.

A Study on Deep Learning Structure of Multi-Block Method for Improving Face Recognition (얼굴 인식률 향상을 위한 멀티 블록 방식의 딥러닝 구조에 관한 연구)

  • Ra, Seung-Tak;Kim, Hong-Jik;Lee, Seung-Ho
    • Journal of IKEEE / v.22 no.4 / pp.933-940 / 2018
  • In this paper, we propose a multi-block deep learning structure for improving the face recognition rate. The proposed recognition structure consists of three steps: multi-blocking of the input image, multi-block selection by numerical analysis of facial features, and deep learning on the selected multi-blocks. First, the input image is divided into four blocks. Second, in the multi-block selection step, the feature values of the four blocks are checked, and only the blocks with many features are selected. Third, deep learning is performed on the selected blocks, and recognition is carried out with the deep learning model trained on those high-feature blocks. To evaluate the performance of the proposed deep learning structure, we used the CAS-PEAL face database. Experimental results show that the proposed multi-block deep learning structure achieves a 2.3% higher face recognition rate than the existing deep learning structure.
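
The first two steps, splitting into four blocks and keeping the feature-rich ones, can be sketched as follows. Pixel variance is used as a stand-in feature score and `keep=2` is an arbitrary choice. The abstract does not specify the paper's feature measure or how many blocks are retained.

```python
import numpy as np


def split_into_four(image):
    """Split a 2-D image into quadrants: top-left, top-right, bottom-left, bottom-right."""
    h, w = image.shape
    return [image[:h // 2, :w // 2], image[:h // 2, w // 2:],
            image[h // 2:, :w // 2], image[h // 2:, w // 2:]]


def select_blocks(blocks, keep=2):
    """Return the indices of the `keep` blocks with the highest feature score.

    Pixel variance is an assumed stand-in for the paper's feature value.
    """
    scores = [float(block.var()) for block in blocks]
    ranked = sorted(range(len(blocks)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:keep])


# Synthetic 8x8 image: flat except for texture in the top-right and bottom-left quadrants.
image = np.zeros((8, 8))
image[:4, 4:] = np.arange(16).reshape(4, 4)
image[4:, :4] = np.arange(16).reshape(4, 4) * 2
selected = select_blocks(split_into_four(image))
print(selected)
```

Only the selected blocks would then be passed to the recognition network, which is the third step described in the abstract.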

Performance Analysis of Data Augmentation for Surface Defects Detection (표면 결함 검출을 위한 데이터 확장 및 성능분석)

  • Kim, Junbong;Seo, Kisung
    • The Transactions of The Korean Institute of Electrical Engineers / v.67 no.5 / pp.669-674 / 2018
  • Data augmentation is an efficient way to reduce model overfitting and to improve performance by supplementing the training set with extra data. It is even more important in deep-learning-based industrial machine vision, because deep learning requires a huge amount of training data, whereas data acquisition is limited in most industrial applications. A very generic method for augmenting image data is to apply geometric transformations, such as cropping, rotating, and translating, and to adjust the brightness of the image. The effectiveness of data augmentation in image classification has been reported, but such reports are rare for defect inspection. We explore and compare various basic augmentation operations for metal surface defects. The experiments were executed for various types of defects and different CNN networks, and the performance improvements from data augmentation were analysed.
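
The basic operations named in this abstract (cropping, rotating, translating, adjusting brightness) can each be written in a line of NumPy. This is a minimal sketch: the crop size, the 2-pixel shift, and the +40 brightness offset are arbitrary illustrative values, and the translation wraps around the image border for simplicity.

```python
import numpy as np


def augment(image):
    """Return simple augmented variants of a 2-D grayscale image with values in [0, 255]."""
    h, w = image.shape
    return {
        # Central crop that keeps the middle half of each dimension.
        "crop": image[h // 4: h - h // 4, w // 4: w - w // 4],
        # Rotation by 90 degrees.
        "rotate90": np.rot90(image),
        # Shift 2 pixels to the right, wrapping around the border.
        "translate": np.roll(image, shift=2, axis=1),
        # Brightness offset, clipped to the valid range.
        "brighter": np.clip(image.astype(np.int32) + 40, 0, 255).astype(np.uint8),
    }


image = np.arange(64, dtype=np.uint8).reshape(8, 8)
variants = augment(image)
print({name: v.shape for name, v in variants.items()})
```

Applying several such operations, alone and in combination, to each defect image is the usual way a small industrial dataset is expanded before training.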

A study on the image design PBL class that can be used for e-Digital contents production

  • Ahn, In-Soo
    • Journal of the Korea Society of Computer and Information / v.23 no.2 / pp.77-82 / 2018
  • In this paper, we propose an improvement plan to increase the learning effect and satisfaction in a PBL-related video design class. To prepare for the Fourth Industrial Revolution era, we must acquire diverse knowledge and skills to discover problems and solve them creatively. Therefore, various learning methods are being studied, and one of them is PBL. Unlike existing curriculum-based education methods, PBL is a learner-centered form of education that explores problems that may arise from specific topics and finds solutions to them. In this study, two lectures on video design, covering video content and image content, were taught as PBL classes; the problems of the PBL classes were analyzed and an improvement plan was studied.