• Title/Summary/Keyword: 이미지 라벨링

Search Result 67, Processing Time 0.023 seconds

Defect Classification of Cross-section of Additive Manufacturing Using Image-Labeling (이미지 라벨링을 이용한 적층제조 단면의 결함 분류)

  • Lee, Jeong-Seong;Choi, Byung-Joo;Lee, Moon-Gu;Kim, Jung-Sub;Lee, Sang-Won;Jeon, Yong-Ho
    • Journal of the Korean Society of Manufacturing Process Engineers
    • /
    • v.19 no.7
    • /
    • pp.7-15
    • /
    • 2020
  • Recently, the fourth industrial revolution has been presented as a new paradigm and additive manufacturing (AM) has become one of the most important topics. For this reason, process monitoring for each cross-sectional layer of additive metal manufacturing is important. Particularly, deep learning can train a machine to analyze, optimize, and repair defects. In this paper, image classification is proposed by learning images of defects in the metal cross sections using the convolution neural network (CNN) image labeling algorithm. Defects were classified into three categories: crack, porosity, and hole. To overcome a lack-of-data problem, the amount of learning data was augmented using a data augmentation algorithm. This augmentation algorithm can transform an image to 180 images, increasing the learning accuracy. The number of training and validation images was 25,920 (80 %) and 6,480 (20 %), respectively. An optimized case with a combination of fully connected layers, an optimizer, and a loss function, showed that the model accuracy was 99.7 % and had a success rate of 97.8 % for 180 test images. In conclusion, image labeling was successfully performed and it is expected to be applied to automated AM process inspection and repair systems in the future.

Convolutional Neural Network-based Iris Lesion Classification Algorithm (CNN기반 알츠하이머 치매 중증도 판별 알고리즘 오차 검증)

  • Kim, June-Gyeom;Seo, Jin-Beom;Cho, Young-Bok
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2021.10a
    • /
    • pp.100-101
    • /
    • 2021
  • In Korea, which has entered an aging society, 87% of the elderly population suffers from chronic diseases such as dementia and stroke, of which Alzheimer's dementia accounts for 71.3% of all dementia. In this paper, labeling verification was performed to review the error problem of deep learning results divided by Alzheimer's dementia MRI image into three stages.

  • PDF

Dataset Construction of Taekwondo Beginner AI (태권도 초심자를 위한 AI의 DataSet 구축)

  • Cho, Kyu Cheol;Kim, Ju Yeon
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.01a
    • /
    • pp.249-252
    • /
    • 2022
  • 세계 태권도 연맹은 국제 축구 연맹의 가입국과 동일한 수의 가입국을 보유할 만큼 태권도는 점점 더 세계적으로 나아가고 있다. 하지만 태권도의 교육방법은 예전과 다르지 않다. 도장의 관장이나 사범이 직접 자세를 눈으로 보고 판단하여 지도해야 한다. 본 연구는 기술이 발전하고 변화함에 따라 태권도를 조금 더 다양하고 흥미롭게 배울 수 있는 방법을 개발하고자 진행하였다. 본 논문에서는 피사체 모델을 촬영하여 이미지를 추출하고 이미지에서 사람의 관절 KeyPoint를 라벨링 한 후 이를 바탕으로 COCO 형식의 DataSet을 만들어낸다. 이후 이 DataSet을 기계에 학습을 시킨다면 초심자를 위한 교육용 태권도 AI가 만들어질 수 있다. 또한, 기계학습 이후 이 AI를 실제 교육현장에 적용하여 교육과정에 직접 사용할 수 있으며 이 AI를 바탕으로 교육용 게임 개발 등 다양한 방면으로 활용할 수 있을 것이라고 기대한다.

  • PDF

Allergy checking system using artificial intelligence (인공지능 기법을 이용한 알레르기 반응 여부 판단)

  • Kim, So-Young;Lee, Yang-Gyu;Jo, Eun-Young;Weon, Ill-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2022.11a
    • /
    • pp.553-555
    • /
    • 2022
  • 다양한 양념과 조리법을 활용한 식품의 수는 시간이 지날수록 증가하는 추세이다. 따라서 처음 접하는 식품의 알레르기를 판단하는 연구가 필요하다. 우리는 이미지만으로 알레르기 유발 성분을 판단하는 시스템을 제안한다. 알레르기 성분으로 라벨링한 식품 이미지에 VGGNet 알고리즘을 적용하여 실험을 진행하고 제안된 시스템의 유용성을 판단하였다.

Improved Anatomical Landmark Detection Using Attention Modules and Geometric Data Augmentation in X-ray Images (어텐션 모듈과 기하학적 데이터 증강을 통한 X-ray 영상 내 해부학적 랜드마크 검출 성능 향상)

  • Lee, Hyo-Jeong;Ma, Se-Rie;Choi, Jang-Hwan
    • Journal of the Korea Computer Graphics Society
    • /
    • v.28 no.3
    • /
    • pp.55-65
    • /
    • 2022
  • Recently, deep learning-based automated systems for identifying and detecting landmarks have been proposed. In order to train such a deep learning-based model without overfitting, a large amount of image and labeling data is required. Conventionally, an experienced reader manually identifies and labels landmarks in a patient's image. However, such measurement is not only expensive, but also has poor reproducibility, so the need for an automated labeling method has been raised. In addition, in the X-ray image, since various human tissues on the path through which the photons pass are displayed, it is difficult to identify the landmark compared to a general natural image or a 3D image modality image. In this study, we propose a geometric data augmentation technique that enables the generation of a large amount of labeling data in X-ray images. In addition, the optimal attention mechanism for landmark detection was presented through the implementation and application of various attention techniques to improve the detection performance of 16 major landmarks in the skull. Finally, among the major cranial landmarks, markers that ensure stable detection are derived, and these markers are expected to have high clinical application potential.

GAN System Using Noise for Image Generation (이미지 생성을 위해 노이즈를 이용한 GAN 시스템)

  • Bae, Sangjung;Kim, Mingyu;Jung, Hoekyung
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.24 no.6
    • /
    • pp.700-705
    • /
    • 2020
  • Generative adversarial networks are methods of generating images by opposing two neural networks. When generating the image, randomly generated noise is rearranged to generate the image. The image generated by this method is not generated well depending on the noise, and it is difficult to generate a proper image when the number of pixels of the image is small In addition, the speed and size of data accumulation in data classification increases, and there are many difficulties in labeling them. In this paper, to solve this problem, we propose a technique to generate noise based on random noise using real data. Since the proposed system generates an image based on the existing image, it is confirmed that it is possible to generate a more natural image, and if it is used for learning, it shows a higher hit rate than the existing method using the hostile neural network respectively.

Comparison of number plate recognition performance of Synthetic number plate generator using 2D and 3D rotation (3차원 회전을 이용한 인조 번호판 생성기의 번호판 인식 성능 비교)

  • Lee, Yu-Jin;Kim, Sang-Joon;Park, Gyeong-Moo;Park, Goo-Man
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2020.07a
    • /
    • pp.232-235
    • /
    • 2020
  • 최근 딥러닝을 이용한 자동차 번호판 인식 알고리즘에 있어서 인조 번호판을 생성하여 데이터 수집과 라벨링 작업 시간을 줄이기 위한 연구가 진행되고 있다. 하지만 인조 번호판의 특성상 정면의 이미지로 구성되어 있기 때문에 자동차의 정면에서 촬영된 번호판의 인식률은 높지만 측면에서 촬영된 번호판의 경우 인식률이 낮아진다. 본 논문에서는 다양한 카메라 설치 위치에 따른 다각도로 촬영된 번호판 영상의 인식률을 보완하기 위해 이미지를 3차원으로 회전하여 데이터를 생성하는 인조 번호판 생성기 프로그램을 개발하였다. 3차원 회전을 하였을 때 번호판 인식 성능을 비교하기 위해 기존 방식으로 생성한 번호판과 제안 방식으로 생성한 번호판 각 600,000장씩 생성하여 총 1,200,000장을 생성하였으며, 데이터의 비율에 따라 10가지의 학습 데이터 셋을 구성하였다. 인조 번호판 데이터의 학습 결과를 평가하기 위해 실제 번호판 이미지 1789장으로 테스트 셋을 구성하였고, 기존의 인조 번호판 생성 방식과 인식 정확도를 비교 분석하였다.

  • PDF

Panorama Image Stitching Using Sythetic Fisheye Image (Synthetic fisheye 이미지를 이용한 360° 파노라마 이미지 스티칭)

  • Kweon, Hyeok-Joon;Cho, Donghyeon
    • Journal of Broadcast Engineering
    • /
    • v.27 no.1
    • /
    • pp.20-30
    • /
    • 2022
  • Recently, as VR (Virtual Reality) technology has been in the spotlight, 360° panoramic images that can view lively VR contents are attracting a lot of attention. Image stitching technology is a major technology for producing 360° panorama images, and many studies are being actively conducted. Typical stitching algorithms are based on feature point-based image stitching. However, conventional feature point-based image stitching methods have a problem that stitching results are intensely affected by feature points. To solve this problem, deep learning-based image stitching technologies have recently been studied, but there are still many problems when there are few overlapping areas between images or large parallax. In addition, there is a limit to complete supervised learning because labeled ground-truth panorama images cannot be obtained in a real environment. Therefore, we produced three fisheye images with different camera centers and corresponding ground truth image through carla simulator that is widely used in the autonomous driving field. We propose image stitching model that creates a 360° panorama image with the produced fisheye image. The final experimental results are virtual datasets configured similar to the actual environment, verifying stitching results that are strong against various environments and large parallax.

Text extraction in images using simplify color and edges pattern analysis (색상 단순화와 윤곽선 패턴 분석을 통한 이미지에서의 글자추출)

  • Yang, Jae-Ho;Park, Young-Soo;Lee, Sang-Hun
    • Journal of the Korea Convergence Society
    • /
    • v.8 no.8
    • /
    • pp.33-40
    • /
    • 2017
  • In this paper, we propose a text extraction method by pattern analysis on contour for effective text detection in image. Text extraction algorithms using edge based methods show good performance in images with simple backgrounds, The images of complex background has a poor performance shortcomings. The proposed method simplifies the color of the image by using K-means clustering in the preprocessing process to detect the character region in the image. Enhance the boundaries of the object through the High pass filter to improve the inaccuracy of the boundary of the object in the color simplification process. Then, by using the difference between the expansion and erosion of the morphology technique, the edges of the object is detected, and the character candidate region is discriminated by analyzing the pattern of the contour portion of the acquired region to remove the unnecessary region (picture, background). As a final result, we have shown that the characters included in the candidate character region are extracted by removing unnecessary regions.

A Visitor Study of The Exhibition of Using Big Data Analysis which reflects viewing experiences

  • Kang, Ji-Su;Rhee, Bo-A
    • Journal of the Korea Society of Computer and Information
    • /
    • v.27 no.2
    • /
    • pp.81-89
    • /
    • 2022
  • This study aims to analyze the images of Instagram posts and to draw implcations regarding the exhibition of . This study collects and crawl 24,295 images from Instagram posts as a dataset. We use the Google Cloud Vision API for labeling the images and a total of 212,567 clusters of labels are finally classified into 9 categories using Word2Vec. The categories of museum spaces, photo zone, architecture category are dominant along with people category. In conclusion, visitors curate their experiences and memories of physical places and spaces while they are experiencing with the exhibition. This result reproves the results of previous studies which emphasize a sense of social presence and place making. The convergent approach of art management and art technology used in this study help museum professionals have an insight on big data based visitor research on a practical level.