• Title/Summary/Keyword: 인공지능 기반 이미지 생성

Search Result 79, Processing Time 0.029 seconds

Map-Based Obstacle Avoidance Algorithm for Mobile Robot Using Deep Reinforcement Learning (심층 강화학습을 이용한 모바일 로봇의 맵 기반 장애물 회피 알고리즘)

  • Sunwoo, Yung-Min;Lee, Won-Chang
    • Journal of IKEEE
    • /
    • v.25 no.2
    • /
    • pp.337-343
    • /
    • 2021
  • Deep reinforcement learning is an artificial intelligence algorithm that enables learners to select optimal behavior based on raw and, high-dimensional input data. A lot of research using this is being conducted to create an optimal movement path of a mobile robot in an environment in which obstacles exist. In this paper, we selected the Dueling Double DQN (D3QN) algorithm that uses the prioritized experience replay to create the moving path of mobile robot from the image of the complex surrounding environment. The virtual environment is implemented using Webots, a robot simulator, and through simulation, it is confirmed that the mobile robot grasped the position of the obstacle in real time and avoided it to reach the destination.

Phychological Counseling Service using CNN (Convolutional Neural Network) (CNN을 이용한 심리 상담 서비스에 관한 연구)

  • Kim, Jungwook;Kang, Byunghun;Kim, Mingyu;Yoo, Seunghan
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.11a
    • /
    • pp.834-837
    • /
    • 2020
  • CNN(Convolution Neural Network)은 합성곱(Convolution)을 이용해서 시각적 이미지를 분석하는데 사용되는 인공지능 기술이다. 본 논문에서는 CNN을 이용한 실시간 심리 상담 서비스에 대해 논한다. 상담 서비스에 심리학과 CNN을 접목시킴으로써 내담자의 사진을 심리학적 비언어 행동을 기반으로 분석하여 내담자의 예상 심리를 파악하고, 유의미한 상담 자료를 생성해 상담의 질을 향상시킬 수 있도록 한다.

A Study on Generative Artificial Intelligence-Based Data Augmentation Techniques for Enhancing Object Detection Performance (객체 탐지 성능 향상을 위한 생성형 인공지능 기반 데이터 증강 기법 연구)

  • Dohee Kim;Myongho Kim
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2023.07a
    • /
    • pp.51-54
    • /
    • 2023
  • 최근 딥러닝 기술의 발달로 물체 탐지를 위한 객체 인식 분야가 기계학습을 접목한 연구가 급격히 증가하고 있다. 하지만, 탐지하려는 물체가 다른 객체에 가려진 경우와 같이 특수한 상황에 대한 데이터의 수량이 부족하여 성능 저하를 야기한다는 점과, 객체 탐지 수행 과정에서 작은 객체의 탐지가 어렵다는 한계점이 있다. 본 연구는 전술한 문제점을 보완할 방법을 제안한다. 데이터 증강 기법을 이용하여 클래스가 부족한 데이터의 양을 늘려 학습 데이터를 증강시켰다. 한편, SRGAN을 사용하여 작은 객체를 확대시킨 뒤 이미지를 합성시켜 데이터를 구성하였다. 제안된 방법은 PyTorch 환경에서 YOLOv5를 수행한 결과, 객체 탐지 성능이 향상되는 것을 확인할 수 있었다.

  • PDF

Yoga Poses Image Classification and Interpretation Using Explainable AI (XAI) (XAI 를 활용한 설명 가능한 요가 자세 이미지 분류 모델)

  • Yu Rim Park;Hyon Hee Kim
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2023.05a
    • /
    • pp.590-591
    • /
    • 2023
  • 최근 사람들의 건강에 대한 관심이 많아지고 다양한 운동 컨텐츠가 확산되면서 실내에서 운동을 할 수 있는 기회가 많아졌다. 하지만, 전문가의 도움없이 정확하지 않은 동작을 수행하다 큰 부상을 입을 위험성이 높다. 본 연구는 CNN 기반 요가 자세 분류 모델을 생성하고 설명가능 인공지능 기술을 적용하여 예측 결과에 대한 해석을 제시한다. 사용자에게 설명성과 신뢰성 있는 모델을 제공하여 자신에게 맞게 올바른 자세를 결정할 수 있고, 무리한 동작으로 부상을 입을 확률 또한 낮출 수 있을 것으로 보인다.

Generating A Synthetic Multimodal Dataset for Vision Tasks Involving Hands (손을 다루는 컴퓨터 비전 작업들을 위한 멀티 모달 합성 데이터 생성 방법)

  • Lee, Changhwa;Lee, Seongyeong;Kim, Donguk;Jeong, Chanyang;Baek, Seungryul
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.11a
    • /
    • pp.1052-1055
    • /
    • 2020
  • 본 논문에서는 3D 메시 정보, RGB-D 손 자세 및 2D/3D 손/세그먼트 마스크를 포함하여 인간의 손과 관련된 다양한 컴퓨터 비전 작업에 사용할 수 있는 새로운 다중 모달 합성 벤치마크를 제안 하였다. 생성된 데이터셋은 기존의 대규모 데이터셋인 BigHand2.2M 데이터셋과 변형 가능한 3D 손 메시(mesh) MANO 모델을 활용하여 다양한 손 포즈 변형을 다룬다. 첫째, 중복되는 손자세를 줄이기 위해 전략적으로 샘플링하는 방법을 이용하고 3D 메시 모델을 샘플링된 손에 피팅한다. 3D 메시의 모양 및 시점 파라미터를 탐색하여 인간 손 이미지의 자연스러운 가변성을 처리한다. 마지막으로, 다중 모달리티 데이터를 생성한다. 손 관절, 모양 및 관점의 데이터 공간을 기존 벤치마크의 데이터 공간과 비교한다. 이 과정을 통해 제안된 벤치마크가 이전 작업의 차이를 메우고 있음을 보여주고, 또한 네트워크 훈련 과정에서 제안된 데이터를 사용하여 RGB 기반 손 포즈 추정 실험을 하여 생성된 데이터가 양질의 질과 양을 가짐을 보여준다. 제안된 데이터가 RGB 기반 3D 손 포즈 추정 및 시맨틱 손 세그멘테이션과 같은 품질 좋은 큰 데이터셋이 부족하여 방해되었던 작업에 대한 발전을 가속화할 것으로 기대된다.

Development of Deep Recognition of Similarity in Show Garden Design Based on Deep Learning (딥러닝을 활용한 전시 정원 디자인 유사성 인지 모형 연구)

  • Cho, Woo-Yun;Kwon, Jin-Wook
    • Journal of the Korean Institute of Landscape Architecture
    • /
    • v.52 no.2
    • /
    • pp.96-109
    • /
    • 2024
  • The purpose of this study is to propose a method for evaluating the similarity of Show gardens using Deep Learning models, specifically VGG-16 and ResNet50. A model for judging the similarity of show gardens based on VGG-16 and ResNet50 models was developed, and was referred to as DRG (Deep Recognition of similarity in show Garden design). An algorithm utilizing GAP and Pearson correlation coefficient was employed to construct the model, and the accuracy of similarity was analyzed by comparing the total number of similar images derived at 1st (Top1), 3rd (Top3), and 5th (Top5) ranks with the original images. The image data used for the DRG model consisted of a total of 278 works from the Le Festival International des Jardins de Chaumont-sur-Loire, 27 works from the Seoul International Garden Show, and 17 works from the Korea Garden Show. Image analysis was conducted using the DRG model for both the same group and different groups, resulting in the establishment of guidelines for assessing show garden similarity. First, overall image similarity analysis was best suited for applying data augmentation techniques based on the ResNet50 model. Second, for image analysis focusing on internal structure and outer form, it was effective to apply a certain size filter (16cm × 16cm) to generate images emphasizing form and then compare similarity using the VGG-16 model. It was suggested that an image size of 448 × 448 pixels and the original image in full color are the optimal settings. Based on these research findings, a quantitative method for assessing show gardens is proposed and it is expected to contribute to the continuous development of garden culture through interdisciplinary research moving forward.

Comparative Analysis of Self-supervised Deephashing Models for Efficient Image Retrieval System (효율적인 이미지 검색 시스템을 위한 자기 감독 딥해싱 모델의 비교 분석)

  • Kim Soo In;Jeon Young Jin;Lee Sang Bum;Kim Won Gyum
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.12 no.12
    • /
    • pp.519-524
    • /
    • 2023
  • In hashing-based image retrieval, the hash code of a manipulated image is different from the original image, making it difficult to search for the same image. This paper proposes and evaluates a self-supervised deephashing model that generates perceptual hash codes from feature information such as texture, shape, and color of images. The comparison models are autoencoder-based variational inference models, but the encoder is designed with a fully connected layer, convolutional neural network, and transformer modules. The proposed model is a variational inference model that includes a SimAM module of extracting geometric patterns and positional relationships within images. The SimAM module can learn latent vectors highlighting objects or local regions through an energy function using the activation values of neurons and surrounding neurons. The proposed method is a representation learning model that can generate low-dimensional latent vectors from high-dimensional input images, and the latent vectors are binarized into distinguishable hash code. From the experimental results on public datasets such as CIFAR-10, ImageNet, and NUS-WIDE, the proposed model is superior to the comparative model and analyzed to have equivalent performance to the supervised learning-based deephashing model. The proposed model can be used in application systems that require low-dimensional representation of images, such as image search or copyright image determination.

Pose Creation of Character in Two-Dimensional Cartoon through Human Pose Estimation (인간자세 추정방법에 의한 2차원 웹툰 캐릭터 포즈 생성)

  • Jeong, Hieyong;Shin, Choonsung
    • Journal of Broadcast Engineering
    • /
    • v.27 no.5
    • /
    • pp.718-727
    • /
    • 2022
  • The Korean domestic cartoon industry has grown explosively by 65% compared to the previous year. Then the market size is expected to exceed KRW 1 trillion. However, excessive work results in health deterioration. Moreover, this working environment makes the production of human resources insufficient, repeating a vicious cycle. Although some tasks require creation activity during cartoon production, there are still a lot of simple repetitive tasks. Therefore, this study aimed to develop a method for creating a character pose through human pose estimation (HPE). The HPE is to detect key points for each joint of a user. The primary role of the proposed method was to make each joint of the character match that of the human. The proposed method enabled us to create the pose of the two-dimensional cartoon character through the results. Furthermore, it was possible to save the static image for one character pose and the video for continuous character pose.

A Study for Generation of Artificial Lunar Topography Image Dataset Using a Deep Learning Based Style Transfer Technique (딥러닝 기반 스타일 변환 기법을 활용한 인공 달 지형 영상 데이터 생성 방안에 관한 연구)

  • Na, Jong-Ho;Lee, Su-Deuk;Shin, Hyu-Soung
    • Tunnel and Underground Space
    • /
    • v.32 no.2
    • /
    • pp.131-143
    • /
    • 2022
  • The lunar exploration autonomous vehicle operates based on the lunar topography information obtained from real-time image characterization. For highly accurate topography characterization, a large number of training images with various background conditions are required. Since the real lunar topography images are difficult to obtain, it should be helpful to be able to generate mimic lunar image data artificially on the basis of the planetary analogs site images and real lunar images available. In this study, we aim to artificially create lunar topography images by using the location information-based style transfer algorithm known as Wavelet Correct Transform (WCT2). We conducted comparative experiments using lunar analog site images and real lunar topography images taken during China's and America's lunar-exploring projects (i.e., Chang'e and Apollo) to assess the efficacy of our suggested approach. The results show that the proposed techniques can create realistic images, which preserve the topography information of the analog site image while still showing the same condition as an image taken on lunar surface. The proposed algorithm also outperforms a conventional algorithm, Deep Photo Style Transfer (DPST) in terms of temporal and visual aspects. For future work, we intend to use the generated styled image data in combination with real image data for training lunar topography objects to be applied for topographic detection and segmentation. It is expected that this approach can significantly improve the performance of detection and segmentation models on real lunar topography images.

Collection and Utilization of Unstructured Environmental Disaster by Using Disaster Information Standardization (재난정보 표준화를 통한 환경 재난정보 수집 및 활용)

  • Lee, Dong Seop;Kim, Byung Sik
    • Ecology and Resilient Infrastructure
    • /
    • v.6 no.4
    • /
    • pp.236-242
    • /
    • 2019
  • In this study, we developed the system that can collect and store environmental disaster data into the database and use it for environmental disaster management by converting structured and unstructured documents such as images into electronic documents. In the 4th Industrial Revolution, various intelligent technologies have been developed in many fields. Environmental disaster information is one of important elements of disaster cycle. Environment disaster information management refers to the act of managing and processing electronic data about disaster cycle. However, these information are mainly managed in the structured and unstructured form of reports. It is necessary to manage unstructured data for disaster information. In this paper, the intelligent generation approach is used to convert handout into electronic documents. Following that, the converted disaster data is organized into the disaster code system as disaster information. Those data are stored into the disaster database system. These converted structured data is managed in a standardized disaster information form connected with the disaster code system. The disaster code system is covered that the structured information is stored and retrieve on entire disaster cycle. The expected effect of this research will be able to apply it to smart environmental disaster management and decision making by combining artificial intelligence technologies and historical big data.