• Title/Summary/Keyword: Deep Learning Convergence Study

A Case Study of Creative Art Based on AI Generation Technology

  • Qianqian Jiang;Jeanhun Chung
    • International journal of advanced smart convergence / v.12 no.2 / pp.84-89 / 2023
  • In recent years, with breakthroughs in deep learning algorithms such as Generative Adversarial Networks (GANs) and Variational Autoencoders (VAEs), AI generation technology has rapidly expanded into various sub-sectors of the art field. 2022 was the breakout year for AI-generated art: many excellent works were produced, especially in AI-generated creative design, and the technology improved the efficiency of art design work. This study analyzes the application and design characteristics of AI generation technology in two subfields of artistic creative design, AI painting and AI animation production, and compares traditional painting with AI painting. It then summarizes the advantages and problems that arise in the AI creative design process. Although AI art design is constrained by technical limitations, so that artworks still contain flaws and practical problems such as copyright and income remain, it provides strong technical support for expanding the subdivisions of artistic innovation and technology integration, and it has extremely high research value.

Study of an AI Model for Airfoil Parameterization and Aerodynamic Coefficient Prediction from Image Data (이미지 데이터를 이용한 익형 매개변수화 및 공력계수 예측을 위한 인공지능 모델 연구)

  • Seung Hun Lee;Bo Ra Kim;Jeong Hun Lee;Joon Young Kim;Min Yoon
    • Journal of the Korean Society of Visualization / v.21 no.2 / pp.83-90 / 2023
  • The shape of an airfoil is a critical factor in determining aerodynamic characteristics such as lift and drag. The aerodynamic properties of an airfoil have a decisive impact on the performance of various engineering applications, including airplane wings and wind turbine blades. Therefore, it is essential to analyze the aerodynamic characteristics of airfoils. Various analysis tools such as experiments, computational fluid dynamics, and Xfoil are used for these analyses, but each tool has its limitations. In this study, airfoil parameterization, image recognition, and artificial intelligence are combined to overcome these limitations. Image and coordinate data are collected from the UIUC airfoil database. Airfoil parameterization is performed by recognizing airfoil shapes in the image data to build a database for deep learning. The trained model can predict the aerodynamic characteristics not only of airfoil images but also of sketches. The mean absolute error on data not used for training is 0.0091.

Context-Awareness Cat Behavior Captioning System (반려묘의 상황인지형 행동 캡셔닝 시스템)

  • Chae, Heechan;Choi, Yoona;Lee, Jonguk;Park, Daihee;Chung, Yongwha
    • Journal of Korea Multimedia Society / v.24 no.1 / pp.21-29 / 2021
  • With the recent increase in the number of households raising pets, various engineering studies on pets have been underway. The ultimate goal of this study is to automatically generate context-sensitive captions that express the implicit intentions behind a cat's behavior and sounds, by embedding already mature pet behavior detection technology as a basic building block in video captioning research. As a pilot project toward this goal, this paper proposes a high-level captioning system that uses optical-flow, RGB, and sound information from cat videos. The proposed system uses video datasets collected in an actual rearing environment to extract feature vectors from the video and sound, then uses a hierarchical LSTM encoder and decoder to identify the cat's behavior and its implicit intentions and to learn to generate context-sensitive captions. The performance of the proposed system was verified experimentally using video data collected in an environment where cats are actually raised.

Development of Deep Learning Model for Detecting Road Cracks Based on Drone Image Data (드론 촬영 이미지 데이터를 기반으로 한 도로 균열 탐지 딥러닝 모델 개발)

  • Young-Ju Kwon;Sung-ho Mun
    • Land and Housing Review / v.14 no.2 / pp.125-135 / 2023
  • Drones are used in various fields, including land surveying, transportation, forestry and agriculture, marine, environment, disaster prevention, water resources, cultural assets, and construction, as their industrial importance and market size have increased. In this study, image data for deep learning were collected with a Mavic 3 drone at a shooting altitude of 20 m and ×7 magnification. Swin Transformer and UperNet were employed as the backbone and architecture of the deep learning model. About 800 labeled images were augmented to increase the amount of data. Training was carried out in three rounds: the cross-entropy loss function was used in the first and second rounds, and the Tversky loss function was used in the third. If the crack detection model is advanced through further research and convergence with the Internet of Things (IoT), it will also be possible to detect patching and potholes. In addition, real-time detection by drones is expected to make it possible to quickly identify pavement sections that need maintenance.
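
The abstract names the Tversky loss for the third training round but gives no formula or weights. The following is a minimal sketch of the standard Tversky loss for binary segmentation, written in plain Python for readability. The function name, the flattened-pixel inputs, and the default `alpha` and `beta` values are assumptions for illustration, not the authors' settings.

```python
def tversky_loss(probs, targets, alpha=0.5, beta=0.5, eps=1e-7):
    """Tversky loss over flattened pixels of a binary segmentation mask.

    probs   -- predicted crack probabilities in [0, 1]
    targets -- ground-truth labels (1 = crack, 0 = background)
    alpha   -- weight on false positives (assumed value, not from the paper)
    beta    -- weight on false negatives (assumed value, not from the paper)
    """
    tp = sum(p * t for p, t in zip(probs, targets))        # soft true positives
    fp = sum(p * (1 - t) for p, t in zip(probs, targets))  # soft false positives
    fn = sum((1 - p) * t for p, t in zip(probs, targets))  # soft false negatives
    tversky_index = (tp + eps) / (tp + alpha * fp + beta * fn + eps)
    return 1.0 - tversky_index


# One missed crack pixel, scored with two different weightings.
probs = [1.0, 0.0, 0.0, 0.0]
targets = [1, 1, 0, 0]
loss_recall_heavy = tversky_loss(probs, targets, alpha=0.3, beta=0.7)
loss_precision_heavy = tversky_loss(probs, targets, alpha=0.7, beta=0.3)
```

With `alpha = beta = 0.5` the index reduces to the Dice coefficient. Raising `beta` above `alpha` penalizes missed crack pixels more heavily, which can help when cracks occupy only a small fraction of each image.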

Artificial Intelligence and Air Pollution : A Bibliometric Analysis from 2012 to 2022

  • Yong Sauk Hau
    • International journal of advanced smart convergence / v.13 no.1 / pp.48-56 / 2024
  • The application of artificial intelligence (AI) is becoming increasingly important in coping with air pollution. AI is effective in various ways, including air pollution forecasting, monitoring, and control, and this is attracting a lot of attention. That attention has created a strong need to analyze studies on AI and air pollution. To help meet this need, this study performed bibliometric analyses of studies on AI and air pollution from 2012 to 2022 using the Web of Science database. It analyzed them from various aspects, such as the trend in the number of articles, the trend in the number of citations, the top 10 countries of origin, the top 10 research organizations, the top 10 research funding agencies, the top 10 journals, the top 10 articles in terms of total citations, and the distribution by language. This study not only reports the bibliometric analysis results but also reveals eight distinct features of the research stream in studies on AI and air pollution, identified from those results. These findings are expected to make a useful contribution to understanding the research stream in AI and air pollution.

Comparative Analysis of CNN Deep Learning Model Performance Based on Quantification Application for High-Speed Marine Object Classification (고속 해상 객체 분류를 위한 양자화 적용 기반 CNN 딥러닝 모델 성능 비교 분석)

  • Lee, Seong-Ju;Lee, Hyo-Chan;Song, Hyun-Hak;Jeon, Ho-Seok;Im, Tae-ho
    • Journal of Internet Computing and Services / v.22 no.2 / pp.59-68 / 2021
  • As artificial intelligence (AI) technologies, which have grown rapidly in recent years, have begun to be applied to marine environments such as ships, there has been active research on the application of CNN-based models specialized for digital video. In the E-Navigation service, which combines various technologies to detect floating objects that pose a collision risk, reduce human error, and prevent fires inside ships, real-time processing is of great importance. Adding more functions, however, requires high-performance processors, which raises prices and places a cost burden on shipowners. This study therefore proposes a method that processes information at high speed while maintaining accuracy by applying quantization techniques to a deep learning model. First, videos were pre-processed for the detection of floating objects at sea so that video data could be passed efficiently to the input of the deep learning model. Second, quantization, one of the techniques for making deep learning models lightweight, was applied to reduce memory usage and increase processing speed. Finally, the proposed deep learning model, with video pre-processing and quantization applied, was deployed on various embedded boards to measure its accuracy and processing speed and to test its performance. The proposed method reduced memory usage by a factor of four and improved processing speed by about four to five times while maintaining the original recognition accuracy.
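
The abstract does not say which quantization scheme or bit width was used. A fourfold memory reduction is what one would expect when 32-bit floating-point weights are stored as 8-bit integers, so the sketch below assumes symmetric per-tensor int8 quantization. The function names and the sample weights are hypothetical.

```python
from array import array


def quantize_int8(weights):
    """Symmetric per-tensor quantization of float weights to int8 (assumed scheme)."""
    max_abs = max(abs(w) for w in weights) or 1.0  # avoid a zero scale for all-zero input
    scale = max_abs / 127.0
    # Round to the nearest integer step and clamp to the int8 range used here.
    quantized = array("b", (max(-127, min(127, round(w / scale))) for w in weights))
    return quantized, scale


def dequantize(quantized, scale):
    """Map int8 values back to approximate float weights."""
    return [q * scale for q in quantized]


# Hypothetical float32 weights, for illustration only.
weights = array("f", [0.42, -1.27, 0.05, 0.90, -0.33, 1.10])
quantized, scale = quantize_int8(weights)
restored = dequantize(quantized, scale)

float32_bytes = weights.itemsize * len(weights)   # 4 bytes per weight
int8_bytes = quantized.itemsize * len(quantized)  # 1 byte per weight
```

Each restored weight differs from the original by at most half a quantization step (`scale / 2`), which is the accuracy cost traded for the smaller footprint. Speed gains on an embedded board depend on whether the hardware has fast integer arithmetic, which this sketch does not model.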

Estimation of Bridge Vehicle Loading using CCTV images and Deep Learning (CCTV 영상과 딥러닝을 이용한 교량통행 차량하중 추정)

  • Suk-Kyoung Bae;Wooyoung Jeong;Soohyun Choi;Byunghyun Kim;Soojin Cho
    • Journal of the Korea institute for structural maintenance and inspection / v.28 no.3 / pp.10-18 / 2024
  • Vehicle loading is one of the main causes of bridge deterioration. Although WiM (Weigh in Motion) can be used to measure vehicle loading on a bridge, it has the disadvantage of high installation and maintenance costs because it is a contact-based system. In this study, a non-contact method is proposed to estimate the vehicle loading history of bridges using deep learning and CCTV images. The proposed method recognizes the vehicle type using an object detection deep learning model and estimates the vehicle loading based on a load-based vehicle type classification table developed from the empty weights of major domestic vehicle models. Faster R-CNN, an object detection deep learning model, was trained using vehicle images classified according to the classification table. The performance of the model was verified using images from CCTVs on actual bridges. Finally, the vehicle loading history of an actual bridge was obtained for a specific period by continuously estimating the vehicle loadings on the bridge using the proposed method.
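
The lookup step described above, from detected vehicle type to estimated load, can be sketched as follows. The table values, the vehicle type names, and the `(timestamp, vehicle_type)` detection format are hypothetical placeholders. The paper's actual classification table and the detector's output format are not given in the abstract.

```python
from collections import defaultdict

# Hypothetical load-based classification table: vehicle type -> empty weight in tonnes.
# Placeholder values for illustration only, not the table developed in the paper.
EMPTY_WEIGHT_TONNES = {"car": 1.5, "van": 2.2, "bus": 12.0, "truck": 9.0}


def loading_history(detections, weights=EMPTY_WEIGHT_TONNES):
    """Sum the estimated weights of all vehicles detected at each timestamp.

    detections -- iterable of (timestamp_seconds, vehicle_type) pairs, one per
                  vehicle that an object detector reports on the bridge
    """
    totals = defaultdict(float)
    for timestamp, vehicle_type in detections:
        totals[timestamp] += weights[vehicle_type]
    return dict(sorted(totals.items()))


# Two frames: a car and a truck at t=0, then two cars and a bus at t=1.
detections = [(0, "car"), (0, "truck"), (1, "car"), (1, "bus"), (1, "car")]
history = loading_history(detections)
```

Timestamps with no detected vehicle are simply absent from the result. Because the table is built from empty vehicle weights, an estimate of this kind is best read as a lower bound on the actual load.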

Improving Chest X-ray Image Classification via Integration of Self-Supervised Learning and Machine Learning Algorithms

  • Tri-Thuc Vo;Thanh-Nghi Do
    • Journal of information and communication convergence engineering / v.22 no.2 / pp.165-171 / 2024
  • In this study, we present a novel approach for enhancing chest X-ray image classification (normal, Covid-19, edema, mass nodules, and pneumothorax) by combining contrastive learning and machine learning algorithms. A vast amount of unlabeled data was leveraged to learn representations so that data efficiency is improved as a means of addressing the limited availability of labeled data in X-ray images. Our approach involves training classification algorithms using the extracted features from a linear fine-tuned Momentum Contrast (MoCo) model. The MoCo architecture with a ResNet34, ResNet50, or ResNet101 backbone is trained to learn features from unlabeled data. Instead of only fine-tuning the linear classifier layer on the MoCo-pretrained model, we propose training nonlinear classifiers as substitutes for softmax in deep networks. The empirical results show that while the linear fine-tuned ImageNet-pretrained models achieved the highest accuracy of only 82.9% and the linear fine-tuned MoCo-pretrained models an increased highest accuracy of 84.8%, our proposed method offered a significant improvement and achieved the highest accuracy of 87.9%.
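
For readers unfamiliar with contrastive pretraining, the sketch below shows the InfoNCE loss that MoCo optimizes, computed for a single query in plain Python. It assumes L2-normalized feature vectors and uses a temperature of 0.07, the default in the original MoCo work. The abstract does not state the temperature or other training settings used in this study, and the function name is hypothetical.

```python
import math


def info_nce(query, positive_key, negative_keys, temperature=0.07):
    """InfoNCE loss for one query against one positive and several negative keys.

    All vectors are assumed to be L2-normalized, so dot products are cosine
    similarities. The temperature value is an assumption, not from the paper.
    """
    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    # The positive logit goes first, followed by one logit per negative key.
    logits = [dot(query, positive_key) / temperature]
    logits += [dot(query, key) / temperature for key in negative_keys]

    # Cross-entropy with the positive as the target class, via a stable log-sum-exp.
    peak = max(logits)
    log_sum_exp = peak + math.log(sum(math.exp(value - peak) for value in logits))
    return log_sum_exp - logits[0]


query = [1.0, 0.0]
loss_aligned = info_nce(query, [1.0, 0.0], [[0.0, 1.0], [-1.0, 0.0]])
loss_misaligned = info_nce(query, [0.0, 1.0], [[1.0, 0.0], [-1.0, 0.0]])
```

The loss is small when the query is closer to its positive key than to any negative key, and large otherwise. Features learned this way are what the downstream classifiers described in the abstract would be trained on.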

The long-term agricultural weather forecast methods using machine learning and GloSea5 : on the cultivation zone of Chinese cabbage. (기계학습과 GloSea5를 이용한 장기 농업기상 예측 : 고랭지배추 재배 지역을 중심으로)

  • Kim, Junseok;Yang, Miyeon;Yoon, Sanghoo
    • Journal of Digital Convergence / v.18 no.4 / pp.243-250 / 2020
  • Systematic farming can be planned and managed if long-term agricultural weather information for the cultivation area is available, because weather is the greatest risk factor for crop cultivation. In this study, a method for long-term prediction of agricultural weather using GloSea5 and machine learning is presented for the cultivation of Chinese cabbage. GloSea5 is a long-term weather forecast system that provides forecasts up to 240 days ahead. Deep neural networks and a spatial random forest were considered as the machine learning methods. The long-term prediction performance of the deep neural networks was slightly better than that of the spatial random forest in terms of root mean squared error and mean absolute error. However, the spatial random forest has the advantage of predicting temperatures with a global model, which reduces the computation time.
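
The two error measures used for the comparison can be computed as shown below. The temperature values are made up for illustration and are not data from the study.

```python
import math


def mean_absolute_error(observed, predicted):
    """Average magnitude of the prediction errors."""
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def root_mean_squared_error(observed, predicted):
    """Square root of the average squared error, which weights large misses more."""
    squared = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    return math.sqrt(squared / len(observed))


# Made-up daily temperatures in degrees Celsius, for illustration only.
observed = [18.0, 20.0, 22.0, 19.0]
predicted = [17.0, 21.0, 22.0, 23.0]

mae = mean_absolute_error(observed, predicted)
rmse = root_mean_squared_error(observed, predicted)
```

RMSE is never smaller than MAE on the same data, and the gap between them widens when a few predictions miss by a large margin. Reporting both, as the abstract does, shows whether a model's errors are uniformly small or dominated by occasional large misses.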

Comparison of a Deep Learning-Based Reconstruction Algorithm with Filtered Back Projection and Iterative Reconstruction Algorithms for Pediatric Abdominopelvic CT

  • Wookon Son;MinWoo Kim;Jae-Yeon Hwang;Young-Woo Kim;Chankue Park;Ki Seok Choo;Tae Un Kim;Joo Yeon Jang
    • Korean Journal of Radiology / v.23 no.7 / pp.752-762 / 2022
  • Objective: To compare a deep learning-based reconstruction (DLR) algorithm for pediatric abdominopelvic computed tomography (CT) with filtered back projection (FBP) and iterative reconstruction (IR) algorithms. Materials and Methods: Post-contrast abdominopelvic CT scans obtained from 120 pediatric patients (mean age ± standard deviation, 8.7 ± 5.2 years; 60 males) between May 2020 and October 2020 were evaluated in this retrospective study. Images were reconstructed using FBP, a hybrid IR algorithm (ASiR-V) with blending factors of 50% and 100% (AV50 and AV100, respectively), and a DLR algorithm (TrueFidelity) with three strength levels (low, medium, and high). Noise power spectrum (NPS) and edge rise distance (ERD) were used to evaluate noise characteristics and spatial resolution, respectively. Image noise, edge definition, overall image quality, lesion detectability and conspicuity, and artifacts were qualitatively scored by two pediatric radiologists, and the scores of the two reviewers were averaged. A repeated-measures analysis of variance followed by the Bonferroni post-hoc test was used to compare NPS and ERD among the six reconstruction methods. The Friedman rank sum test followed by the Nemenyi-Wilcoxon-Wilcox all-pairs test was used to compare the results of the qualitative visual analysis among the six reconstruction methods. Results: The NPS noise magnitude of AV100 was significantly lower than that of the DLR, whereas the NPS peak of AV100 was significantly higher than that of the high- and medium-strength DLR (p < 0.001). The NPS average spatial frequencies were higher for DLR than for ASiR-V (p < 0.001). ERD was shorter with DLR than with ASiR-V and FBP (p < 0.001). Qualitative visual analysis revealed better overall image quality with high-strength DLR than with ASiR-V (p < 0.001). Conclusion: For pediatric abdominopelvic CT, the DLR algorithm may provide improved noise characteristics and better spatial resolution than the hybrid IR algorithm.
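
To illustrate why a post-hoc correction matters with six reconstruction methods, the sketch below counts the pairwise comparisons and applies a Bonferroni adjustment. The overall significance level of 0.05 and the short method labels are assumptions. The abstract does not state the level used or how the authors' software applied the correction.

```python
from itertools import combinations

# The six reconstruction methods named in the abstract (labels abbreviated here).
methods = ["FBP", "AV50", "AV100", "DLR-low", "DLR-medium", "DLR-high"]
pairs = list(combinations(methods, 2))

overall_alpha = 0.05  # assumed significance level, not stated in the abstract
per_comparison_alpha = overall_alpha / len(pairs)


def bonferroni_adjust(p_values):
    """Multiply each raw p-value by the number of tests, capped at 1."""
    count = len(p_values)
    return [min(1.0, p * count) for p in p_values]


# Example with three hypothetical raw p-values.
adjusted = bonferroni_adjust([0.001, 0.04, 0.5])
```

Six methods give 15 pairwise comparisons, so under these assumptions each individual comparison would need a raw p-value below about 0.0033 to count as significant. The reported results of p < 0.001 would clear that threshold.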