• Title/Summary/Keyword: Data for AI training

Search Result 261, Processing Time 0.027 seconds

A Survey of Applications of Artificial Intelligence Algorithms in Eco-environmental Modelling

  • Kim, Kang-Suk;Park, Joon-Hong
    • Environmental Engineering Research
    • /
    • v.14 no.2
    • /
    • pp.102-110
    • /
    • 2009
  • Application of artificial intelligence (AI) approaches in eco-environmental modeling has gradually increased for the last decade. Comprehensive understanding and evaluation on the applicability of this approach to eco-environmental modeling are needed. In this study, we reviewed the previous studies that used AI-techniques in eco-environmental modeling. Decision Tree (DT) and Artificial Neural Network (ANN) were found to be major AI algorithms preferred by researchers in ecological and environmental modeling areas. When the effect of the size of training data on model prediction accuracy was explored using the data from the previous studies, the prediction accuracy and the size of training data showed nonlinear correlation, which was best-described by hyperbolic saturation function among the tested nonlinear functions including power and logarithmic functions. The hyperbolic saturation equations were proposed to be used as a guideline for optimizing the size of training data set, which is critically important in designing the field experiments required for training AI-based eco-environmental modeling.

A Study on the Artificial Intelligence (AI) Training Data Quality: Fuzzy-set Qualitative Comparative Analysis (fsQCA) Approach (인공지능 학습용 데이터 품질에 대한 연구: 퍼지셋 질적비교분석)

  • Hyunmok Oh;Seoyoun Lee;Younghoon Chang
    • Information Systems Review
    • /
    • v.26 no.1
    • /
    • pp.19-56
    • /
    • 2024
  • This study is empirical research to enhance understanding of AI (artificial intelligence) training data project in South Korea. It primarily focuses on the various concerns regarding data quality from policy-executing institutions, data construction companies, and organizations utilizing AI training data to develop the most reliable algorithm for society. For academic contribution, this study suggests a theoretical foundation and research model for understanding AI training data quality and its antecedents, as well as the unique data and ethical aspects of AI. For this purpose, this study proposes a research model with important antecedents related to AI training data quality, such as data attribute factors, data building environmental factors, and data type-related factors. The study collects 393 sample data from actual practitioners and personnel from companies building artificial intelligence training data and companies developing artificial intelligence services. Data analysis was conducted through Fuzzy Set Qualitative Comparative Analysis (fsQCA) and Artificial Neural Network analysis (ANN), presenting academic and practical implications related to the quality of AI training data.

Understanding MyData-Based Platform Adoption for SW·AI Education & Training Programs

  • Hansung Kim;Sae Bom Lee;Yunjae Jang
    • Journal of the Korea Society of Computer and Information
    • /
    • v.29 no.9
    • /
    • pp.269-277
    • /
    • 2024
  • This study aims to explore the key factors for the systematic development and activation of a MyData-based platform for SW·AI education and training programs recently initiated by the government. To achieve this, a research model based on the Value-based Adoption Model (VAM) was established, and a survey was conducted with 178 participants who had experience in SW·AI education and training programs. The research model was validated using confirmatory factor analysis and Partial Least Squares Structural Equation Modeling (PLS-SEM). The main findings of the study are as follows: First, transparency and self-determination significantly influenced perceived benefits, while technical effort and security significantly influenced perceived risks. Second, perceived benefits positively affected the intention to use the platform, whereas perceived risks did not show a significant impact. Based on these results, this study suggests implications for the systematic development and activation of a MyData-based platform in the field of SW·AI education and training.

Implementation of a Job Prediction Program and Analysis of Vocational Training Evaluation Data Based on Artificial Intelligence (인공지능(AI) 기반 직업 훈련 평가 데이터 분석 및 취업 예측 프로그램 구현)

  • Jae-Sung Chun;Il-Young Moon
    • Journal of Practical Engineering Education
    • /
    • v.16 no.4
    • /
    • pp.409-414
    • /
    • 2024
  • This paper utilizes artificial intelligence to analyze vocational training evaluation data for people with disabilities and selects the optimal prediction model using various machine learning algorithms. It predicts the job categories most likely to employ trainees based on data such as gender, age, education level, type of disability, and basic learning abilities. The goal is to design customized training programs based on these predictions to enhance training efficiency and employment success rates.

A Study on Designing Metadata Standard for Building AI Training Dataset of Landmark Images (랜드마크 이미지 AI 학습용 데이터 구축을 위한 메타데이터 표준 설계 방안 연구)

  • Kim, Jinmook
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.54 no.2
    • /
    • pp.419-434
    • /
    • 2020
  • The purpose of the study is to design and propose metadata standard for building AI training dataset of landmark images. In order to achieve the purpose, we first examined and analyzed the state of art of the types of image retrieval systems and their indexing methods, comprehensively. We then investigated open training dataset and machine learning tools for image object recognition. Sequentially, we selected metadata elements optimized for the AI training dataset of landmark images and defined the input data for each element. We then concluded the study with implications and suggestions for the development of application services using the results of the study.

AI Model-Based Automated Data Cleaning for Reliable Autonomous Driving Image Datasets (자율주행 영상데이터의 신뢰도 향상을 위한 AI모델 기반 데이터 자동 정제)

  • Kana Kim;Hakil Kim
    • Journal of Broadcast Engineering
    • /
    • v.28 no.3
    • /
    • pp.302-313
    • /
    • 2023
  • This paper aims to develop a framework that can fully automate the quality management of training data used in large-scale Artificial Intelligence (AI) models built by the Ministry of Science and ICT (MSIT) in the 'AI Hub Data Dam' project, which has invested more than 1 trillion won since 2017. Autonomous driving technology using AI has achieved excellent performance through many studies, but it requires a large amount of high-quality data to train the model. Moreover, it is still difficult for humans to directly inspect the processed data and prove it is valid, and a model trained with erroneous data can cause fatal problems in real life. This paper presents a dataset reconstruction framework that removes abnormal data from the constructed dataset and introduces strategies to improve the performance of AI models by reconstructing them into a reliable dataset to increase the efficiency of model training. The framework's validity was verified through an experiment on the autonomous driving dataset published through the AI Hub of the National Information Society Agency (NIA). As a result, it was confirmed that it could be rebuilt as a reliable dataset from which abnormal data has been removed.

Object Detection Accuracy Improvements of Mobility Equipments through Substitution Augmentation of Similar Objects (유사물체 치환증강을 통한 기동장비 물체 인식 성능 향상)

  • Heo, Jiseong;Park, Jihun
    • Journal of the Korea Institute of Military Science and Technology
    • /
    • v.25 no.3
    • /
    • pp.300-310
    • /
    • 2022
  • A vast amount of labeled data is required for deep neural network training. A typical strategy to improve the performance of a neural network given a training data set is to use data augmentation technique. The goal of this work is to offer a novel image augmentation method for improving object detection accuracy. An object in an image is removed, and a similar object from the training data set is placed in its area. An in-painting algorithm fills the space that is eliminated but not filled by a similar object. Our technique shows at most 2.32 percent improvements on mAP in our testing on a military vehicle dataset using the YOLOv4 object detector.

A Study on the Land Cover Classification and Cross Validation of AI-based Aerial Photograph

  • Lee, Seong-Hyeok;Myeong, Soojeong;Yoon, Donghyeon;Lee, Moung-Jin
    • Korean Journal of Remote Sensing
    • /
    • v.38 no.4
    • /
    • pp.395-409
    • /
    • 2022
  • The purpose of this study is to evaluate the classification performance and applicability when land cover datasets constructed for AI training are cross validation to other areas. For study areas, Gyeongsang-do and Jeolla-do in South Korea were selected as cross validation areas, and training datasets were obtained from AI-Hub. The obtained datasets were applied to the U-Net algorithm, a semantic segmentation algorithm, for each region, and the accuracy was evaluated by applying them to the same and other test areas. There was a difference of about 13-15% in overall classification accuracy between the same and other areas. For rice field, fields and buildings, higher accuracy was shown in the Jeolla-do test areas. For roads, higher accuracy was shown in the Gyeongsang-do test areas. In terms of the difference in accuracy by weight, the result of applying the weights of Gyeongsang-do showed high accuracy for forests, while that of applying the weights of Jeolla-do showed high accuracy for dry fields. The result of land cover classification, it was found that there is a difference in classification performance of existing datasets depending on area. When constructing land cover map for AI training, it is expected that higher quality datasets can be constructed by reflecting the characteristics of various areas. This study is highly scalable from two perspectives. First, it is to apply satellite images to AI study and to the field of land cover. Second, it is expanded based on satellite images and it is possible to use a large scale area and difficult to access.

A Study on Student Satisfaction according to Likert Scale in Big Data Training (빅데이터 양성 교육에서 리커트 척도에 따른 만족도 분석에 관한 연구)

  • Choi, Hyun
    • Journal of the Korean Society of Industry Convergence
    • /
    • v.22 no.6
    • /
    • pp.775-783
    • /
    • 2019
  • The big data industry market continues to grow and is expected to grow further. In this paper, based on the five-point Likert scale of college students in the process of developing big data young people, the satisfaction of instructors in big data training and improvement of job (education) ability based on AI convergence The survey was conducted on the expectations of the participants and their intention to participate in the training process for the young talents. Male students were more satisfied than students. In terms of students, students who are less than 6th semester have the highest satisfaction, but students who are less than 7th and 8th semesters are less satisfied. By department, the satisfaction level of science and statistics students was the highest, while the satisfaction level of other students was low. According to the average of college credits, the satisfaction of students under 3.5~4.0 was the highest, and the satisfaction of students below 3.0 was the lowest.

Necessity of AI Literacy Education to Enhance for the Effectiveness of AI Education (AI교육 효과성 제고를 위한 AI리터러시 교육의 필요성)

  • Yang, Seokjae;Shin, Seungki
    • 한국정보교육학회:학술대회논문집
    • /
    • 2021.08a
    • /
    • pp.295-301
    • /
    • 2021
  • This study tried to examine the necessity of AI literacy education to increase the effectiveness of artificial intelligence education ahead of the revision of the next revised curriculum. To this end, AI modeling classes were conducted for high school students and the necessity, content, and training period of AI literacy perceived by students in AI education were investigated through a questionnaire. The results showed that they generally agreed on the need for data utilization and data preprocessing in the AI class, and in the course of the AI class, there were many cases of difficulties due to lack of basic competencies for database use. In particular, it was observed that the understanding of the file structure for data analysis was insufficient and the understanding of the data storage format for data analysis was low. In order to overcome this part, the necessity of prior education for data processing was recognized, and there were many opinions that it is generally appropriate to go to high school at that time. As for the content elements of AI literacy, it was found that there were high demands on the content of data visualization along with data transformation, including data creation and deletion.

  • PDF