Search | Korea Science

Transfer Learning-based Multi-Modal Fusion Answer Selection Model for Video Question Answering System (비디오 질의 응답 시스템을 위한 전이 학습 기반의 멀티 모달 퓨전 정답 선택 모델)

Park, Gyu-Min;Park, Seung-Bae
- Annual Conference on Human and Language Technology / 2021.10a / pp.548-553 / 2021
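Video question answering is a representative multi-modal problem that requires processing diverse information, such as text and images, in order to provide an appropriate answer to a given video and question. To improve performance, a question answering system may use several different answering modules, in which case it needs an answer selection module that picks the most appropriate answer from the generated candidates. The answer selection module needs to take the differing perspectives of the answering modules into account when selecting an answer. However, when an answering module is a black-box model, it is difficult for the answer selection module to receive knowledge through that module's parameters and prediction distribution. In addition, because the training dataset has already been used to train the answering modules, overfitting makes it hard to learn each module's perspective from it, so the selection module must be trained on a comparatively small dataset outside the training set. In this paper, we propose a transfer learning-based multi-modal fusion answer selection model to improve answer selection performance. We measured performance on the DramaQA dataset and experimentally demonstrated the superiority of the proposed model.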

Analysis of the relationship between regional economic growth and obesity by using Lasso Regression (Lasso Regression을 이용한 지역 경제 성장과 비만율의 상관관계 분석)

Kil, Eungyu;OH, Sujin;Kim, Ung-Mo
- Annual Conference of KIPS / 2018.10a / pp.565-568 / 2018
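This study predicts regional economic growth and obesity rates based on Lasso Regression. The study proceeds in three stages. First, a virtual GDP figure that can represent regional growth is computed. Next, a learning model is built from the virtual GDP figures and obesity rate data. Finally, future growth is predicted from past data and fed into the learning model to predict the obesity rate. The data consist of training data and test data. Data for Gangwon-do, one of Korea's eight provinces, are used as training data, and data for Gangneung and Wonju are used as test data. As the baseline for comparison, a least-squares prediction method, which reflects past trends, is selected. For Gangneung, the mean error rate against the comparison data is 1.22%, indicating no large difference. This shows that the proposed method is consistent with past trends. However, because predicting from past trends alone is not suited to obesity rate prediction, in which many factors act together, the proposed method is considered meaningful.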
https://doi.org/10.3745/PKIPS.y2018m10a.565
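The Lasso entry above does not state its features, regularization strength, or solver, so the following is only a generic sketch of Lasso regression fitted by coordinate descent, not the authors' pipeline. The function names, the `alpha` value, and the toy data are assumptions made for illustration. It shows the property that distinguishes Lasso from the least-squares baseline mentioned in the abstract: the L1 penalty can shrink an uninformative coefficient to exactly zero.

```python
def soft_threshold(value, threshold):
    """Shrink value toward zero by threshold (the L1 proximal step)."""
    if value > threshold:
        return value - threshold
    if value < -threshold:
        return value + threshold
    return 0.0


def lasso_coordinate_descent(rows, targets, alpha, n_iter=200):
    """Minimise (1/2n) * sum((y - x.w)^2) + alpha * sum(|w|), no intercept."""
    n_samples = len(rows)
    n_features = len(rows[0])
    weights = [0.0] * n_features
    for _ in range(n_iter):
        for j in range(n_features):
            rho = 0.0   # correlation of feature j with the partial residual
            norm = 0.0  # squared norm of feature j
            for x, y in zip(rows, targets):
                prediction_without_j = sum(
                    w * xi
                    for k, (w, xi) in enumerate(zip(weights, x))
                    if k != j
                )
                rho += x[j] * (y - prediction_without_j)
                norm += x[j] * x[j]
            if norm == 0.0:
                weights[j] = 0.0
                continue
            weights[j] = soft_threshold(rho / n_samples, alpha) / (norm / n_samples)
    return weights


# Hypothetical toy data: the target depends only on the first feature.
toy_rows = [[1.0, 0.5], [2.0, -0.3], [3.0, 0.1], [4.0, -0.2]]
toy_targets = [2.0, 4.0, 6.0, 8.0]

least_squares_like = lasso_coordinate_descent(toy_rows, toy_targets, alpha=0.0)
lasso_weights = lasso_coordinate_descent(toy_rows, toy_targets, alpha=0.5)
```

With `alpha=0.0` the update reduces to ordinary least squares solved one coordinate at a time; raising `alpha` trades a small bias in the retained coefficient for a sparser model.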

A Study on Synthesizing Training Data for One-stage Object Detector (단일 단계 검출 방법을 위한 이미지 합성기반 학습 데이터 증강에 관한 연구)

Lee, Seon-Gyeong;Jeong, Chi Yoon;Moon, KyeongDeok;Kim, Chae-Kyu
- Annual Conference of KIPS / 2020.05a / pp.446-450 / 2020
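Deep learning-based image analysis methods require large amounts of training data, and building that data takes considerable time and effort. Object detection in particular requires the location, size, and category of every object in an image, which makes building training data even harder; to address this, image synthesis-based data augmentation has recently been actively studied. When an object is composited onto a background image, image synthesis-based augmentation produces artifacts in the region where the object meets the background, and detection performance degrades because the object detection model learns these artifacts as object features. To solve this problem, this paper proposes a bilateral filter-based image synthesis method and analyzes the performance of image synthesis-based data augmentation using RetinaNet, a representative one-stage detector. Experiments on a public dataset confirmed that the one-stage detection method and data augmentation technique used in this paper achieve the same performance as existing methods with a smaller amount of augmented data.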
https://doi.org/10.3745/PKIPS.y2020m05a.446

Synthetic Data Generation and Performance Analysis for Anomaly Detection (이상 탐지를 위한 합성 데이터 생성 및 성능 분석)

Hwang, Ju-hyo;Jin, Kyo-hong
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2022.10a / pp.19-21 / 2022
Anomaly detection using self-supervised learning typically generates synthetic data to learn to distinguish normal from abnormal samples, and uses real abnormal data as test data to measure anomaly detection performance. One study that uses this approach generates synthetic data similar to normal data by cutting and pasting a specific patch from the original image, and then performs anomaly detection. With this approach, the degree of similarity to normal data depends on the number and size of patches, which affects anomaly detection performance. In this paper, synthetic data were generated with varying patch sizes and numbers, their similarity to normal data was analyzed using a pre-trained model, and anomaly detection performance was measured by training the model.
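The cut-and-paste idea in the entry above can be sketched as follows. This is a minimal illustration on a plain 2D list, assuming rectangular patches whose source and destination are drawn uniformly at random; the function names, the `seed` parameter, and the pixel-level `changed_fraction` measure are hypothetical. The abstract measures similarity with a pre-trained model, which this crude pixel count does not reproduce.

```python
import random


def cut_paste(image, patch_height, patch_width, n_patches, seed=0):
    """Return a copy of image with n_patches rectangles copied to new positions."""
    rng = random.Random(seed)
    height, width = len(image), len(image[0])
    if patch_height > height or patch_width > width:
        raise ValueError("patch does not fit inside the image")
    result = [row[:] for row in image]
    for _ in range(n_patches):
        # Source and destination top-left corners are drawn independently.
        src_r = rng.randint(0, height - patch_height)
        src_c = rng.randint(0, width - patch_width)
        dst_r = rng.randint(0, height - patch_height)
        dst_c = rng.randint(0, width - patch_width)
        # Patches are always cut from the untouched original image.
        patch = [
            row[src_c:src_c + patch_width]
            for row in image[src_r:src_r + patch_height]
        ]
        for dr, patch_row in enumerate(patch):
            result[dst_r + dr][dst_c:dst_c + patch_width] = patch_row
    return result


def changed_fraction(original, synthetic):
    """Fraction of pixels that differ: a rough stand-in for dissimilarity."""
    total = sum(len(row) for row in original)
    changed = sum(
        1
        for row_a, row_b in zip(original, synthetic)
        for a, b in zip(row_a, row_b)
        if a != b
    )
    return changed / total


# Hypothetical 8x8 "image" in which every pixel has a unique value.
normal_image = [[r * 8 + c for c in range(8)] for r in range(8)]
synthetic_image = cut_paste(normal_image, patch_height=2, patch_width=2, n_patches=3, seed=1)
```

Sweeping `patch_height`, `patch_width`, and `n_patches` and recording `changed_fraction` gives a quick feel for how patch count and size control the distance from the normal data.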

Smart contract research for efficient learner problem recommendation in online education environment (온라인 교육 환경에서 효율적 학습자 문제추천을 위한 스마트 컨트랙트 연구)

Min, Youn-A
- The Journal of the Institute of Internet, Broadcasting and Communication / v.22 no.4 / pp.195-201 / 2022
For an efficient distance education environment, there is a growing need for accurate problem recommendation guides that consider each learner's exact learning pattern. In this paper, we study blockchain-based smart contract technology and propose a method for presenting the optimal problem recommendation path for individual learners, based on data obtained by applying situational weights to learners' problem patterns collected in the distance education environment. For the performance evaluation, learning satisfaction relative to an existing similar learning environment, the usefulness of the problem recommendation guide, and the learner data processing speed were analyzed. The study confirmed that learning satisfaction improved by more than 15% and learning data processing speed improved by more than 20% compared to the existing learning environment.
https://doi.org/10.7236/JIIBC.2022.22.4.195

Artificial Intelligence Security Issues (인공지능 보안 이슈)

Park, Sohee;Choi, Daeseon
- Review of KIISC / v.27 no.3 / pp.27-32 / 2017
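Artificial intelligence technology, centered on machine learning, is being applied in diverse ways across many fields. Machine learning has shown high performance on test data, but cases of malfunction on maliciously crafted data have been reported. New attack types have also been reported, such as poisoning training data and stealing trained models. The security and privacy of the training data used in machine learning are also important issues. These risks must be considered and prepared for when developing and applying artificial intelligence technology.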

Improved Focused Sampling for Class Imbalance Problem (클래스 불균형 문제를 해결하기 위한 개선된 집중 샘플링)

Kim, Man-Sun;Yang, Hyung-Jeong;Kim, Soo-Hyung;Cheah, Wooi Ping
- The KIPS Transactions:PartB / v.14B no.4 / pp.287-294 / 2007
Many classification algorithms for real-world data suffer from the class imbalance problem. Various methods have been proposed to solve it, such as altering the training balance and designing better sampling strategies. However, previous methods are not satisfactory with respect to the distribution of the input data and the constraints. In this paper, we propose a focused sampling method that outperforms previous methods. To solve the problem, a useful subset must be selected from the full training set. To obtain this subset, the proposed method divides the data into regions according to scores computed from the distribution of a SOM over the input data. The scores are sorted in ascending order. They represent the distribution of the input data, which may in turn represent the characteristics of the whole data. A new training dataset is obtained by eliminating non-useful data located in the region between an upper bound and a lower bound. The proposed method gives classification accuracy better than, or at least similar to, that of previous approaches. It also offers several benefits: a reduced class imbalance ratio, smaller training sets, and prevention of over-fitting. The proposed method was tested with a kNN classifier. An experimental result on the ecoli data set shows that the method achieves precision up to 2.27 times that of the other methods.
https://doi.org/10.3745/KIPSTB.2007.14-B.4.287

User Assistant Soft Computing Method for 3D Effect Optimization (입체효과 최적화를 위한 사용자 보조 소프트컴퓨팅 기법)

Choi Woo-Kyung;Kim Seong-Joo;Jeon Hong-Tae
- Journal of the Korean Institute of Intelligent Systems / v.15 no.1 / pp.69-74 / 2005
In this paper, we propose a user-assistant soft computing method for 3D effect optimization. To maximize the 3D effect of an image, the intervals between cameras have to be set properly according to the distance between the cameras and an object. Two kinds of data, interval and distance, were obtained for use as training data for a neural network. However, if the training data are obtained only from human subjective judgments, they may not be optimal for learning because they contain accidental errors. To obtain optimal training data, we added candidate data to the obtained data through data analysis, and then selected the most suitable data from among the candidate data and the obtained data for training the neural network. The 3D effect of an image is usually affected by both the distance from the object to the cameras and the object size. Therefore, we propose a fuzzy inference model that can represent both factors, distance and size. The candidate data were added by the fuzzy model. In simulation, we verified that the more the obtained data were affected by human subjective judgments, the more effective the proposed system was.
https://doi.org/10.5391/JKIIS.2005.15.1.069

Utilizing Minimal Label Data for Tomato Leaf Disease Classification: An Approach through Recursive Learning Based on YOLOv8 (토마토 잎 병해 분류를 위한 최소 라벨 데이터 활용: YOLOv8 기반 재귀적 학습 방식을 통한 접근)

Junhyuk Lee;Namhyoung Kim
- The Journal of Bigdata / v.9 no.1 / pp.61-73 / 2024
Class imbalance is a significant challenge in deep learning, and it is particularly pronounced in areas with limited data. This study proposes a new approach that uses minimal labeled data to classify tomato leaf diseases effectively. We introduce a recursive learning method using the YOLOv8 model. By using the model's detection predictions on training images as additional training data, the amount of labeled data is progressively increased. Unlike conventional data augmentation and up- and down-sampling techniques, this method seeks to fundamentally solve the class imbalance problem by maximizing the utility of actual data. Based on the labeled data thus secured, tomato leaves were extracted and diseases were classified using the EfficientNet model. This process achieved a high accuracy of 98.92%. Notably, a 12.9% improvement over the baseline was observed in the detection of Late blight, the disease with the least data. This research presents a methodology that addresses data imbalance while offering high-precision disease classification, and it is expected to be applicable to other crops.
https://doi.org/10.36498/kbigdt.2024.9.1.61
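The recursive learning loop described in the entry above (train, predict, fold the predictions back in as labels, repeat) can be sketched generically. This is not the paper's YOLOv8 pipeline: a toy one-dimensional nearest-centroid classifier stands in for the detector, and the `min_margin` acceptance rule, the function names, and the sample values are all assumptions introduced for illustration.

```python
def fit_centroids(labeled):
    """Mean feature value per class; labeled is a list of (value, label) pairs."""
    sums, counts = {}, {}
    for value, label in labeled:
        sums[label] = sums.get(label, 0.0) + value
        counts[label] = counts.get(label, 0) + 1
    return {label: sums[label] / counts[label] for label in sums}


def predict(centroids, value):
    """Return (label, margin): the nearest class and its lead over the runner-up."""
    ranked = sorted(centroids, key=lambda label: abs(value - centroids[label]))
    best = ranked[0]
    if len(ranked) == 1:
        return best, float("inf")
    margin = abs(value - centroids[ranked[1]]) - abs(value - centroids[best])
    return best, margin


def recursive_labeling(labeled, unlabeled, min_margin, max_rounds=10):
    """Grow the labeled set by repeatedly accepting confident predictions."""
    labeled = list(labeled)
    pool = list(unlabeled)
    for _ in range(max_rounds):
        centroids = fit_centroids(labeled)  # retrain on the current labels
        accepted, remaining = [], []
        for value in pool:
            label, margin = predict(centroids, value)
            if margin >= min_margin:
                accepted.append((value, label))
            else:
                remaining.append(value)  # too ambiguous, try again next round
        if not accepted:
            break  # nothing new was labeled, so further rounds cannot help
        labeled.extend(accepted)
        pool = remaining
    return labeled, pool


# Hypothetical seed labels and unlabeled feature values.
seed_labels = [(0.0, "healthy"), (10.0, "late_blight")]
grown, leftover = recursive_labeling(seed_labels, [1.0, 2.0, 9.0, 5.0], min_margin=2.0)
```

The acceptance threshold is the main design choice in such a loop: set it too low and early mistakes are reinforced in later rounds, set it too high and the labeled set barely grows.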

Korean Restaurant Reservation System Model Using Hybrid Code Network (Hybrid Code Network를 이용한 한국어 식당 예약 시스템 모델)

Lee, Dong-Yub;Hur, Yun-A;Lim, Heui-Seok
- Proceedings of The KACE / 2017.08a / pp.57-59 / 2017
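A dialogue system is a system that can carry out a specific purpose in various domains through text or speech. End-to-end learning based on neural networks has been proposed as a way to implement dialogue systems. To train restaurant reservation models with end-to-end learning, Facebook built an English training dialogue dataset for restaurant reservations (The 6 dialog bAbI tasks). However, end-to-end learning has the drawback of requiring a large amount of training data. Prior work therefore proposed the Hybrid Code Network architecture, which expresses domain knowledge by defining action templates and can train a well-performing model with less training than ordinary end-to-end learning. This paper proposes a method for building a Korean restaurant reservation system using the Hybrid Code Network architecture, and a method for constructing Korean training dialogue data for restaurant reservations.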

Search Result 6,458, Processing Time 0.045 seconds
