Search | Korea Science

Image Generation from Korean Dialogue Text via Prompt-based Few-shot Learning (프롬프트 기반 퓨샷 러닝을 통한 한국어 대화형 텍스트 기반 이미지 생성)

Eunchan Lee;Sangtae Ahn
- Annual Conference on Human and Language Technology
- /
- 2022.10a
- /
- pp.447-451
- /
- 2022
본 논문에서는 사용자가 대화 텍스트 방식의 입력을 주었을 때 이를 키워드 중심으로 변환하여 이미지를 생성해내는 방식을 제안한다. 대화 텍스트란 채팅 등에서 주로 사용하는 형식의 구어체를 말하며 이러한 텍스트 형식은 텍스트 기반 이미지 생성 모델이 적절한 아웃풋 이미지를 생성하기 어렵게 만든다. 이를 해결하기 위해 대화 텍스트를 키워드 중심 텍스트로 바꾸어 텍스트 기반 이미지 생성 모델의 입력으로 변환하는 과정이 이미지 생성의 질을 높이는 좋은 방안이 될 수 있는데 이러한 태스크에 적합한 학습 데이터는 충분하지 않다. 본 논문에서는 이러한 문제를 다루기 위한 하나의 방안으로 사전학습된 초대형 언어모델인 KoGPT 모델을 활용하며, 퓨샷 러닝을 통해 적은 양의 직접 제작한 데이터만을 학습시켜 대화 텍스트 기반의 이미지 생성을 구현하는 방법을 제안한다.
PDF

CKFont2: An Improved Few-Shot Hangul Font Generation Model Based on Hangul Composability (CKFont2: 한글 구성요소를 이용한 개선된 퓨샷 한글 폰트 생성 모델)

Jangkyoung, Park;Ammar, Ul Hassan;Jaeyoung, Choi
- KIPS Transactions on Software and Data Engineering
- /
- v.11 no.12
- /
- pp.499-508
- /
- 2022
A lot of research has been carried out on the Hangeul generation model using deep learning, and recently, research is being carried out how to minimize the number of characters input to generate one set of Hangul (Few-Shot Learning). In this paper, we propose a CKFont2 model using only 14 letters by analyzing and improving the CKFont (hereafter CKFont1) model using 28 letters. The CKFont2 model improves the performance of the CKFont1 model as a model that generates all Hangul using only 14 characters including 24 components (14 consonants and 10 vowels), where the CKFont1 model generates all Hangul by extracting 51 Hangul components from 28 characters. It uses the minimum number of characters for currently known models. From the basic consonants/vowels of Hangul, 27 components such as 5 double consonants, 11/11 compound consonants/vowels respectively are learned by deep learning and generated, and the generated 27 components are combined with 24 basic consonants/vowels. All Hangul characters are automatically generated from the combined 51 components. The superiority of the performance was verified by comparative analysis with results of the zi2zi, CKFont1, and MX-Font model. It is an efficient and effective model that has a simple structure and saves time and resources, and can be extended to Chinese, Thai, and Japanese.
https://doi.org/10.3745/KTSDE.2022.11.12.499 인용 PDF KSCI

Various Prompt Methods for Korean Natural Language Inference (한국어 자연어 추론을 위한 다양한 프롬프트 방법 )

Yohan Choi;Changki Lee;Kyungman Bae
- Annual Conference on Human and Language Technology
- /
- 2023.10a
- /
- pp.419-422
- /
- 2023
자연어 추론은 전제 문장과 가설 문장의 관계를 함의, 중립, 모순으로 분류하는 자연어 처리 태스크이다. 최근 여러 자연어 처리 태스크에서 딥러닝 모델을 이용하는 방법이 우수한 성능을 보이고 있지만, 이는 미세 조정과정에 드는 비용이 많다는 점과 모델 출력의 근거, 과정을 사람이 이해하기 어려운 한계가 있다. 이러한 이유로 최근에는 소량의 입력, 출력 예시를 포함한 프롬프트를 이용한 방법론과 모델 출력에 대한 근거를 생성, 활용하는 방법에 관한 많은 연구가 진행되고 있다. 본 논문에서는 퓨샷 학습 환경의 한국어 자연어 추론 태스크를 위한 세 가지 프롬프트 방법과 이들을 조합하여 적용하는 방법을 제안한다. 이를 통해 '해석 가능성'과 자연어 추론 성능을 모두 향상시킬 수 있음을 보인다.
PDF

Named Entity Detection Using Generative Al for Personal Information-Specific Named Entity Annotation Conversation Dataset (개인정보 특화 개체명 주석 대화 데이터셋 기반 생성AI 활용 개체명 탐지)

Yejee Kang;Li Fei;Yeonji Jang;Seoyoon Park;Hansaem Kim
- Annual Conference on Human and Language Technology
- /
- 2023.10a
- /
- pp.499-504
- /
- 2023
본 연구에서는 민감한 개인정보의 유출과 남용 위험이 높아지고 있는 상황에서 정확한 개인정보 탐지 및 비식별화의 효율을 높이기 위해 개인정보 항목에 특화된 개체명 체계를 개발하였다. 개인정보 태그셋이 주석된 대화 데이터 4,981세트를 구축하고, 생성 AI 모델을 활용하여 개인정보 개체명 탐지 실험을 수행하였다. 실험을 위해 최적의 프롬프트를 설계하여 퓨샷러닝(few-shot learning)을 통해 탐지 결과를 평가하였다. 구축한 데이터셋과 영어 기반의 개인정보 주석 데이터셋을 비교 분석한 결과 고유식별번호 항목에 대해 본 연구에서 구축한 데이터셋에서 더 높은 탐지 성능이 나타났으며, 이를 통해 데이터셋의 필요성과 우수성을 입증하였다.
PDF

A Study on Icon Detection in Korean Traditional Paintings (한국 전통회화 내 도상 검출에 관한 연구)

Jiwon Lee;JungSoo Lee;Sungwon Moon;Do-Won Nam;Wonyoung Yoo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.11a
- /
- pp.446-448
- /
- 2023
최근 문화유산 해설 분야에도 AI를 도입하기 위해 여러 노력을 기울이고 있으나, 관람객의 특성이나 관심사를 고려하지 않고 사전에 수동으로 입력한 동일한 문화해설 콘텐츠를 다수의 관람객에게 반복 전달하는 형태로만 제공되는데 그치고 있다. 본 논문에서는 관람객이 관람 중인 문화유산을 관람객의 다양한 관심사에 맞추어 문화유산을 다양하게 해설해주기 위한 기초 연구로 영상을 통해 입력된 한국 전통회화에서 도상을 검출하는 연구를 진행하였다. 아직 가능성 타진 연구로 진행되어 현재 제시된 실험 결과에서는 우수한 도상 검출 성능을 내지 못하였지만, 다양한 증강기법과 퓨샷 러닝기법을 통하여 성능 향상을 도모할 경우 충분히 관람객 맞춤형 문화유산 해설 분야에 활용 가능할 것으로 기대된다.
https://doi.org/10.3745/PKIPS.y2023m11a.446 인용 PDF

Implementation of Point detail Classification System using Few-shot Learning (Few-shot Learning을 이용한 격점상세도 분류 시스템 구현)

Park, Jin-Hyouk;Kim, Yong Hyun;Lee, Kook-Bum;Lee, Jongseo;Kim, Yu-Doo
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.26 no.12
- /
- pp.1809-1815
- /
- 2022
A digital twin is a technology that creates a virtual world identical to the real world. Problems in the real world can be identified through various simulations, so it is a trend to be applied in various industries. In order to apply the digital twin, it is necessary to analyze the drawings in which the structure of the real world to be made identical is designed. Although the technology for analyzing drawings is being studied, it is difficult to apply them because the rules or standards for drawing drawings are different for each author. Therefore, in this paper, we implement a system that analyzes and classifies the vertex detail, one of the drawings, using artificial intelligence. Through this, we intend to confirm the possibility of analyzing and classifying drawings through artificial intelligence and introduce future research directions.
https://doi.org/10.6109/jkiice.2022.26.12.1809 인용 PDF KSCI

3D Film Image Classification Based on Optimized Range of Histogram (히스토그램의 최적폭에 기반한 3차원 필름 영상의 분류)

Lee, Jae-Eun;Kim, Young-Bong;Kim, Jong-Nam
- Journal of the Institute of Convergence Signal Processing
- /
- v.22 no.2
- /
- pp.71-78
- /
- 2021
In order to classify a target image in a cluster of images, the difference in brightness between the object and the background is mainly concerned, which is not easy to classify if the shape of the object is blurred and the sharpness is low. However, there are a few studies attempted to solve these problems, and there is still the problem of not properly distinguishing between wrong pattern and right pattern images when applied to actual data analysis. In this paper, we propose an algorithm that classifies 3D films into sharp and blurry using the width of the pixel values histogram. This algorithm determines the width of the right and wrong images based on the width of the pixel distributions. The larger the width histogram, the sharp the image, while the shorter the width histogram the blurry the image. Experiments show that the proposed algorithm reflects that the characteristics of these histograms allows classification of all wrong images and right images. To determine the reliability and validity of the proposed algorithm, we compare the results with the other obtained from preprocessed 3D films. We then trained the 3D films using few-shot learning algorithm for accurate classification. The experiments verify that the proposed algorithm can perform higher without complicated computations.
PDF KSCI

Host-Based Intrusion Detection Model Using Few-Shot Learning (Few-Shot Learning을 사용한 호스트 기반 침입 탐지 모델)

Park, DaeKyeong;Shin, DongIl;Shin, DongKyoo;Kim, Sangsoo
- KIPS Transactions on Software and Data Engineering
- /
- v.10 no.7
- /
- pp.271-278
- /
- 2021
As the current cyber attacks become more intelligent, the existing Intrusion Detection System is difficult for detecting intelligent attacks that deviate from the existing stored patterns. In an attempt to solve this, a model of a deep learning-based intrusion detection system that analyzes the pattern of intelligent attacks through data learning has emerged. Intrusion detection systems are divided into host-based and network-based depending on the installation location. Unlike network-based intrusion detection systems, host-based intrusion detection systems have the disadvantage of having to observe the inside and outside of the system as a whole. However, it has the advantage of being able to detect intrusions that cannot be detected by a network-based intrusion detection system. Therefore, in this study, we conducted a study on a host-based intrusion detection system. In order to evaluate and improve the performance of the host-based intrusion detection system model, we used the host-based Leipzig Intrusion Detection-Data Set (LID-DS) published in 2018. In the performance evaluation of the model using that data set, in order to confirm the similarity of each data and reconstructed to identify whether it is normal data or abnormal data, 1D vector data is converted to 3D image data. Also, the deep learning model has the drawback of having to re-learn every time a new cyber attack method is seen. In other words, it is not efficient because it takes a long time to learn a large amount of data. To solve this problem, this paper proposes the Siamese Convolutional Neural Network (Siamese-CNN) to use the Few-Shot Learning method that shows excellent performance by learning the little amount of data. Siamese-CNN determines whether the attacks are of the same type by the similarity score of each sample of cyber attacks converted into images. The accuracy was calculated using Few-Shot Learning technique, and the performance of Vanilla Convolutional Neural Network (Vanilla-CNN) and Siamese-CNN was compared to confirm the performance of Siamese-CNN. As a result of measuring Accuracy, Precision, Recall and F1-Score index, it was confirmed that the recall of the Siamese-CNN model proposed in this study was increased by about 6% from the Vanilla-CNN model.
https://doi.org/10.3745/KTSDE.2021.10.7.271 인용 PDF KSCI

Search Result 8, Processing Time 0.027 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)