Search | Korea Science

KoDialoGPT2 : Modeling Chit-Chat Dialog in Korean (KoDialoGPT2 : 한국어 일상 대화 생성 모델)

Oh, Dongsuk;Park, Sungjin;Lee, Hanna;Jang, Yoonna;Lim, Heuiseok
- Annual Conference on Human and Language Technology / 2021.10a / pp.457-460 / 2021
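A dialogue system is a system in which an artificial intelligence and a person communicate in natural language, and research is broadly divided into task-oriented dialogue systems and chit-chat dialogue systems. Task-oriented dialogue systems cover domains that users need in daily life, such as checking the weather, booking hotels and flights, and managing schedules, and each domain has scenarios tied to its purpose. Such dialogue can give the user clear utterances but is less natural. Chit-chat dialogue spans diverse domains and has no scenarios, so it can give the user natural utterances. Chit-chat systems are developed as either retrieval-based or generation-based systems. A retrieval-based system needs a database of utterance pairs, whereas a generation-based system has no such database and relies on utterances generated by the model's Language Modeling (LM). The quality of the utterances therefore depends on the model's performance. Recently, pretrained models have shown high performance on natural language processing tasks, including the chit-chat domain. The pretrained models that perform best on chit-chat are autoregressive generative models, and KoGPT2 is the representative one for Korean. However, because KoGPT2 was trained only on written-style data, it performs poorly on conversational text. In this paper, we developed KoDialoGPT2, a Korean model that performs well on conversational text, and it outperformed the existing KoGPT2.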

A Design of Industrial Safety Service using LoRa Gateway Networks (LoRa 게이트웨이 네트워크를 활용한 산업안전서비스 설계)

Chang, Moon-soo
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.10a / pp.313-316 / 2021
In an Internet of Things (IoT) environment, network configuration is essential for collecting data generated by objects. Various communication methods are used to process object data, mainly wireless ones such as Bluetooth and WiFi. To collect object data, a communication module must be installed that gathers data from sensors or edge devices in real time, and a software architecture must be configured to deliver that data to the database. Data generated by objects can then be stored and managed in a database in real time, and the data needed for industrial safety can be extracted and used in industrial safety service applications. In this paper, a network environment was built using a LoRa (Long Range) gateway to collect object data, and a client/server data collection model was designed to collect the object data transmitted from the LoRa module. To secure the resources needed for data collection and storage management without data leakage, data collection should be possible in real time. As an application service, location data required for industrial safety can be stored and managed in a database in real time.

Morphology Representation using STT API in Rasbian OS (Rasbian OS에서 STT API를 활용한 형태소 표현에 대한 연구)

Woo, Park-jin;Im, Je-Sun;Lee, Sung-jin;Moon, Sang-ho
- Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2021.10a / pp.373-375 / 2021
For Korean, tagging based on English-style word tokenization offers less room for development than it does for English. Although a corpus tokenized into morpheme units with KoNLPy can be represented as a graph database, converting the module between graph database and corpus requires full separation of the voice files and verification of practicality. This paper demonstrates morphology representation using an STT API on a Raspberry Pi. The voice file, converted to a corpus, is analyzed with KoNLPy and tagged. The analysis results are represented in a graph database and can be split into morpheme-level tokens; by assessing practicality and the degree of separation, we judge that purpose-specific data mining is feasible.

Research trends to analysis of 『Muyedobotongji』 (『무예도보통지』 연구동향 분석)

Kwak, Nak-hyun
- (The)Study of the Eastern Classic / no.55 / pp.193-221 / 2014
This study analyzes research trends in prior work on "Muyedobotongji". The conclusions are as follows. First, there are 47 degree theses related to "Muyedobotongji" in total: 29 master's theses and 18 doctoral theses. Sports science accounts for the largest share, with 23 master's theses and 12 doctoral theses. Beyond sports science, "Muyedobotongji" has been analyzed in fields such as library and information science, engineering, and art and culture contents. The master's theses focused on practical applications of "Muyedobotongji", whereas the doctoral theses approached it from a humanities perspective. Second, there are 72 papers related to "Muyedobotongji" in academic journals. In detail, there are 35 papers in sports science, 12 in Korean history, 7 in martial arts, 5 in dance studies, 4 in Korean studies, 2 in Chinese studies, 2 in art history, 1 in Japanese literature, and 1 in military science. This shows that "Muyedobotongji" is studied most actively in sports science. Third, future research on "Muyedobotongji" should consider three directions. First, interdisciplinary research is needed to complement the gaps in existing studies. Second, standard keywords need to be established; unified keywords would allow researchers to communicate across journals in different fields without confusion. Third, databases applicable to the martial arts field need to be built, which would give both Korean martial arts and "Muyedobotongji" opportunities to be put to use in culture contents.

A Study on Ontology Based Knowledge Representation Method with the Alzheimer Disease Related Articles (알츠하이머 관련 논문을 대상으로 하는 온톨로지 기반 지식 표현 방법 연구)

Lee, Jaeho;Kim, Younhee;Shin, Hyunkyung;Song, Kibong
- Journal of Internet Computing and Services / v.15 no.3 / pp.125-135 / 2014
In the medical field, building knowledge bases for the diagnosis and treatment of diseases has received a lot of attention. The most important part of building a knowledge base is representing the knowledge accurately. In this paper, we suggest a knowledge representation method that applies an ontology technique to datasets obtained from domestic papers on Alzheimer's disease, a topic that has recently drawn considerable attention in the medical field. The suggested ontology for Alzheimer's disease defines all the possible classes: lexical information from journals, such as 'author' and 'publisher', and research subjects extracted from 'title', 'abstract', 'keywords', and 'results'. It also includes various semantic relationships between classes through the ontology properties. Inference is supported because the ontology adopts a hierarchical tree structure for the classes and transitive characteristics for the properties. As a result, queries based on semantic representation are possible in addition to simple keyword queries, which enables inference-based knowledge queries using the ontology query language SPARQL.
https://doi.org/10.7472/jksii.2014.15.3.125
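The inference that the abstract attributes to a class hierarchy with transitive properties can be illustrated with a minimal sketch. The class names, the `SUBCLASS_OF` triples, and both helper functions below are hypothetical and chosen only for illustration; they are not taken from the paper's ontology, and a real system would express the same idea in an ontology language and query it with SPARQL.

```python
from collections import defaultdict

# Hypothetical toy hierarchy (child, parent); names are illustrative only.
SUBCLASS_OF = {
    ("AlzheimerPaper", "DementiaPaper"),
    ("DementiaPaper", "MedicalPaper"),
    ("MedicalPaper", "Paper"),
}


def superclasses(cls, subclass_of):
    """Return every class reachable from `cls` by following the
    subclass relation transitively (excluding `cls` itself)."""
    parents = defaultdict(set)
    for child, parent in subclass_of:
        parents[child].add(parent)
    seen = set()
    stack = [cls]
    while stack:
        current = stack.pop()
        for parent in parents[current]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def instances_of(target, types, subclass_of):
    """Return the sorted items whose declared class is `target`
    or any transitive subclass of `target`."""
    matched = []
    for item, cls in types.items():
        if cls == target or target in superclasses(cls, subclass_of):
            matched.append(item)
    return sorted(matched)
```

A plain keyword match on "MedicalPaper" would miss an item typed only as "AlzheimerPaper"; following the transitive relation is what lets a query for the broader class return it.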

Comparison of MEL-LPC and LPC-MEL Analysis Method for the Korean Speech Recognition Systems. (한국어 음성 인식 시스템을 위한 MEL-LPC 분석 방법과 LPC-MEL 분석 방법의 비교)

김주곤;김범국;정호열;정현열
- Proceedings of the IEEK Conference / 2001.09a / pp.833-836 / 2001
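To improve the performance of Korean speech recognition systems, this paper applies the MEL-LPC cepstrum, which has auditory frequency resolution, to a recognition system based on phoneme-unit HMMs (Hidden Markov Models) and compares and examines the results. LPC-MEL analysis, which warps the frequency axis as post-processing after linear prediction (LP) analysis, is widely used because it is computationally cheap and effective, but it does not improve frequency resolution much. Therefore, to improve frequency resolution, we built a phoneme-based, speaker-independent speech recognition system using MEL-LPC analysis, which warps the original speech signal directly onto the mel frequency scale and then performs linear prediction analysis, and we examined its effectiveness through comparative experiments against the conventional LPC-MEL analysis. The speech databases used were the ETRI 445-word DB for the phoneme and word recognition experiments and the KLE 4-connected-digit DB for the connected-digit recognition experiments. For speaker-independent phoneme recognition, a left-to-right model with 4 states and 3 outputs was used for 47 phoneme-like units, excluding silence. For word and connected-digit recognition, the OPDP method with a finite-state network was used. In the speaker-independent phoneme, word, and 4-connected-digit recognition experiments, the MEL-LPC cepstrum yielded higher recognition rates than the conventional LPC-MEL cepstrum, confirming the effectiveness of MEL-LPC analysis for Korean speech recognition systems.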

A Face Recognition Based Suspected Criminal Detection and Identification System (얼굴 인식 기반의 범죄 용의자 탐지 및 식별 시스템)

Lee, Jong-Uk;Kang, Bong-Su;Lee, Han-Sung;Park, Dae-Hee
- Proceedings of the Korean Information Science Society Conference / 2010.11a / pp.127-128 / 2010
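In this paper, we designed and implemented a system that detects and identifies criminal suspects registered on a criminal watch list, using face images acquired from CCTV surveillance video. In particular, the proposed suspect identification module, a hierarchical structure combining SVDD and SRC, has the following properties: 1) it first uses SVDD to quickly recognize only criminal suspects, so it does not perform unnecessary identification computation for ordinary people; 2) it applies SRC, whose robustness has already been verified under various conditions that degrade identification performance, to the suspect identification step, ensuring a stable and accurate identification system; 3) it ensures system reliability through a majority-voting strategy based on repeated use of the same biometric feature; 4) thanks to its incremental-update learning capability, it adapts actively to changes in the suspect watch-list database. We built our own KUFD (Korea University Face Database) and set up a simulated CCTV-based suspect detection and identification environment on campus to verify the performance of the proposed system experimentally.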
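The majority-voting strategy listed as property 3) above can be sketched as follows. This is only an illustration under stated assumptions: the function name `majority_vote`, the per-frame label list, and the rule that a label must win a strict majority are all hypothetical, since the abstract does not describe how the votes are counted.

```python
from collections import Counter


def majority_vote(frame_labels):
    """Combine per-frame identification results into one decision.

    Returns the label seen in more than half of the frames, or None
    when the list is empty or no label reaches a strict majority
    (the strict-majority rule is an assumption, not from the paper).
    """
    if not frame_labels:
        return None
    label, count = Counter(frame_labels).most_common(1)[0]
    return label if count > len(frame_labels) / 2 else None
```

Repeating the identification over several frames of the same face and voting in this way means a single misclassified frame does not change the final decision.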

Image Set Optimization for Real-Time Video Photomosaics (실시간 비디오 포토 모자이크를 위한 이미지 집합 최적화)

Choi, Yoon-Seok;Koo, Bon-Ki
- HCI Society of Korea: Conference Proceedings / 2009.02a / pp.502-507 / 2009
We present a real-time photomosaic method that uses a small image set optimized by feature selection. A photomosaic is an image divided into cells (usually a rectangular grid), each of which is replaced with another image of appropriate color, shape, and texture pattern. The method needs a large set of tile images covering various image patterns, but a large collection of photos makes pattern searching costly and requires a lot of storage. These requirements can cause problems in real-time applications or on mobile devices with limited resources. Our approach is a genetic feature selection method that builds an optimized image set to speed up pattern searching and minimize memory cost.
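The basic cell-to-tile replacement step that a photomosaic relies on can be sketched as below. This is a simplified illustration, not the paper's method: it matches on mean RGB color only, does not include the genetic feature selection, and the function names and the `tile_colors` mapping are hypothetical.

```python
def mean_color(pixels):
    """Average a non-empty list of (r, g, b) pixels into one (r, g, b) tuple."""
    n = len(pixels)
    return tuple(sum(p[c] for p in pixels) / n for c in range(3))


def nearest_tile(cell_pixels, tile_colors):
    """Pick the tile whose precomputed mean color is closest to the cell.

    `tile_colors` maps a tile name to its mean (r, g, b) color.
    Distance is squared Euclidean distance in RGB space (an assumption).
    """
    target = mean_color(cell_pixels)

    def dist2(name):
        return sum((a - b) ** 2 for a, b in zip(target, tile_colors[name]))

    return min(tile_colors, key=dist2)
```

Because every cell is compared against every tile, search cost grows with the size of the tile set, which is the cost a smaller, optimized image set is meant to reduce.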

A Study on PLU (Phone-Likely Unit) for Korean Continuous Speech Recognition (강건한 한국어 연속음성인식을 위한 유사음소단일에 대한 연구)

Seo Jun-Bae;Kim Joo-Gon;Kim Min-Jung;Jung Ho-Youl;Chung Hyun-Yeol
- Proceedings of the Acoustical Society of Korea Conference / spring / pp.37-40 / 2004
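This paper studies the number of context-dependent acoustic models that is efficient for Korean continuous speech recognition, comparing and evaluating recognition performance by the number of phone-like units. Our laboratory has used 48 phonemes as the basic recognition units, but continuous speech recognition uses context-dependent models, which already incorporate phonemes that account for allophones; taking this into account, reducing the basic phoneme set can be expected to lower computation and improve recognition performance. We therefore compared the existing 48-phoneme set with a reduced 39-phoneme set in recognition experiments. To this end, we merged databases from various tasks to expand the missing context elements and then ran the recognition experiments. The results confirmed that reducing the number of allophones caused no degradation in recognition performance, and that for continuous speech the 39-phoneme set improved recognition performance by about 10%.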

Multi-view Contents Production by Control of Depth Image (깊이영상 조절을 통한 다시점 콘텐츠 제작)

Bae, Yun-Jin;Lee, Yoon-Hyuk;Kim, Dong-Yoon;Lee, Jae-Won;Choi, Hyun-Jun;Seo, Young-Ho;Kim, Dong-Wook
- Proceedings of the Korean Society of Broadcast Engineers Conference / 2011.07a / pp.356-358 / 2011
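This paper proposes a method for producing 3D stereoscopic content using depth-image-based image synthesis and multi-view image generation. Depth images are captured, the depth information is adjusted, the result is composited as a layer-based image, and multi-view images are generated from it. Objects are captured with a camera system consisting of a depth camera and an RGB camera to acquire their 3D information, which is stored in a database and used to synthesize and generate 3D images. The 3D information of each object is adjusted in consideration of position and distance in the 3D image, and the objects are composited into a single image on a layer basis. The composited image is then turned into multi-view images for as many viewpoints as desired using a multi-view image generation tool. In this paper, we composited images of objects and people and generated 37-view multi-view images from each.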
