Morphology Representation using STT API in Rasbian OS

Woo, Park-jin;Im, Je-Sun;Lee, Sung-jin;Moon, Sang-ho;

한국정보통신학회:학술대회논문집 (Proceedings of the Korean Institute of Information and Commucation Sciences Conference)

한국정보통신학회 (The Korea Institute of Information and Commucation Engineering)

Rasbian OS에서 STT API를 활용한 형태소 표현에 대한 연구

Morphology Representation using STT API in Rasbian OS

박진우 (부산외국어대학교) ;
임재순 (부산외국어대학교) ;
이성진 (부산외국어대학교) ;
문상호 (부산외국어대학교)

Woo, Park-jin (Busan University of Foreign Studies) ;
Im, Je-Sun (Busan University of Foreign Studies) ;
Lee, Sung-jin (Busan University of Foreign Studies) ;
Moon, Sang-ho (Busan University of Foreign Studies)

발행 : 2021.10.03

PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

국어의 경우 교착어이기 때문에 영어와 같이 어절 토큰화를 통하여 태깅할 경우 발전 가능성이 영어 보다 낮은 편이다. KoNLPy를 통해 형태소 단위로 분리하여 코퍼스를 토큰화한 형태를 그래프 데이터 베이스로 표현이 되지만 해당 모듈을 그래프 데이터베이스에서 코퍼스로 변환 시 음성파일의 완전 분리 및 실용성에 대한 검증이 필요하다. 본 논문에서는 Raspberry Pi에서 STT API를 활용한 형태소 표현을 나타내고 있다. 코퍼스로 변환된 음성 파일을 KoNLPy로 형태소 분석 후 태깅한다. 분석된 결과는 그래프 데이터베이스로 표현되며 형태소별로 나누어진 토큰으로 구분할 수 있음이 확인되었고, 실용성과 분리 정도를 판단하여 특정 목적성을 지닌 데이터 마이닝 추출이 가능한 것으로 판단된다.

In the case of Korean, the possibility of development is lower than that of English if tagging is done through the word tokenization like English. Although the form of tokenizing the corpus by separating it into morpheme units via KoNLPy is represented as a graph database, full separation of voice files and verification of practicality is required when converting the module from graph database to corpus. In this paper, morphology representation using STT API is shown in Raspberry Pi. The voice file converted to Corpus is analyzed to KoNLPy and tagged. The analyzed results are represented by graph databases and can be divided into tokens divided by morpheme, and it is judged that data mining extraction with specific purpose is possible by determining practicality and degree of separation.

한국정보통신학회:학술대회논문집 (Proceedings of the Korean Institute of Information and Commucation Sciences Conference)

Rasbian OS에서 STT API를 활용한 형태소 표현에 대한 연구

Morphology Representation using STT API in Rasbian OS

초록

키워드

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)