Search | Korea Science

Framework Switching of Speaker Overlap Detection System (화자 겹침 검출 시스템의 프레임워크 전환 연구)

Kim, Hoinam;Park, Jisu;Cha, Shin;Son, Kyung A;Yun, Young-Sun;Park, Jeon Gue
- Journal of Software Assessment and Valuation
- /
- v.17 no.1
- /
- pp.101-113
- /
- 2021
In this paper, we introduce a speaker overlap system and look at the process of converting the existed system on the specific framework of artificial intelligence. Speaker overlap is when two or more speakers speak at the same time during a conversation, and can lead to performance degradation in the fields of speech recognition or speaker recognition, and a lot of research is being conducted because it can prevent performance degradation. Recently, as application of artificial intelligence is increasing, there is a demand for switching between artificial intelligence frameworks. However, when switching frameworks, performance degradation is observed due to the unique characteristics of each framework, making it difficult to switch frameworks. In this paper, the process of converting the speaker overlap detection system based on the Keras framework to the pytorch-based system is explained and considers components. As a result of the framework switching, the pytorch-based system showed better performance than the existing Keras-based speaker overlap detection system, so it can be said that it is valuable as a fundamental study on systematic framework conversion.
https://doi.org/10.29056/jsav.2021.06.13 인용

A Study on the Development of Automatic Schedule Management System through Speech Recognition Text Analysis (음성인식 텍스트 분석을 통한 자동 일정 관리 시스템 개발에 관한 연구)

Lee, Hae-Mi;Cho, We-Duke
- Proceedings of the Korea Information Processing Society Conference
- /
- 2022.05a
- /
- pp.279-282
- /
- 2022
컴퓨터가 마이크 등의 소리 센서를 통해 얻은 음향학적 신호를 단어나 문장으로 변환시키는 기술인 음성 인식 기술과 인공지능 기술을 결합한 음성 대화 시스템에 대한 연구 진행 및 제품 출시가 활발하게 이루어지고 있다. 기존의 시스템을 사용하면서 날짜와 시간 외의 정보 추출 정도가 빈약하거나 자동 등록이 되지 않는 문제점을 확인하였다. 음성 인식 기술을 통해 얻은 텍스트에서 보다 많은 정보를 추출하고, 자동 등록 및 알림과 맛집 등 추가 정보 제공 시스템을 구축하는 것을 목표로 하였다.
https://doi.org/10.3745/PKIPS.y2022m05a.279 인용 PDF

Developing a New Algorithm for Conversational Agent to Detect Recognition Error and Neologism Meaning: Utilizing Korean Syllable-based Word Similarity (대화형 에이전트 인식오류 및 신조어 탐지를 위한 알고리즘 개발: 한글 음절 분리 기반의 단어 유사도 활용)

Jung-Won Lee;Il Im
- Journal of Intelligence and Information Systems
- /
- v.29 no.3
- /
- pp.267-286
- /
- 2023
The conversational agents such as AI speakers utilize voice conversation for human-computer interaction. Voice recognition errors often occur in conversational situations. Recognition errors in user utterance records can be categorized into two types. The first type is misrecognition errors, where the agent fails to recognize the user's speech entirely. The second type is misinterpretation errors, where the user's speech is recognized and services are provided, but the interpretation differs from the user's intention. Among these, misinterpretation errors require separate error detection as they are recorded as successful service interactions. In this study, various text separation methods were applied to detect misinterpretation. For each of these text separation methods, the similarity of consecutive speech pairs using word embedding and document embedding techniques, which convert words and documents into vectors. This approach goes beyond simple word-based similarity calculation to explore a new method for detecting misinterpretation errors. The research method involved utilizing real user utterance records to train and develop a detection model by applying patterns of misinterpretation error causes. The results revealed that the most significant analysis result was obtained through initial consonant extraction for detecting misinterpretation errors caused by the use of unregistered neologisms. Through comparison with other separation methods, different error types could be observed. This study has two main implications. First, for misinterpretation errors that are difficult to detect due to lack of recognition, the study proposed diverse text separation methods and found a novel method that improved performance remarkably. Second, if this is applied to conversational agents or voice recognition services requiring neologism detection, patterns of errors occurring from the voice recognition stage can be specified. The study proposed and verified that even if not categorized as errors, services can be provided according to user-desired results.
https://doi.org/10.13088/jiis.2023.29.3.267 인용 PDF

A Study on Named Entity Recognition for Effective Dialogue Information Prediction (효율적 대화 정보 예측을 위한 개체명 인식 연구)

Go, Myunghyun;Kim, Hakdong;Lim, Heonyeong;Lee, Yurim;Jee, Minkyu;Kim, Wonil
- Journal of Broadcast Engineering
- /
- v.24 no.1
- /
- pp.58-66
- /
- 2019
Recognition of named entity such as proper nouns in conversation sentences is the most fundamental and important field of study for efficient conversational information prediction. The most important part of a task-oriented dialogue system is to recognize what attributes an object in a conversation has. The named entity recognition model carries out recognition of the named entity through the preprocessing, word embedding, and prediction steps for the dialogue sentence. This study aims at using user - defined dictionary in preprocessing stage and finding optimal parameters at word embedding stage for efficient dialogue information prediction. In order to test the designed object name recognition model, we selected the field of daily chemical products and constructed the named entity recognition model that can be applied in the task-oriented dialogue system in the related domain.
https://doi.org/10.5909/JBE.2019.24.1.58 인용 PDF KSCI KPUBS HTML

Retrieval-based Chat Model using Index-Term Normalization and Answer Filtering (색인어 정규화 및 응답 필터링을 이용한 검색기반 채팅 모델)

Lee, Hyeon-gu;Kim, Minkyoung;Kim, Jintae;Kim, Harksoo;Lee, Yeonsoo;Choi, Maengsik
- Annual Conference on Human and Language Technology
- /
- 2017.10a
- /
- pp.197-200
- /
- 2017
채팅 모델은 인간과 컴퓨터가 신변잡기 대화를 나눌 수 있게 해주는 시스템으로 빠른 속도로 발전하는 인공지능 음성언어 비서 시스템에 필수적으로 사용되는 기술이다. 본 논문에서는 검색기반 채팅 모델에서 발생하는 검색 효율 문제와 정확하지 못한 답변을 출력하는 문제를 해결하기 위해 색인어 정규화와 응답 필터링이 적용된 검색기반 채팅 모델을 제안한다. 색인어 정규화를 통해 99.3%의 색인 커버리지를 확보하였으며 필터링 모델을 통해 기존 검색 모델에서보다 향상된 사용자 만족도를 얻었다.
PDF

Retrieval-based Chat Model using Index-Term Normalization and Answer Filtering (색인어 정규화 및 응답 필터링을 이용한 검색기반 채팅 모델)

Lee, Hyeon-gu;Kim, Minkyoung;Kim, Jintae;Kim, Harksoo;Lee, Yeonsoo;Choi, Maengsik
- 한국어정보학회:학술대회논문집
- /
- 2017.10a
- /
- pp.197-200
- /
- 2017
채팅 모델은 인간과 컴퓨터가 신변잡기 대화를 나눌 수 있게 해주는 시스템으로 빠른 속도로 발전하는 인공지능 음성언어 비서 시스템에 필수적으로 사용되는 기술이다. 본 논문에서는 검색기반 채팅 모델에서 발생하는 검색 효율 문제와 정확하지 못한 답변을 출력하는 문제를 해결하기 위해 색인어 정규화와 응답 필터링이 적용된 검색기반 채팅 모델을 제안한다. 색인어 정규화를 통해 99.3%의 색인 커버리지를 확보하였으며 필터링 모델을 통해 기존 검색 모델에서보다 향상된 사용자 만족도를 얻었다.
PDF

Mobile Food Recommendation System for Patients U sing Light-weight Deep Learning and Knowledge Bases (경량 딥러닝과 지식베이스를 활용한 모바일 질환별 식품 추천 시스템)

Hyeon, Bumsu;Kim, Dohyun;Lee, SangKeun
- Proceedings of the Korea Information Processing Society Conference
- /
- 2020.05a
- /
- pp.534-535
- /
- 2020
본 논문에서는 딥러닝과 지식베이스를 융합하여 활용한 질환 인식 및 식품 추천 시스템을 제안한다. 제안하는 시스템은 온전히 모바일 디바이스 내에서 작동하는 시스템이다. 본 시스템은 압축된 딥러닝 모델을 이용해 사용자 대화 텍스트를 분석하여 사용자의 질환을 예측한다. 그 후, 지식베이스를 기반으로 해당 질환 관리에 도움이 되는 식품을 매칭하고 사용자에게 추천한다. 이는 사용자 친화적 헬스케어 애플리케이션으로써 체크리스트 작성 등 번거로운 작업 없이도 사용자에게 유용한 건강 정보를 제공할 수 있다.
https://doi.org/10.3745/PKIPS.y2020m05a.534 인용 PDF

Korean Paraphrase Corpus and Building Guidelines for Sentence Similarity Analysis (문장 유사성 분석을 위한 한국어 패러프레이즈 말뭉치 및 구축 가이드라인)

Oh, Kyo-Joong;Kim, Hyunmin;Ko, Bowon;Nam, Jehyun;Choi, Ho-Jin
- Annual Conference on Human and Language Technology
- /
- 2019.10a
- /
- pp.527-530
- /
- 2019
최근 각 산업분야에서 대화 시스템과 챗봇 기술의 업무로의 도입이 활발해짐에 따라 한국어 패러프레이즈 기술에 대한 관심이 높아지고 있다. 기존에는 연구와 평가 목적으로 규모는 작아도 잘 정제된 평가셋을 만드는 것이 중요했으나, 최근에는 기계학습 기술의 발달로 학습을 위한 일정 수준의 품질을 보장하는 대량의 말뭉치를 빠르게 확보하는 방법이 중요해지고 있다. 본 논문에서는 현재 수행하고 있는 한국어 패러프레이즈 말뭉치 구축 경험과 방법에 대해 소개한다.
PDF

Control of Multi-Home Devices Using AI Vision and Generative AI (AI 비전과 생성형 AI 를 이용한 멀티 홈 디바이스 제어)

Su-Min Hong;Su-Min Kim;Su-Hee Song;Chae-Yeon Ahn
- Proceedings of the Korea Information Processing Society Conference
- /
- 2023.11a
- /
- pp.1037-1038
- /
- 2023
기술의 발전으로 인해 스마트 가전제품이 늘어나며 스마트 홈 기술이 주목을 받고 있다. 그러나 이러한 기술은 설정과정의 복잡성으로 사용자들이 쉽게 접근하기 어렵다. 특히 디지털 기기 사용에 익숙하지 않은 사용자들을 스마트 홈 기술로부터 소외시키는 결과를 낳고 있다. 본 논문에서는 사용자 친화적인 스마트 홈 시스템을 제안한다. 사용자의 시선 방향을 추적하여 디바이스를 선택하고 간단한 인터페이스의 컨트롤러로 디바이스를 손쉽게 조작할 수 있도록 한다. 또한, 생성형 인공지능과 RAG 를 결합하여 사용자가 가전제품과 자연스럽게 대화하며 정보를 얻을 수 있는 인터페이스를 제공한다.
https://doi.org/10.3745/PKIPS.y2023m11a.1037 인용 PDF

Study on Curator of Tourist Attractions using Chatbot (관광지 교육을 위한 교육용 챗봇 큐레이터)

Park, Jong-hyun;Kim, Im-yeoreum;Ryu, Gi-Hwan
- The Journal of the Convergence on Culture Technology
- /
- v.8 no.6
- /
- pp.303-308
- /
- 2022
A chatbot is a responsive chatting program that can communicate with people through text or voice. 'Siri' and 'Bixby' installed in smartphones are also representative artificial intelligences that use the chatbot system. With the rapid development of chatbots, users in various fields have also begun to pay attention to the food service industry. As machine learning technology developed, it became possible to use more flexible conversations, and it soon expanded to the realm of education. Userㄴs interact through conversations with chatbots, and active interactions stimulate users' desires and at the same time have a positive effect on learning motivation. Recommendation system programs using chatbots not only recommend products according to users' preferences, but also provide various additional information. This study planned a program that combined the chatbot system and tourism service. The chatbot curator will develop into a form of inducing interest and curiosity to users through learning, and then facilitating the desire for tourism. The purpose of this study is to lay the foundation for a chatbot curator based on previous studies.
https://doi.org/10.17703/JCCT.2022.8.6.303 인용 PDF KSCI

Search Result 87, Processing Time 0.032 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)