• Title/Summary/Keyword: User utterance


Design of Markov Decision Process Based Dialogue Manager (마르코프 의사결정 과정에 기반한 대화 관리자 설계)

  • Choi, Joon-Ki;Eun, Ji-Hyun;Chang, Du-Seong;Kim, Hyun-Jeong;Koo, Myong-Wan
    • Proceedings of the KSPS conference / 2006.11a / pp.14-18 / 2006
  • The role of a dialogue manager is to select proper actions based on the observed environment and the inferred user intention. This paper presents a stochastic model for a dialogue manager based on a Markov decision process (MDP). To build a mixed-initiative dialogue manager, we used the accumulated user utterances, the previous act of the dialogue manager, and domain-dependent knowledge as the input to the MDP. We also used a dialogue corpus to train the MDP policy, which is optimized automatically with a reinforcement learning algorithm. States that have unique and intuitive actions were removed from the MDP design by using the domain knowledge. The dialogue manager was combined with natural language understanding and a response generator to build a short-message-based remote control for home networked appliances.

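
The abstract above says the MDP policy is trained from a dialogue corpus with a reinforcement learning algorithm but does not name the algorithm. As a rough illustration only, the sketch below assumes tabular Q-learning over logged `(state, action, reward, next_state)` turns. The state names, action names, rewards, and hyperparameters are all hypothetical and are not taken from the paper.

```python
from collections import defaultdict

def train_policy(corpus, actions, alpha=0.5, gamma=0.9, epochs=50):
    """Learn a greedy policy from logged dialogue turns with tabular Q-learning.

    corpus:  list of (state, action, reward, next_state); next_state is None
             when the dialogue ends (hypothetical logging format).
    actions: list of all system actions.
    """
    q = defaultdict(float)  # Q-values default to 0.0
    for _ in range(epochs):
        for state, action, reward, next_state in corpus:
            # Value of the best follow-up action, or 0 at the end of a dialogue.
            best_next = 0.0 if next_state is None else max(
                q[(next_state, a)] for a in actions
            )
            # Standard Q-learning update toward the bootstrapped target.
            target = reward + gamma * best_next
            q[(state, action)] += alpha * (target - q[(state, action)])
    states = {s for s, _, _, _ in corpus}
    return {s: max(actions, key=lambda a: q[(s, a)]) for s in states}

# Hypothetical corpus for an appliance-control dialogue.
corpus = [
    ("no_slots", "ask_device", 0.0, "device_known"),
    ("no_slots", "execute", -1.0, None),
    ("device_known", "execute", 1.0, None),
    ("device_known", "ask_device", -0.2, "device_known"),
]
policy = train_policy(corpus, ["ask_device", "execute"])
```

In this toy setting the learned policy asks for the device while no slot is filled and executes the command once the device is known.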

CosmoScriBe 2.0 : The development of Korean transcription tools (CosmoScriBe 2.0: 한국어 전사 도구의 개발)

  • Kwak, Sun-Dong;Chang, Moon-Soo
    • Journal of the Korean Institute of Intelligent Systems / v.24 no.3 / pp.323-329 / 2014
  • In spoken language research, a transcription process is needed to convert voice data into text. A transcription tool, a program that supports transcription, handles various information such as the content and time of each utterance and speaker information. For this reason, inexperienced computer users have trouble becoming familiar with such programs. Moreover, since few transcription tools have been developed domestically in Korea, existing tools are usually not suitable for the Korean environment. In this paper, we propose a transcription tool that supports not only Korean transcription but also an easy-to-use interface for novices. A transcription support function is also provided to minimize mistakes that might occur during transcription, and a system structure is provided for data reliability. The usability of the proposed tool is evaluated according to transcription experience. The evaluation results show that the transcription process has become faster and the transcription support function more convenient.

Real-time Background Music System for Immersive Dialogue in Metaverse based on Dialogue Emotion (메타버스 대화의 몰입감 증진을 위한 대화 감정 기반 실시간 배경음악 시스템 구현)

  • Kirak Kim;Sangah Lee;Nahyeon Kim;Moonryul Jung
    • Journal of the Korea Computer Graphics Society / v.29 no.4 / pp.1-6 / 2023
  • Background music is often used to enhance immersive experiences in metaverse environments. However, the background music is mostly pre-matched and repeated, which can distract users because it does not align well with rapidly changing user-interactive content. Thus, we implemented a system that provides a more immersive metaverse conversation experience by 1) developing a regression neural network that extracts emotions from an utterance using KEMDy20, the Korean multimodal emotion dataset, 2) selecting music corresponding to the extracted emotions using the DEAM dataset, in which music is tagged with arousal-valence levels, and 3) combining these with a virtual space where users can have real-time conversations with avatars.
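
The music-selection step in this abstract can be pictured as a nearest-neighbour lookup in arousal-valence space. The sketch below is a minimal illustration under that assumption. The track names, their coordinates, the [-1, 1] value range, and the use of Euclidean distance are hypothetical and are not taken from the paper or from the DEAM dataset.

```python
import math

# Hypothetical catalogue: track name -> (valence, arousal), assumed to lie in [-1, 1].
TRACKS = {
    "calm_piano": (0.4, -0.6),
    "upbeat_pop": (0.8, 0.7),
    "tense_strings": (-0.6, 0.6),
    "sad_cello": (-0.7, -0.5),
}

def select_track(valence, arousal, tracks=TRACKS):
    """Return the track whose tagged (valence, arousal) is closest to the prediction."""
    return min(
        tracks,
        key=lambda name: math.dist(tracks[name], (valence, arousal)),
    )

# Emotion values as they might come from an utterance-level regression model.
happy_choice = select_track(0.7, 0.8)
sad_choice = select_track(-0.5, -0.5)
```

A real system would likely smooth the predictions over several utterances so the music does not switch on every turn; that choice is outside what the abstract states.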

A Situation-Based Dialogue Management with Dialogue Examples (대화 예제를 이용한 상황 기반 대화 관리 시스템)

  • Lee, Cheong-Jae;Jung, Sang-Keun;Lee, Geun-Bae
    • MALSORI / no.56 / pp.185-194 / 2005
  • In this paper, we present POSSDM (POSTECH Situation-Based Dialogue Manager) for a spoken dialogue system, which uses a new example-based and situation-based dialogue management technique for effective generation of appropriate system responses. A spoken dialogue system should generate cooperative responses to smoothly control the dialogue flow with users. We introduce a new dialogue management technique incorporating dialogue examples and situation-based rules for the EPG (Electronic Program Guide) domain. For system response inference, we automatically construct and index a dialogue example database from a dialogue corpus, and the best dialogue example for a proper system response is retrieved with a query built from the dialogue situation, including the current user utterance, dialogue act, and discourse history. When the dialogue corpus is not sufficient to cover the domain, we also apply manually constructed situation-based rules, mainly for meta-level dialogue management.


An analysis and correction of the phonological and syntactic errors in korean dialogues for a robust dialogue system (견고한 대화시스템을 위한 한국어 대화체의 음운론적, 구문론적 오류 분석 및 복구)

  • 김영길;김한우;최병욱
    • Journal of the Korean Institute of Telematics and Electronics C / v.34C no.5 / pp.55-65 / 1997
  • In many cases, a dialogue system cannot extract the correct analysis information from a user's spoken utterance because of ungrammatical components in the utterance. Therefore, in order to perform a correct analysis, the system must detect and correct these errors before it performs syntactic processing. Using a real dialogue corpus, we classify these ungrammatical errors into four categories, among them phonological, syntactic, and semantic errors, and propose an algorithm to detect and correct the errors. In short, this paper proposes a method to detect and correct speech repairs and inversions, which are classified as phonological and syntactic errors, in order to implement a robust dialogue system. Tests on real dialogue data show the efficiency of the proposed algorithm.


Text-Prompt Speaker Verification using Variable Threshold and Sequential Decision (가변 문턱치와 순차결정법을 통한 문맥요구형 화자확인)

  • Ahn, Sung-Joo;Kang, Sun-Mee;Ko, Han-Seok
    • Speech Sciences / v.7 no.4 / pp.41-47 / 2000
  • This paper concerns an effective text-prompted speaker verification method that improves verification performance. While various speaker verification methods have already been developed, their effectiveness has not yet been formally proven in terms of achieving an acceptable performance level. Traditional methods have also focused primarily on a single prompted utterance for verification. This paper instead proposes a sequential decision method with a variable threshold that handles two utterances for text-prompted speaker verification. Experimental results show that the proposed method outperforms speaker verification without the sequential decision by a factor of up to 3. From these results, we show that the proposed method is highly effective and achieves reliable performance suitable for practical applications.

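
The abstract above does not give the decision rule itself, so the following is only one plausible reading of a two-utterance sequential decision with a variable threshold. The function name, the three threshold values, and the averaging of the two scores are assumptions made for illustration, not the authors' method.

```python
def verify(scores, accept_at=0.8, reject_at=0.4, final_threshold=0.6):
    """Decide on a speaker claim from up to two utterance scores.

    scores: similarity scores in [0, 1], in the order the utterances were spoken.
    The first utterance is judged against a strict accept threshold and a
    lenient reject threshold; only uncertain cases trigger a second prompt,
    which is then judged on the averaged score with a different threshold.
    All threshold values here are hypothetical.
    """
    first = scores[0]
    if first >= accept_at:
        return "accept"
    if first < reject_at:
        return "reject"
    if len(scores) < 2:
        # Uncertain region: the system should prompt a second utterance.
        return "need_second_utterance"
    combined = (first + scores[1]) / 2
    return "accept" if combined >= final_threshold else "reject"

confident = verify([0.9])
uncertain = verify([0.6])
resolved = verify([0.6, 0.7])
```

The design intent is that clear cases finish after one utterance, while borderline cases gather more evidence before a final decision.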

Post-Processing of Speech Recognition Using User Utterance Sequential Pattern (사용자 발화 순차패턴을 이용한 음성인식 후처리)

  • Song, Won-Moon;Kim, Eun-Ju;Kim, Myung-Won
    • Proceedings of the Korean Information Science Society Conference / 2005.07b / pp.709-711 / 2005
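  • Recently, various post-processing techniques have been studied in the field of speech recognition to obtain more reliable results than recognition based on simple signal processing of the uttered speech. This paper proposes a speech recognition post-processing technique that takes user information into account by applying the user's utterance information to post-processing in a voice command recognition environment for an individual user. First, sequential pattern rules of command utterances are extracted from previously used voice commands. The sequence of commands the user has just uttered is then compared against these patterns to determine the words that can follow according to the sequential rules. Taking these words into account, the probability values of the recognizer's candidate words are adjusted appropriately, and the final recognized word is re-determined. For appropriate adjustment in this process, we propose a correction method that considers the confidence of the utterance sequential patterns and the recognizer's result words. Experiments confirm that speech recognition with the proposed post-processing lowers the error rate by more than 15% compared with baseline HMM-based speech recognition, contributing substantially to the recognition rate.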

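
The rescoring step described in this abstract can be sketched as follows. The abstract does not state the exact correction formula, so the multiplicative boost, the `boost` weight, and the example commands below are assumptions for illustration only.

```python
def rescore(candidates, predicted, boost=1.0):
    """Re-rank recognizer candidates using sequential-pattern predictions.

    candidates: dict of word -> recognizer probability.
    predicted:  dict of word -> confidence (0..1) of the sequential rule that
                predicts this word after the user's previous commands.
    boost:      hypothetical weight for how strongly the rules shift scores.
    Returns the best word and the renormalized scores.
    """
    # Raise the score of each candidate in proportion to its rule confidence.
    adjusted = {
        word: prob * (1.0 + boost * predicted.get(word, 0.0))
        for word, prob in candidates.items()
    }
    total = sum(adjusted.values())
    normalized = {word: score / total for word, score in adjusted.items()}
    best = max(normalized, key=normalized.get)
    return best, normalized

# The recognizer slightly prefers "pause", but the user's command history
# strongly predicts "play" as the next command (hypothetical values).
candidates = {"play": 0.40, "pause": 0.45, "stop": 0.15}
best_word, scores = rescore(candidates, {"play": 0.8})
baseline_word, _ = rescore(candidates, {})
```

With no matching rule the recognizer's own ranking is left unchanged, so the post-processing only intervenes when the user's history supplies evidence.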

A Situation-Based Dialogue Management with Dialogue Examples (대화 예제를 이용한 상황 기반 대화 관리 시스템)

  • Lee, Cheon-Jae;Jung, Sang-Keun;Lee, Geun-Bae
    • Proceedings of the KSPS conference / 2005.11a / pp.113-115 / 2005
  • In this paper, we present POSSDM (POSTECH Situation-Based Dialogue Manager) for a spoken dialogue system, which uses new example-based and situation-based dialogue management techniques for effective generation of appropriate system responses. A spoken dialogue system should generate cooperative responses to smoothly control the dialogue flow with users. We introduce a new dialogue management technique incorporating dialogue examples and situation-based rules for the EPG (Electronic Program Guide) domain. For system response inference, we automatically construct and index a dialogue example database from a dialogue corpus, and the best dialogue example for a proper system response is retrieved with a query built from the dialogue situation, including the current user utterance, dialogue act, and discourse history. When the dialogue corpus is not sufficient to cover the domain, we also apply manually constructed situation-based rules, mainly for meta-level dialogue management.


A Study for Improvement of User Consent UI / UX according to Personal Information Utterance in Smart Speaker (스마트 스피커에서 개인정보 발화에 따른 사용자 동의 UI/UX 개선 연구)

  • Jung, Jae-Eun;Park, Hyoju;Yang, Jinhong
    • Proceedings of the Korea Information Processing Society Conference / 2019.10a / pp.414-417 / 2019
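  • Unlike existing services, smart speakers not only collect data by voice but also utter through the speaker, that is, deliver by sound, information processed from the collected data. With this service response structure, there is a risk that the user's personal information is included in the information delivered by voice through the speaker. By analyzing the consent process during the initial setup of Google and Amazon smart speakers, we found that it is difficult for users to clearly recognize the risk of personal information being uttered. This study therefore proposes the following UI/UX improvements to raise users' awareness of this risk in the user consent process of smart speaker services: 1) state the risk of personal information utterance in the terms and present it on a separate screen, 2) allow users to consent to services freely, 3) present separately the personal information delivered to the controller and the personal information that may be uttered through the speaker, and 4) add a voice notice and consent step regarding the risk of personal information utterance.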

Database Interface System with Dialog (대화를 통한 데이타베이스 인터페이스 시스템)

  • Woo, Yo-Seop;Kang, Seok-Hoon
    • The Transactions of the Korea Information Processing Society / v.3 no.3 / pp.417-428 / 1996
  • In this paper, a database interface system with natural language dialogue is designed and implemented. The system consists of language analysis, context processing, dialogue processing, and DB processing units. A method for classifying and processing undefined words in language analysis is proposed; it reduces the dictionary size, which otherwise causes difficulties in DB interfaces. Existing DB interfaces have dealt with each input utterance independently, whereas the system in this paper provides an interface environment in which the user can hold a continuous conversation with the system and retrieve DB information. To this end, speech acts that include the user's intentions as well as propositional contents are defined, and a hierarchical user action model for library DB retrieval is constructed. The system uses this defined knowledge to recognize the user's plan, effectively understanding and managing the ongoing dialogue. The system is implemented in the domain of a library database in order to validate the proposed methods.
