• Title/Summary/Keyword: Voice language

A Study On the ASP Module in Conversational Automatic Speech Recognition Flight Information System (대화형 음성 인식 항공정보 시스템에서의 ASP 모듈에 관한 연구)

  • 윤재석;장준식
    • Journal of the Korea Institute of Information and Communication Engineering / v.6 no.4 / pp.595-603 / 2002
  • This research shows how a computer can recognize and understand spoken natural language and its symbolization, using VoiceXML and Grammar Specific Language, in the development of a telephone-based conversational automatic speech recognition flight information system. So that users hear correct information, the ASP module was revised and its effectiveness was tested on the voice portal flight information system platform.

A Study On the Automatic Generation System of Mobile Voice Web Page (모바일 음성 웹 페이지의 자동 생성 시스템에 관한 연구)

  • You-Jung Ko;Yoon-Joong Kim
    • Proceedings of the Korea Information Processing Society Conference / 2008.11a / pp.153-156 / 2008
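  • Mobile devices have small screens, so using web content with a stylus or pen is inconvenient. Accordingly, VoiceXML (Voice Extensible Markup Language) and SALT (Speech Application Language Tags), standard languages for developing voice-based web content, are spreading rapidly. To use them, existing mobile web pages must be converted to conform to voice web standard technologies. This paper therefore aims to implement a system that uses SALT speech technology to automatically generate voice-command-enabled mobile voice web pages (WML + SALT) from mobile web pages written in WML (Wireless Markup Language). As a result, users gain convenience by controlling content through voice commands, and developers can use the automatic generation system to reduce the time and cost of converting existing mobile web pages into voice web pages.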

Chaucer's Storytelling: The Clerk's Tale in Terms of Bakhtin's Concept (초서의 이야기하기 -바흐친의 개념을 통해 본 「서생의 이야기」)

  • Lee, Dongchoon
    • Journal of English Language & Literature / v.53 no.2 / pp.281-306 / 2007
  • M. M. Bakhtin's dialogic concept of multi-voiced discourse allows us to open up the text of The Clerk's Tale and to account for its radical heterogeneity. Once we recognize the multi-voiced character of The Clerk's Tale, then what was heretofore regarded as discontinuous or ignored can be seen as the clash of several different world-views. Such a conceptual framework gives an added depth and scope to such thematic subjects as sovereignty, the status of women, and rhetorical style. There are three different and antagonistic voices involved in the tale's narration. These voices project different viewpoints or world-views, and they consequently engage each other in a polemical debate. Their relationship with each other is discontinuous and dialectical rather than continuous and harmonious. The first voice is the Petrarchan voice of moral allegory, which is the voice of tradition, authority, and high seriousness. This voice of moral allegory regards the story of Griselda as an exemplum of spiritual constancy and virtuous suffering. The second voice is the Clerkly voice of pathos based on human experience and feeling. This voice is defined by the Clerk's asides and apostrophes interspersed in the narrative proper, which function to engage the Petrarchan voice in a polemical debate. The third voice is the voice of parody, nominally identified with Chaucer the poet, which is located in the second ending, including the Envoy. Whereas the other two voices are earnest and serious, the voice of parody is irreverent, playful and antagonistic to both the Petrarchan voice of moral allegory and the Clerkly voice of secular humility.

The Interactive Voice Services based on VoiceXML (VoiceXML 기반 음성인식시스템을 이용한 서비스 개발)

  • Kim Hak-Gyoon;Kim Eun-Hyang;Kim Jae-In;Koo Myoung-Wan
    • MALSORI / no.43 / pp.113-125 / 2002
  • Because of the need to search Web information over wired or wireless telephones, the VoiceXML Forum was established to develop and promote the Voice eXtensible Markup Language (VoiceXML). VoiceXML simplifies the creation of personalized interactive voice response services on the Web and allows voice and phone access to information on Web sites and in call center databases. It can also use Web-based technologies such as CGI (Common Gateway Interface) scripts. In this paper, we developed a VoiceXML-based voice portal service platform called TeleGateway. It integrates voice services with data services using Automatic Speech Recognition (ASR) and Text-To-Speech (TTS) engines. We also demonstrate various services on the voice portal.

Design and Implementation of IVR Server Using VoiceXML (VoiceXML을 이용한 IVR 서버 설계 및 구현)

  • Lee, Chang-Ho;Jang, Won-Jo;Kang, Sun-Mee
    • Speech Sciences / v.9 no.3 / pp.47-59 / 2002
  • A new service using the human voice and DTMF (Dual Tone Multi Frequency) signaling is now expected to make valuable information on the internet easier to obtain. VoiceXML (Voice eXtensible Markup Language) makes this new service possible. This paper describes the design and implementation of an IVR (Interactive Voice Response) server using VoiceXML, which connects the internet and the IVR server efficiently. The IVR server consists of two parts: VoiceXML document handling and VoiceXML execution. The scenario part of the IVR server corresponds to the VoiceXML document, and execution is performed by the VoiceXML execution part.

Development of tangible language content system based on voice recording (음성녹음 기반의 실감형 어학시스템 콘텐츠 개발)

  • Na, Jong-Won
    • Journal of Advanced Navigation Technology / v.17 no.2 / pp.234-239 / 2013
  • Existing language-learning content suffers from poor learner concentration, and because such systems could not determine a learner's level, assessment depended on the judgment of individual teachers. To address this, the proposed system combines voice recording with ubiquitous and virtual reality technologies. A projector and an RFID reader are installed in each classroom, and RFID tags are attached to student ID cards so that English learning content matching each student's grade can be presented. The student holds a question-and-answer session with a foreigner shown in realistic virtual three-dimensional video content, while voice recording technology checks pronunciation and intonation and judges whether the level is passed or failed. Student learning data is saved to a database on a central server and then provided as information through a feedback process. This study analyzes the issues common to existing language-learning content and uses voice recording technology to address level-based classes, a problem that existing content had not solved.

The Perception and Production of Vietnamese Tones by Japanese, Lao and Taiwanese Second Language Speakers

  • Dao, Muc Dich;Anh, Thu T. Nguyen
    • SUVANNABHUMI / v.14 no.1 / pp.193-228 / 2022
  • This study investigates the production and perception of Vietnamese tones by Japanese, Lao, and Taiwanese second language (L2) learners [n=30], comparing their performance in an Imitation task to that of Identification and Read-Aloud tasks. The results show that the Imitation task is generally easier for L2 speakers than the Identification and Read-Aloud tasks, suggesting that imitation is performed without some of the skills required by the other two tasks. It is also found that Lao and Taiwanese speakers outperform Japanese speakers, suggesting that prior experience with one tone language facilitates the acquisition of tone in another language. The results on speakers' tonal range show that L2 learners have a significantly narrower tonal F0 range than control Vietnamese speakers [n=11]. The results of error pattern analysis and tonal transcription also suggest that non-modal voice (glottal stop and creakiness) and contour tones (bidirectional fall-rise) are more difficult for L2 learners than modal voice tones (e.g., unidirectional contours: rising, falling, and level).

A Study of Hybrid Automatic Interpret Support System (하이브리드 자동 통역지원 시스템에 관한 연구)

  • Lim, Chong-Gyu;Gang, Bong-Gyun;Park, Ju-Sik;Kang, Bong-Kyun
    • Journal of Korean Society of Industrial and Systems Engineering / v.28 no.3 / pp.133-141 / 2005
  • Previous research has focused mainly on the individual technologies of voice recognition, voice synthesis, translation, and bone transmission. Recently, commercial models have been produced using these technologies. This research proposes a new automated translation support system concept that combines established bone transmission technology with a wireless system. The proposed system has three major components. First, the hybrid system, consisting of a headset, bone transmission, and other technologies, recognizes the user's voice. Second, the recognized voice is converted into a digital signal by a small server attached to the user and then translated into the other user's language by a translation algorithm. Third, the translated language is transmitted wirelessly to the other party, where the hybrid system on the other party's computer converts the signal into voice. This hybrid system delivers a clear message regardless of the noise level in the environment or the user's hearing ability. With network technology, communication between users can also be transmitted clearly regardless of distance.

An Interactive Voice Web Browser Usable as a Multimodal Interface in Information Devices by Using VoiceXML

  • Jang, Min-Seok
    • Journal of the Korean Institute of Intelligent Systems / v.14 no.6 / pp.771-775 / 2004
  • The present Web environment is mostly composed of HTML (Hypertext Markup Language), so users obtain Web information mainly in a GUI (Graphical User Interface) environment, clicking a mouse to follow hyperlinked information. This is far less convenient than an environment in which information can be obtained easily by using the human voice. VoiceXML, which is derived from XML, supplies information over the telephone on the basis of mature voice recognition and synthesis technology and so addresses this inconvenience. This paper presents research results on a VoiceXML VUI (Voice User Interface) browser designed and implemented to realize this technology, and on the VoiceXML dialog designed for the browser's efficient use.

Comparison of Self-Reporting Voice Evaluations between Professional and Non-Professional Voice Users with Voice Disorders by Severity and Type (음성장애가 있는 직업적 음성사용자와 비직업적 음성사용자의 음성장애 중증도와 유형에 따른 자기보고식 음성평가 차이)

  • Kim, Jaeock
    • Phonetics and Speech Sciences / v.7 no.4 / pp.67-76 / 2015
  • The purpose of this study was to compare professional (Pro) and non-professional (Non-pro) voice users with voice disorders on self-reporting voice evaluations using the Korean-Voice Handicap Index (K-VHI) and Korean-Voice Related Quality of Life (K-VRQOL). The evaluations were also compared by voice quality and voice disorder type. 94 Pro and 106 Non-pro participants were asked to fill out the K-VHI and K-VRQOL, were perceptually evaluated on GRBAS scales, and were divided into three types of voice disorders (functional, organic, and neurologic) by an experienced speech-language pathologist and an otolaryngologist. The results showed that the functional (F) and physical (P) scores of K-VHI in the Pro group were significantly higher than those in the Non-pro group. As the voice quality evaluated by the G scale worsened, the scores of all aspects except emotional (E) of K-VHI and social-emotional (SE) of K-VRQOL were higher. All scores of K-VHI and K-VRQOL in neurologic voice disorders were significantly higher than those in functional and organic voice disorders. In conclusion, professional voice users are more sensitive to the functional and physical handicap resulting from their voice problems, and this is even more pronounced for patients with severe and neurologic voice disorders.