Integrated Search | Korea Science

A Study on Realization of Speech Recognition System based on VoiceXML for Railroad Reservation Service

김범승;김순협
- Journal of the Korean Society for Railway (한국철도학회논문집), Vol. 14, No. 2, pp. 130-136, 2011
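This paper proposes a method for implementing real-time speech recognition using VoiceXML in a SIP-based telephony environment for a railroad reservation service. In the proposed method, speech signals arriving over the PSTN or the Internet are handled by VoiceXML dialog processing; the transmitted speech is processed by the speech recognition system, and the recognition result is returned to the VoiceXML dialog and delivered to the user. The VASR system consists of a Dialog server that handles dialogs, an APP server that handles speech signals, and a speech recognition system that performs recognition. To process speech signals in the telephony environment, the authors implemented a scheme that records the speech signal with the VoiceXML Record tag, plays it back in real time, and transmits it to the speech recognition system.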
https://doi.org/10.7782/JKSR.2011.14.2.130

Performance Analysis of Packet CDMA R-ALOHA for Multi-media Integration in Cellular Systems with Adaptive Access Permission Probability

Kyeong Hur;Eom, Doo-Seop;Tchah, Kyun-Hyon
- The Journal of Korean Institute of Communications and Information Sciences (한국통신학회논문지), Vol. 25, No. 12B, pp. 2109-2119, 2000
In this paper, the Packet CDMA Reservation ALOHA protocol is proposed to efficiently support multi-traffic services, such as voice and videophone services with handoff calls, high-rate data, and low-rate data services, over multi-rate transmission in uplink cellular systems. To reduce MAI, the paper presents a frame structure composed of an access slot and a transmission slot, together with an access permission probability based on the estimated number of contending users for each service. Voice and videophone handoff calls are assured priority through a higher access permission probability. Through the proposed code assignment scheme, the voice service can be provided without the voice packet dropping that occurs in the CDMA/PRMA protocols. Code reservation is allowed for the voice and videophone services. To use codes efficiently, the low-rate data service uses the codes available during the silent periods of voice calls and the remaining codes among those assigned to the voice service. The high-rate data service uses the codes assigned to it and the remaining codes among those assigned to the videophone service. Using a Markov-chain subsystem model for each service, including handoff calls in uplink cellular systems, the steady-state performance is simulated and analyzed. The examples show that, with the proposed code assignment scheme and access permission probability, the Packet CDMA Reservation ALOHA protocol can guarantee priority and constant QoS for handoff calls even with a large number of contending users. The data services are also integrated efficiently over the multi-rate transmission.

Study on the Situational satisfaction survey of Smart Phone based on voice recognition technology

이윤정;김승인
- Journal of Digital Convergence (디지털융복합연구), Vol. 15, No. 8, pp. 351-357, 2017
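This study analyzed smartphone voice recognition services and, through a survey of satisfaction by usage situation, examined the relationship between users' expectations and their satisfaction in order to find ways to improve voice recognition services. First, a literature review covered the concept and current state of voice recognition services; second, a survey was conducted using a questionnaire built around the five Ws and one H. The results showed that users use smartphone voice recognition most often to make phone calls and mainly when they are alone; usage was fairly even across times of day but highest in the evening. The service was used most often at home and when the user's hands were not free. From these situation-specific results, the authors derived several proposals, including personalized services, condition recognition, automatic recognition of emergencies, and voice unlocking. The study is expected to be useful for improving domestic smartphone voice recognition services and for developing wearable devices.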
https://doi.org/10.14400/JDC.2017.15.8.351

Voice Service Architecture in IMT-2000 using Voice Gateway

Kim, Moo-Wan;Kim, Kwang-Sik
- Proceedings of the IEEK Conference (대한전자공학회:학술대회논문집), The Institute of Electronics Engineers of Korea 2000 ITC-CSCC -2, pp. 757-759, 2000
This paper proposes a new voice service network architecture for initial IMT-2000 and describes the features of the Voice Gateway, which is a core entity of the proposed architecture. It also describes the system configuration of a prototype of the proposed architecture and the software configuration of the Voice Gateway in that prototype.

A Study on VoiceXML Application of User-Controlled Form Dialog System

권형준;노용완;이현구;홍광석
- The KIPS Transactions: Part B (정보처리학회논문지B), Vol. 14B, No. 3, pp. 183-190, 2007
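VoiceXML is a new XML-based markup language designed to provide navigation of web resources by voice. Applications built with VoiceXML are classified into machine-directed form dialog structures and mixed-initiative form dialog structures. Because the service scenario in these dialog structures is determined by the application developer, they cannot support services in which the user navigates web resources freely. To build voice web services whose scenario is determined by the user's intent, this paper proposes a VoiceXML application structure for a user-controlled form dialog system. The proposed application automatically detects recognition candidates from the information requested by the user, uses them as voice anchors, and links each voice anchor to a new voice node. As an example of the proposed system, the authors implemented a news service with a built-in IT terminology dictionary, verified the detection and registration of voice anchors, and measured the speech recognition rate, the hit rate (a measure of whether the information the user intended was successfully provided), and the response time. The experiments confirmed that the proposed system allows freer navigation of web resources than systems with the existing VoiceXML form dialog structures.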
https://doi.org/10.3745/KIPSTB.2007.14-B.3.183

The Development of Heuristics for Voice Shopping Service through Voice Interface with Display

권현정;이지연
- Journal of the Korean Society for Information Management (정보관리학회지), Vol. 39, No. 2, pp. 1-33, 2022
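Voice shopping services, in which products are purchased by voice, are expected to be commercialized in earnest in the future. In preparation for a future in which voice shopping becomes an everyday activity worldwide, this study developed heuristics for voice shopping services that use a voice interface with a display. First, as a theoretical approach, the authors reviewed 50 papers on design principles for 'visual interfaces', 'voice interfaces', and 'shopping services' and drafted a total of 29 design principles. Second, as an empirical approach, they conducted focus group interviews on shopping experiences along the consumer decision-making process and on information-seeking behavior in shopping contexts, and prepared draft heuristics that supplement the user-experience aspects the literature review had covered insufficiently. Third, in a Delphi survey, they asked 20 experts in UX, service planning, AI development, and shopping to evaluate the draft heuristics developed in the two preceding steps. The final heuristics were proposed after three rounds of the Delphi survey.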
https://doi.org/10.3743/KOSIM.2022.39.2.001

A Case of Voice Therapy for Patient Who Voice Changed after Total Thyroidectomy Using Contactless Voice and Speech Therapy Service Platform

이길준;박수나
- Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics (대한후두음성언어의학회지), Vol. 32, No. 1, pp. 43-47, 2021
Voice therapy is effective in many voice and speech disorders. However, patients have limited access to therapeutic facilities for reasons unrelated to their disease, such as lack of time and the COVID-19 pandemic. Contactless voice therapy could be an alternative and may be helpful to all patients with voice and speech problems. We developed a contactless voice and speech therapy program to improve accessibility. Herein, we report the first case in Korea of voice therapy delivered through a contactless voice and speech therapy service platform, for a 30-year-old female patient who complained of voice change after total thyroidectomy.
https://doi.org/10.22469/jkslp.2021.32.1.43

An acoustic study of feeling information extracting method

이연수;박용범
- The Journal of the Institute of Internet, Broadcasting and Communication (한국인터넷방송통신학회논문지), Vol. 10, No. 1, pp. 51-55, 2010
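Call centers and similar operations now serve customers through voice media. One way to measure the quality of the various services such call centers provide is based on the emotion of the speakers in a voice conversation. This study aimed to identify a person's emotion from the speaker's voice. To this end, several parameters were extracted from the speech signal and analyzed to classify human emotion. Human emotion can be broadly divided into four states: joy, sadness, excitement, and neutral. For most voice service quality assessment, the excited or angry state is the one that matters. This paper studies a method that uses five factors based on pitch and amplitude to efficiently single out problematic conversations between speakers.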

Voice Quality Criteria for Heterogenous Network Communication Under Mobile-VoIP Environments

Choi, Jae-Hun;Seol, Soon-Uk;Chang, Joon-Hyuk
- The Journal of the Acoustical Society of Korea, Vol. 28, No. 3E, pp. 99-108, 2009
In this paper, we suggest criteria for the objective measurement of speech quality in mobile VoIP (Voice over Internet Protocol) services over wireless mobile internet such as mobile WiMAX networks. The criteria address the case where voice communication spans different networks: although mobile VoIP users on a packet-based mobile internet network call PSTN and mobile network users, there have been no relevant quality indexes or quality standards for evaluating the speech quality of mobile VoIP. In addition, many factors influence speech quality in a packet network. In particular, if speech degraded by packet loss is transferred to users on another network through a handover, voice communication quality deteriorates significantly because of the transformation of speech codecs. We adopt the Gilbert-Elliott channel model to characterize the packet network and assess voice quality with the objective speech quality method of ITU-T P.862.1 MOS-LQO, for various call scenarios from a mobile VoIP user to PSTN and mobile network users under various packet loss rates in the transmission channel. Our simulation results show that the transformation of speech codecs degrades speech quality across the different transmission channel environments when mobile VoIP users call PSTN and mobile network users.

APPLICATION OF KOREAN TEXT-TO-SPEECH FOR X.400 MHS SYSTEM

Kim, Hee-Dong;Koo, Jun-Mo;Choi, Ho-Joon;Kim, Sang-Taek
- Proceedings of the Acoustical Society of Korea Conference (한국음향학회:학술대회논문집), The Acoustical Society of Korea 1994 Fifth Western Pacific Regional Acoustics Conference, Seoul, Korea, pp. 885-892, 1994
This paper presents a Korean text-to-speech (TTS) algorithm with speed and intonation control, and describes the development of a voice message delivery system that employs this TTS algorithm. The system allows users of the Interpersonal Messaging (IPM) Service of a Message Handling System (MHS) to send their text messages to a recipient over a telephone line using synthetic voice. The X.400 MHS recommendation does not specify protocols or service elements for a voice message delivery system. We therefore defined an access protocol and service elements for the Voice Access Unit, based on the application program interface for message transfers between the X.400 Message Transfer Agent and the Voice Access Unit. The system architecture and operations are also described.

Search results: 818 items; processing time: 0.026 seconds