통합 검색 | Korea Science

Classical Tamil Speech Enhancement with Modified Threshold Function using Wavelets

Indra., J;Kasthuri., N;Navaneetha Krishnan., S
- Journal of Electrical Engineering and Technology
- /
- 제11권6호
- /
- pp.1793-1801
- /
- 2016
Speech enhancement is a challenging problem due to the diversity of noise sources and their effects in different applications. The goal of speech enhancement is to improve the quality and intelligibility of speech by reducing noise. Many research works in speech enhancement have been accomplished in English and other European Languages. There has been limited or no such works or efforts in the past in the context of Tamil speech enhancement in the literature. The aim of the proposed method is to reduce the background noise present in the Tamil speech signal by using wavelets. New modified thresholding function is introduced. The proposed method is evaluated on several speakers and under various noise conditions including White Gaussian noise, Babble noise and Car noise. The Signal to Noise Ratio (SNR), Mean Square Error (MSE) and Mean Opinion Score (MOS) results show that the proposed thresholding function improves the speech enhancement compared to the conventional hard and soft thresholding methods.
https://doi.org/10.5370/JEET.2016.11.6.1793 인용 PDF KSCI

로짓모형을 이용한 통신 서비스품질 평가방법 (Evaluation Method of Quality of Service in Telecommunications Using Logit Model)

조재균;안혜숙
- 산업공학
- /
- 제15권2호
- /
- pp.209-217
- /
- 2002
Quality of Service(QoS) in the telecommunications can be evaluated by analyzing the opinion data which result from the surveyed opinions of respondents and quantify subjective satisfaction on the QoS from the customers' viewpoints. For analyzing the opinion data, MOS(mean opinion score) method and Cumulative Probability Curve method are often used. The methods are based on the scoring method, and therefore, have the intrinsic deficiency due to the assignment of arbitrary scores. In this paper, we propose an analysis method of the opinion data using logit models which can be used to analyze the ordinal categorical data without assigning arbitrary scores to customers' opinion, and develop an analysis procedure considering the usage of procedures provided by SAS(Statistical Analysis System) statistical package. By the proposed method, we can estimate the relationship between customer satisfaction and network performance parameters, and provide guidelines for network planning. In addition, the proposed method is compared with Cumulative Probability Curve method with respect to prediction errors.
PDF KSCI

맞춤형 동화구연 시스템구연에 관한 연구 (A Study on the Fairy tale Narration System with Key-word Exchange)

박원;배명진
- 대한전자공학회:학술대회논문집
- /
- 대한전자공학회 2000년도 제13회 신호처리 합동 학술대회 논문집
- /
- pp.819-822
- /
- 2000
최근 유아기 아이들을 위한 교육매체의 발달로 각종 CD-ROM이나 테이프 등에서 성우의 목소리로 동화를 읽어주는 시스템이 많이 나와 있고, 또한 Web Book이 점차적으로 보편화가 되 가고 있다. 하지만 이런 획일적이고 균일화된 매체들은 아이들에게 금새 실증을 내게 하기 때문에 흥미 유발을 위해 동화의 주인공을 자기자신이나 친근한 사람의 이름 등으로 바꾸어 발성해 준다면 더욱 친근감 있게 받아들일 것이다. 본 논문에서는 기존의 성우가 발성하는 동화의 주인공 이름을 Test화자가 새로운 이름으로 발성을 해주면 기존 성우의 목소리패턴으로 바꾸어 동화를 읽어주는 시스템에 대해서 제안하고자 한다. 우선 Test화자가 발성한 목소리를 성우의 목소리로 바꾸어 주기 위해서 기존의 성우가 발성한 동화주인공 이름과 Test화자가 발성한 이름과의 운율패턴을 비교하여 성우의 운율패턴에 일치시키고 성우의 목소리 패턴으로 변경된 새로운 주인공의 이름만을 기존의 동화 DB에 삽입하였다. 또한 에너지 패턴조절은 기존의 성우가 발성한 기준패턴에 근사화 시켰고 끝점을 스므딩 시킴으로써 자연스런 발성이 되게 만들어주었다. 결과적으로 Mos Score가 3.873로 비교적 좋은 결과를 얻을 수 있었다.
PDF

아날로그 셀룰라 시스템을 위한 자동 음질 평가기 개발 (Development of an Automatic Speech Quality Evaluator for Analog Cellular System)

박상욱;최용수;정성교;윤대희;이충용
- 한국음향학회지
- /
- 제17권7호
- /
- pp.28-35
- /
- 1998
본 논문에서는 아날로그 이동 전화 환경에서의, 객관적인 음질 평가 척도를 사용하 여 주관적 음질을 추정하는 이동전화 자동 음질평가 시스템을 개발하였다. 이동전화의 통화 품질을 유지하기 위해서는 이동전화의 네트워크를 계속하여 체크하는 것이 매우 중요하다. 주관적 음질 평가는 사람의 체감을 직접 나타내는 것이므로 실제적인 음질을 평가하는데 중 요한 척도가 되지만, 인력과 시간이 많이 소모되므로 다양한 지역에서 지속적으로 음질을 평가하는데 부적절하다. 이러한 문제를 해결하기 위하여 객관적 음질평가 척도를 이용하여 주관적 음질 평가 척도를 예측하는 자동 음질 평가 시스템이 필수적이다. 반복된 실험을 통 하여 BSD(Bark Spectral Distance)가 주관적 음질 평가 척도와 높은 상관관계가 있음을 확 인하였으며 원래의 음성과 이동 전화 채널을 통과한 음성과의 BSD를 측정한 후 이를 바탕 으로 MOS(Mean Opinion Score)를 추정하는 자동 음질 평가 시스템(Automatic Speech Quality Evaluator)을 개발하였다.
PDF

Speech Quality of a Sinusoidal Model Depending on the Number of Sinusoids

Seo, Jeong-Wook;Kim, Ki-Hong;Seok, Jong-Won;Bae, Keun-Sung
- 음성과학
- /
- 제7권1호
- /
- pp.17-29
- /
- 2000
The STC(Sinusoidal Transform Coding) is a vocoding technique that uses a sinusoidal speech model to obtain high- quality speech at low data rate. It models and synthesizes the speech signal with fundamental frequency and its harmonic elements in frequency domain. To reduce the data rate, it is necessary to represent the sinusoidal amplitudes and phases with as small number of peaks as possible while maintaining the speech quality. As a basic research to develop a low-rate speech coding algorithm using the sinusoidal model, in this paper, we investigate the speech quality depending on the number of sinusoids. By varying the number of spectral peaks from 5 to 40 speech signals are reconstructed, and then their qualities are evaluated using spectral envelope distortion measure and MOS(Mean Opinion Score). Two approaches are used to obtain the spectral peaks: one is a conventional STFT (Short-Time Fourier Transform), and the other is a multiresolutional analysis method.
PDF

Spline 코드북 기반의 spectral folding을 이용한 대역폭 확장 방법 (Bandwidth Expansion Method Using Spline Codebook Based Spectral Folding)

박지훈;한승호;양희식;정상배;한민수
- 대한음성학회:학술대회논문집
- /
- 대한음성학회 2006년도 추계학술대회 발표논문집
- /
- pp.131-134
- /
- 2006
Quality of narrowband speech $(0{\sim}4kHz)$ can be enhanced by the bandwidth expansion technique, by which the high- band components are estimated. This paper proposes the bandwidth expansion method using the spline codebook based spectral folding. For the performance evaluation, the PESQ(Perceptual Evaluation of Speech Quality) scores are measured as the objective measurement In addition, the MOS (Mean Opinion Score) and the preference tests are performed as the subjective measurement. The results show our proposed method outperforms the existing spline based one.
PDF

PROSODY CONTROL BASED ON SYNTACTIC INFORMATION IN KOREAN TEXT-TO-SPEECH CONVERSION SYSTEM

Kim, Yeon-Jun;Oh, Yung-Hwan
- 한국음향학회:학술대회논문집
- /
- 한국음향학회 1994년도 FIFTH WESTERN PACIFIC REGIONAL ACOUSTICS CONFERENCE SEOUL KOREA
- /
- pp.937-942
- /
- 1994
Text-to-Speech(TTS) conversion system can convert any words or sentences into speech. To synthesize the speech like human beings do, careful prosody control including intonation, duration, accent, and pause is required. It helps listeners to understand the speech clearly and makes the speech sound more natural. In this paper, a prosody control scheme which makes use of the information of the function word is proposed. Among many factors of prosody, intonation, duration, and pause are closely related to syntactic structure, and their relations have been formalized and embodied in TTS. To evaluate the synthesized speech with the proposed prosody control, one of the subjective evaluation methods-MOS(Mean Opinion Score) method has been used. Synthesized speech has been tested on 10 listeners and each listener scored the speech between 1 and 5. Through the evaluation experiments, it is observed that the proposed prosody control helps TTS system synthesize the more natural speech.
PDF

접합 왜곡의 최소화 과정이 포함된 음성합성기 (Text-to-Speech Synthesizer with the Process of Minimizing Concatenation Distortion)

박훈재;김상훈;정재호
- 한국음향학회지
- /
- 제17권4호
- /
- pp.38-44
- /
- 1998
대용량의 음성합성용 데이터베이스를 용이하게 구축하기 위해 음성인식 시스템을 이용한 음소 경계 분할이 이루어지고 있다. 그러나 자동 분할 결과를 직접 이용하여 합성음 을 생성할 경우 음소 경계 에러로 인하여 접합 왜곡이 많이 발생하게 된다. 이러한 문제를 해결하기 위해서, 본 연구에서는 단위 접합시 경계 에러를 고려하여 적합한 접합 위치를 찾 고자 하였다. 여기서 적합한 접합 위치는 스펙트럼의 불연속이 최소화된 접합점을 의미한다. 합성음에 대한 MOS(Mean Opinion Score) 테스트와 스펙트로그램(spectrogram)의 모양을 비교하므로써 제안된 방법의 성능을 평가하였다. 제안된 방법은 두 단계로 이루어져 있다. 첫째, 레퍼런스 패턴(reference pattern)과 두 개의 테스트 패턴(test pattern)을 선택하는 단 계와, 둘째, 앞과 뒤 테스트 패턴 사이의 적합한 접합위치를 찾는 단계이다. 본 연구에서는 패턴 사이의 스펙트로그램 비교를 위해 켑스트럼(cepstrum) 피라미터와 패턴 분류기 (pattern classifier)인 DTW(Dynamic Time Warping) 알고리즘을 사용하였다. 제안된 알고 리즘을 평가한 청취 테스트의 결과에서 제안된 알고리즘을 적용하여 합성된 합성음의 음질 이 자동 분절로 생성된 단위를 그대로 이용한 경우의 음질보다 우수함을 보였다.
PDF

팩시밀리 화상품질 측정에 관한 연구 (A Study on Testing Image Quality on Facsimile)

권세혁;황건
- 전자통신동향분석
- /
- 제8권4호
- /
- pp.157-162
- /
- 1993
본 연구는 아날로그 신호를 사용하는 공중교환 전화망과 접속되는 그룹 3(G3) 팩시밀리의 화상 품질을 측정하는 방법을 제시하였다. CCITT(현 ITU-TS) 표준시험 도표 No.2를 이용하여 전송된 화상에 대한 평가는 설문조사를 통해 평가되었고, 그것들은 MOS(Mean Opinion Score) 방법에 의해 계량화되었다. 설문지의 결과에 대한 상관 분석을 통해 문항을 하나의 종합 평가 문항으로 줄일 수 있음을 살펴보았다. 그리고 그 점수들의 평균들에 대한 차이를 분석함으로써 팩시밀리 화상 품질에 영향을 미치는 요인들의 유의성을 검정하였다. 유의성을 검정하는 방법들로 t 검정법과 Vander Waerden Scores 방법을 제시하였다. 그리고 검정 결과 점수 평균이 유의하지 않은 그룹들을 하나의 그룹으로 하여 그 그룹에 있어서 점수 히스토그램을 구하였다. 이 히스토그램을 하나의 정규 분포 곡선으로 근사시켜 팩시밀리 화상 품질 평가치를 살펴보았다.
https://doi.org/10.22648/ETRI.1993.J.080413 인용 PDF

소음 환경에서 강인한 어학용 헤드폰 구현 (The implementation of the Language-Study-Headphone storng to Noise Environment)

손재혁;신재호
- 한국정보통신설비학회:학술대회논문집
- /
- 한국정보통신설비학회 2005년도 하계학술대회
- /
- pp.397-405
- /
- 2005
This paper presents a headphone system which has adopted two algorithm to increase sound clearness and to separate signal from noisy environment. In the field of adaptive signal processing, LMS algorithm which is a kind of steepest decent method, can be implemented with more simple calculation, so that we use it to eliminate unwanted noise elements for the proposed system. Futhermore we generate early echo using some delays, then mix it in signal. This process can increase the clearness of signal. In this paper, we prove that the proposed system can be implemented in real time. The proposed system is satisfied to subject assessment test base on MOS(Mean Opinion Score) of ITU-T.
PDF

검색결과 117건 처리시간 0.029초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)