Search | Korea Science

Implementation of Voice Source Simulator Using Simulink (Simulink를 이용한 음원모델 시뮬레이터 구현)

Jo, Cheol-Woo;Kim, Jae-Hee
- Phonetics and Speech Sciences
- /
- v.3 no.2
- /
- pp.89-96
- /
- 2011
In this paper, details of the design and implementation of a voice source simulator using Simulink and Matlab are discussed. This simulator is an implementation by model-based design concept. Voice sources can be analyzed and manipulated through various factors by choosing options from GUI input and selecting pre-defined blocks or user created ones. This kind of simulation tool can simplify the procedure of analyzing speech signals for various purposes such as voice quality analysis, pathological voice analysis, and speech coding. Also, basic analysis functions are supported to compare the original signal and the manipulated ones.
PDF

Simulation of speech processing and coding strategy for cochlear implants (인공 청각 장치의 음성신호 처리와 자극방법의 시뮬레이션)

Kim, Young-Hoon;Park, Kwang-Suk
- Proceedings of the KOSOMBE Conference
- /
- v.1991 no.11
- /
- pp.30-33
- /
- 1991
The object of speech processor for cochlear implants is to deliver speech information to the central nerve system. In this study we have presented the method which simulate speech processing and coding strategy for cochlear implants and simulated two different processing methods to the 12 adults with normal ears. The formant sinusoidal coding was better than the formant pulse coding In the consonant perception test and learning effects.(p < 0.05)
PDF

An Enhanced Clarity of Husky Voice by Dissonant Frequency Filtering

Kang, Sang-Ki;Baek, Seong-Joon
- Speech Sciences
- /
- v.12 no.4
- /
- pp.71-76
- /
- 2005
There have been numerous studies on the enhancement of noisy speech signal. In this paper, we propose a new speech enhancement method, that is, a filtering of a dissonant frequency combined with noise suppression algorithm. The simulation results indicate that the proposed method provides a significant gain in voice clarity. Therefore if the proposed enhancement scheme is used as a pre-filter, the perceptual clarity of husky voice is greatly enhanced.
PDF

Application of sinusoidal model to perception of electrical hearing in cochlear implants (인공와우 전기 청각 인지에 대한 정현파 모델 적용에 관한 연구)

Lee, Sungmin
- The Journal of the Acoustical Society of Korea
- /
- v.41 no.1
- /
- pp.52-57
- /
- 2022
Speech consists of the sum of complex sine-waves. This study investigated the perception of electrical hearing by applying the sinusoidal model to cochlear implant simulation. Fourteen adults with normal hearing participated in this study. The sentence recognition tests were implemented using the sentence lists processed by the sinusoidal model which extracts 2, 4, 6, 8 sine-wave components and sentence lists processed by the same sinusoidal model along with cochlear implant simulation (8 channel vocoders). The results showed lower speech recognition for the sentence lists processed by the sinusoidal model and cochlear implant simulation compared to those by the sinusoidal model alone. Notably, the lower the number of sine-wave components (2), the larger the difference was. This study provides the perceptual pattern of sine-wave speech for electrical hearing by cochlear implant listeners, and basic data for development of speech processing algorithms in cochlear implants.
https://doi.org/10.7776/ASK.2022.41.1.052 인용 PDF KSCI

A New Speech Enhancement Method Using Adaptive Digital Filter (적응디지털필터를 사용한 음질향상 방법)

임용훈;김완구;차일환;윤대희
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.30B no.10
- /
- pp.35-41
- /
- 1993
In this paper, a new speech enhancement method for speech signal corrupted by environmental noise is proposed. Two signals are obtained from the microphone and from the accelerometer attached to the neck, respectively. Since two signals are generated from same source signal, both signals are closely correlated. And environmental noise has no effect on the accelerometer signal. The speech enhancement system identifies the optimum linear system between two signals on the basis of the dependence between the signals. The enhanced speech can be obtained by filtering the noise-free accelerometer signal. Since the characteristcs of the speech signal and environmental noise are changing with time, adaptive filtering system has to be used for characterizing the time-varing system. Simulation results show 7dB enhancement with 0dB speech signal level relative to the white noise.
PDF

A Low Bit Rate Speech Coder Based on the Inflection Point Detection

Iem, Byeong-Gwan
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.15 no.4
- /
- pp.300-304
- /
- 2015
A low bit rate speech coder based on the non-uniform sampling technique is proposed. The non-uniform sampling technique is based on the detection of inflection points (IP). A speech block is processed by the IP detector, and the detected IP pattern is compared with entries of the IP database. The address of the closest member of the database is transmitted with the energy of the speech block. In the receiver, the decoder reconstructs the speech block using the received address and the energy information of the block. As results, the coder shows fixed data rate contrary to the existing speech coders based on the non-uniform sampling. Through computer simulation, the usefulness of the proposed technique is shown. The SNR performance of the proposed method is approximately 5.27 dB with the data rate of 1.5 kbps.
https://doi.org/10.5391/IJFIS.2015.15.4.300 인용 PDF KSCI

Automatic Clustering of Speech Data Using Modified MAP Adaptation Technique (수정된 MAP 적응 기법을 이용한 음성 데이터 자동 군집화)

Ban, Sung Min;Kang, Byung Ok;Kim, Hyung Soon
- Phonetics and Speech Sciences
- /
- v.6 no.1
- /
- pp.77-83
- /
- 2014
This paper proposes a speaker and environment clustering method in order to overcome the degradation of the speech recognition performance caused by various noise and speaker characteristics. In this paper, instead of using the distance between Gaussian mixture model (GMM) weight vectors as in the Google's approach, the distance between the adapted mean vectors based on the modified maximum a posteriori (MAP) adaptation is used as a distance measure for vector quantization (VQ) clustering. According to our experiments on the simulation data generated by adding noise to clean speech, the proposed clustering method yields error rate reduction of 10.6% compared with baseline speaker-independent (SI) model, which is slightly better performance than the Google's approach.
https://doi.org/10.13064/KSSS.2014.6.1.077 인용 PDF KSCI

A Single-Channel Speech Dereverberation Method Using Sparse Prior Imposition in Reverberation Filter Estimation (반향 필터 추정에서 성김 특성을 이용한 단일채널 음성반향제거 방법)

Zee, Min-Seon;Park, Hyung-Min
- Phonetics and Speech Sciences
- /
- v.5 no.4
- /
- pp.227-232
- /
- 2013
Since a reverberation filter is generally much shorter than the corresponding dereverberation filter, a single-channel speech dereverberation method based on reverberation filter estimation has been developed to improve its performance. Unfortunately, a typical reverberation filter still requires too many coefficients to be accurately estimated using limited speech observations. In order to exploit sparseness of reverberation filter coefficients, in this paper, we present an algorithm to impose a sparse prior to the process of reverberation filter estimation. Simulation results demonstrate that the sparse prior imposition further improves performance of the speech dereverberation method based on reverberation filter estimation.
https://doi.org/10.13064/KSSS.2013.5.4.227 인용 PDF

Performance Evaluation of Novel AMDF-Based Pitch Detection Scheme

Kumar, Sandeep
- ETRI Journal
- /
- v.38 no.3
- /
- pp.425-434
- /
- 2016
A novel average magnitude difference function (AMDF)-based pitch detection scheme (PDS) is proposed to achieve better performance in speech quality. A performance evaluation of the proposed PDS is carried out through both a simulation and a real-time implementation of a speech analysis-synthesis system. The parameters used to compare the performance of the proposed PDS with that of PDSs that are based on either a cepstrum, an autocorrelation function (ACF), an AMDF, or circular AMDF (CAMDF) methods are as follows: percentage gross pitch error (%GPE); a subjective listening test; an objective speech quality assessment; a speech intelligibility test; a synthesized speech waveform; computation time; and memory consumption. The proposed PDS results in lower %GPE and better synthesized speech quality and intelligibility for different speech signals as compared to the cepstrum-, ACF-, AMDF-, and CAMDF-based PDSs. The computational time of the proposed PDS is also less than that for the cepstrum-, ACF-, and CAMDF-based PDSs. Moreover, the total memory consumed by the proposed PDS is less than that for the ACF- and cepstrum-based PDSs.
https://doi.org/10.4218/etrij.16.0115.0926 인용 PDF KSCI

A Study on the Development of the Real-Time G.723.1 Speech Codec Using a Fixed-Point DSP(ADSP-2181) (고정소수점 DSP(ADSP-2181)을 이용한 실시간 G.723.1 음성부호화기 개발에 관한 연구)

Park, Jung-Jae;Chung, Ik-Joo
- Speech Sciences
- /
- v.3
- /
- pp.177-186
- /
- 1998
This paper describes the procedure of implementing a real-time speech codec, G.723.1 which was developed by DSP Group and standardized by ITU-T, using fixed-point DSP, ADSP-2181. This codec has two bit rates associated with it, 5.3 and 6.3 kbit/s. We implemented only one bit rate, 6.3 kbit/s, of the two with fixed-point 32-bit precision. According to the result of the experiment, the amount of computational burden is about 55 MIPS and its quality is similar to the result of the PC simulation with floating-point arithmetic. In this paper, we proposed a method to use a fixed-point DSP and a procedure for developing a real-time speech codec using DSPs and finally developed a G.723.l speech codec for ADSP-2181.
PDF

Search Result 299, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)