Search | Korea Science

On the Classification of the Pathological Speech (장애음성의 분류방법에 관한 연구)

김대현
- Proceedings of the Acoustical Society of Korea Conference / 1998.08a / pp.388-391 / 1998
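We propose a method for diagnosing and identifying pathological speech using parameters obtained from jitter, shimmer, and cepstrum-based source analysis. First, the parameters that are effective for identification are selected on the basis of statistical processing results, and these parameters are then used for the final diagnosis. A neural network serves as the identification method. The input parameters are jitter, shimmer, and HNRR. The network is a 3-layer neural network with one hidden layer. Experimental results show that normal and pathological speech can be distinguished effectively.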
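The jitter and shimmer inputs named in the abstract above are commonly computed as cycle-to-cycle perturbation percentages. The following is a minimal sketch assuming the "local" definition (mean absolute difference between consecutive cycles, divided by the mean); the paper does not state its exact formulas, and the function name `perturbation_percent` and the sample values are hypothetical.

```python
def perturbation_percent(values):
    """Mean absolute difference between consecutive values,
    relative to the mean value, expressed in percent."""
    if len(values) < 2:
        raise ValueError("need at least two cycles")
    diffs = [abs(b - a) for a, b in zip(values, values[1:])]
    mean_diff = sum(diffs) / len(diffs)
    mean_value = sum(values) / len(values)
    return 100.0 * mean_diff / mean_value


# Hypothetical per-cycle measurements (not data from the paper).
periods_ms = [5.0, 5.1, 4.9, 5.0]   # glottal cycle lengths
amplitudes = [1.0, 0.9, 1.1, 1.0]   # per-cycle peak amplitudes

jitter = perturbation_percent(periods_ms)    # period perturbation
shimmer = perturbation_percent(amplitudes)   # amplitude perturbation
```

The same function serves both measures because, under this definition, jitter and shimmer differ only in whether cycle periods or cycle amplitudes are supplied.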

A study on the clinical utility of voiced sentences in acoustic analysis for pathological voice evaluation (장애음성의 음향학적 분석에서 유성음 문장의 임상적 유용성에 관한 연구)

Ji-sung Kim
- The Journal of the Acoustical Society of Korea / v.42 no.4 / pp.298-303 / 2023
This study investigated the clinical utility of voiced sentence tasks for voice evaluation. To this end, we analyzed the correlation between perturbation-based acoustic measurements [jitter percent (jitter), shimmer percent (shimmer), Noise to Harmonic Ratio (NHR)] obtained from sustained vowel phonation and cepstrum-based acoustic measurements [Cepstral Peak Prominence (CPP), Low/High spectral ratio (L/H ratio)] obtained from voiced sentences. In data collected from 65 patients with voice disorders, CPP correlated significantly with jitter (r = -.624, p = .000), shimmer (r = -.530, p = .000), and NHR (r = -.469, p = .000). This suggests that cepstral measurement of voiced sentences can serve as an alternative where pathological voice analysis runs into limitations, such as cases in which perturbation-based measurement is not possible or results differ depending on the analysis segment.
https://doi.org/10.7776/ASK.2023.42.4.298

Normalization of Spectral Magnitude and Cepstral Transformation for Compensation of Lombard Effect (롬바드 효과의 보정을 위한 스펙트럼 크기의 정규화와 켑스트럼 변환)

Chi, Sang-Mun;Oh, Yung-Hwan
- The Journal of the Acoustical Society of Korea / v.15 no.4 / pp.83-92 / 1996
This paper describes Lombard effect compensation and noise suppression for reducing speech recognition errors in noisy environments. The Lombard effect is represented by the variation of the spectral envelope of energy-normalized words and the variation of overall vocal intensity. The variation of the spectral envelope can be compensated by a linear transformation in the cepstral domain. The variation of vocal intensity is canceled by spectral magnitude normalization. Spectral subtraction is used to suppress noise contamination, and band-pass filtering is used to emphasize dynamic features. To understand the Lombard effect and verify the effectiveness of the proposed method, speech data were collected in simulated noisy environments. Recognition experiments were conducted with contamination by noise from automobile cabins, an exhibition hall, downtown telephone booths, crowded streets, and computer rooms. The experiments confirmed the effectiveness of the proposed method.

Study on the Performance of Spectral Contrast MFCC for Musical Genre Classification (스펙트럼 대비 MFCC 특징의 음악 장르 분류 성능 분석)

Seo, Jin-Soo
- The Journal of the Acoustical Society of Korea / v.29 no.4 / pp.265-269 / 2010
This paper proposes a novel spectral audio feature, spectral contrast MFCC (SCMFCC), and studies its performance on musical genre classification. For a successful musical genre classifier, it is crucial to extract features that allow direct access to the relevant genre-specific information. In this regard, features based on spectral contrast, which represents the relative distribution of harmonic and non-harmonic components, have received increased attention. The proposed SCMFCC feature applies spectral contrast to the mel-frequency cepstrum and thus adapts the conventional MFCC in a way that is more relevant to musical genre classification. By performing classification tests on a widely used music database, we compare the performance of the proposed feature with that of previous ones.
https://doi.org/10.7776/ASK.2010.29.4.265

Cepstrum analysis on the chatter vibration generated by the machine tool (공작기계의 채터진동에 대한 켑스트럼 분석)

김명구;최봉학;이흥식;조종두
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2004.05a / pp.77-82 / 2004
There has been much research on the chatter vibration that occurs in the cutting process of machine tools. However, research remains insufficient on the frequency of chatter vibration, its characteristics, and its nonlinear properties. This paper measured vibration signals before, immediately after, and after the chatter vibration. These signals were analyzed through the autospectrum obtained by the Fast Fourier Transform (FFT). The nonlinear characteristics were then analyzed by cepstrum analysis through the FFT of the autospectrum.
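Cepstrum analysis of the kind mentioned in the abstract above can be illustrated with the real cepstrum, that is, the inverse transform of the log magnitude spectrum. The sketch below is an assumption-laden illustration rather than the authors' procedure: it uses a naive DFT from the standard library instead of an FFT, and the test signal (an impulse plus one echo) and all names are hypothetical.

```python
import cmath
import math


def dft(x, inverse=False):
    """Naive O(N^2) discrete Fourier transform (adequate for short demos)."""
    n = len(x)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = sum(x[t] * cmath.exp(sign * 2j * math.pi * k * t / n)
                  for t in range(n))
        out.append(acc / n if inverse else acc)
    return out


def real_cepstrum(x):
    """Inverse DFT of the log magnitude spectrum of x."""
    log_mag = [math.log(abs(v)) for v in dft(x)]
    return [v.real for v in dft(log_mag, inverse=True)]


# Hypothetical test signal: an impulse plus an echo 8 samples later.
N, DELAY = 64, 8
signal = [0.0] * N
signal[0] = 1.0
signal[DELAY] = 0.5

cep = real_cepstrum(signal)
# Search the lower half of the quefrency axis, skipping index 0.
peak = max(range(1, N // 2), key=lambda q: cep[q])
```

In this toy case the largest cepstral value falls at the quefrency equal to the echo delay, which is the property that makes the cepstrum useful for exposing periodic structure in a spectrum.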

A Signal Processing Technique for Predictive Fault Detection based on Vibration Data (진동 데이터 기반 설비고장예지를 위한 신호처리기법)

Song, Ye Won;Lee, Hong Seong;Park, Hoonseok;Kim, Young Jin;Jung, Jae-Yoon
- The Journal of Society for e-Business Studies / v.23 no.2 / pp.111-121 / 2018
Many problems in rotating machinery such as aircraft engines, wind turbines, and motors are caused by bearing defects. Bearing abnormalities can be detected by analyzing signal data such as vibration or noise, but analyzing their frequencies requires proper pre-processing with several signal processing techniques. In this paper, we introduce a condition monitoring method that diagnoses failures of rotating machines by analyzing the vibration signal of the bearing. The normal state is learned from the collected signal data, and normal or abnormal state data are then classified against the trained normal state. For preprocessing, a Hamming window is applied to eliminate the leakage generated in this process, and cepstrum analysis is performed to obtain the original signal of the signal data, called the formant. From the vibration data of the IMS bearing dataset, we extracted six statistical indicators using the cepstral coefficients and showed that applying the Mahalanobis distance classifier can monitor the bearing status and detect failure in advance.
https://doi.org/10.7838/jsebs.2018.23.2.111
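The idea of training on the normal state and flagging data by Mahalanobis distance, as described in the abstract above, can be sketched as follows. This is a simplified illustration under stated assumptions: it uses two features instead of the six indicators in the paper, a closed-form 2x2 covariance inverse, and a hypothetical threshold and sample data; none of these come from the source.

```python
import math


def fit_normal_state(samples):
    """Estimate the mean and covariance of 2-D feature vectors
    taken from normal-state data."""
    n = len(samples)
    mx = sum(s[0] for s in samples) / n
    my = sum(s[1] for s in samples) / n
    sxx = sum((s[0] - mx) ** 2 for s in samples) / (n - 1)
    syy = sum((s[1] - my) ** 2 for s in samples) / (n - 1)
    sxy = sum((s[0] - mx) * (s[1] - my) for s in samples) / (n - 1)
    return (mx, my), (sxx, sxy, syy)


def mahalanobis(point, mean, cov):
    """Mahalanobis distance using the closed-form inverse
    of a 2x2 covariance matrix."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy * sxy
    dx, dy = point[0] - mean[0], point[1] - mean[1]
    d2 = (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det
    return math.sqrt(d2)


# Hypothetical normal-state features (not from the IMS dataset).
normal = [(1.0, 2.0), (2.0, 1.0), (3.0, 4.0), (4.0, 3.0)]
mean, cov = fit_normal_state(normal)

THRESHOLD = 3.0  # assumed cut-off; a real system would tune this


def is_abnormal(point):
    return mahalanobis(point, mean, cov) > THRESHOLD
```

With these values, `is_abnormal((3.0, 3.0))` is false and `is_abnormal((10.0, 0.0))` is true, since the second point lies far from the normal cluster once the feature correlation is accounted for.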

Implementation of the Voice Conversion in the Text-to-speech System (Text-to-speech 시스템에서의 화자 변환 기능 구현)

Hwang Cholgyu;Kim Hyung Soon
- Proceedings of the Acoustical Society of Korea Conference / autumn / pp.33-36 / 1999
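Conventional text-to-speech (TTS) synthesis produces monotonous synthetic speech tied to a predetermined speaker. To overcome this problem, this paper implements a voice conversion function that can express the timbre of an arbitrary speaker. The implemented method models the speaker's acoustic space with a Gaussian Mixture Model (GMM), which enables voice conversion based on a continuous probability distribution. Using the joint density function of the feature vectors of the source and target speakers, we derived a conversion function that minimizes the squared error between the target speaker's acoustic-space feature vectors and the converted vectors, and used this function to convert the spectral envelope by vector mapping. For prosody conversion, the speech signal was modeled with a sinusoidal model, and the analyzed prosodic information (pitch, duration) was converted with the mean values taken into account. For performance evaluation, a VQ mapping method was also implemented, and the normalized cepstral distance of each method was computed and compared. For synthesis, an ABS-OLA-based sinusoidal modeling method was adopted, which produced natural-sounding synthetic speech.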

Real-Time Recognition of the Korean Single Vowels Using the Speech Spectrum Analysis (음성 스펙트럼 분석에 의한 한국어 단모음 실시간 인식)

김엄준;성미영
- Proceedings of the Korea Multimedia Society Conference / 1998.10a / pp.226-231 / 1998
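In this study, we used the zero crossing rate, short-term energy, and formants as parameters that can be computed quickly and that characterize speech. Speech from a specific speaker was taken as input, and these three parameters were measured in order to recognize the single vowels 'ㅏ, ㅐ, ㅓ, ㅔ, ㅗ, ㅜ, ㅡ, ㅣ'. The zero crossing rate and short-term energy were used to distinguish voiced from unvoiced sounds and to decide whether the input was speech. The formant parameters were obtained from a 10th-order cepstrum and were used to discriminate the individual vowels. Taking one single vowel as input and outputting it as text takes 0.065 sec on average, and the recognition rate is 93% for individual single vowels and 72% for 10 test sentences.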
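The zero crossing rate and short-term energy named in the abstract above are simple per-frame statistics. The following is a minimal sketch under common textbook definitions, which the abstract does not spell out; the function names and the two sample frames are hypothetical.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(1 for a, b in zip(frame, frame[1:])
                    if (a >= 0) != (b >= 0))
    return crossings / (len(frame) - 1)


def short_term_energy(frame):
    """Mean squared amplitude of the frame."""
    return sum(s * s for s in frame) / len(frame)


# Hypothetical frames: a slow, strong oscillation and a fast, weak one.
voiced_like = [1.0, 1.0, 1.0, 1.0, -1.0, -1.0, -1.0, -1.0]
unvoiced_like = [0.1, -0.1, 0.1, -0.1, 0.1, -0.1, 0.1, -0.1]

zcr_voiced = zero_crossing_rate(voiced_like)
zcr_unvoiced = zero_crossing_rate(unvoiced_like)
energy_voiced = short_term_energy(voiced_like)
energy_unvoiced = short_term_energy(unvoiced_like)
```

The voiced-like frame shows a low zero crossing rate with high energy and the unvoiced-like frame shows the opposite, which is the contrast that a voiced/unvoiced decision of this kind typically relies on.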

Impact Noise Source Localization in Noise (잡음 속에 묻힌 충격 소음원 위치 추정)

최영철;김양한
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2004.05a / pp.774-779 / 2004
This paper addresses how to locate impact noise sources, specifically in the case where the signal is embedded in noise. We propose a signal processing method that can identify the location of impulsive sources. The method is robust to spatially distributed noise. This is achieved by applying a beamforming method in the cepstrum domain. It is noteworthy that the cepstrum can detect a periodic pulse signal in noise. Numerical simulations and experiments were performed to verify the method. The results show that the proposed technique is quite powerful for localizing faults in noisy environments. The method also requires fewer microphones than the conventional beamforming method.

Impulsive Source Localization in Noise (잡음 속에 묻힌 임펄스 소음원 위치 추정)

Kim Yang-Hann;Choi Young-Chul
- Transactions of the Korean Society for Noise and Vibration Engineering / v.14 no.9 s.90 / pp.877-883 / 2004
This paper addresses how to locate impulsive noise sources, specifically in the case where the signal is embedded in noise. We propose a signal processing method that can identify the location of impulsive sources. The method is robust to spatially distributed noise. This is achieved by applying a modified beamforming method in the cepstrum domain. It is noteworthy that the cepstrum can detect a periodic pulse signal in noise. Numerical simulations and experiments were performed to verify the method. The results show that the proposed technique is quite powerful for localizing faults in noisy environments. The method also requires fewer microphones than the conventional beamforming method.
https://doi.org/10.5050/KSNVN.2004.14.9.877

Search Result 58, Processing Time 0.022 seconds
