• 제목/요약/키워드: Normalized noise energy

검색결과 40건 처리시간 0.018초

아동이 산출한 치조마찰음 /ㅅ/에 대한 청지각적·음향학적 연구 (A perceptual and acoustical study of /ㅅ/ in children's speech)

  • 김지연;성철재
    • 말소리와 음성과학
    • /
    • 제10권3호
    • /
    • pp.41-48
    • /
    • 2018
  • This study examined the acoustic characteristics of Korean alveolar fricatives of normal children. Developing children aged 3 and 7, typically produced 2 types of nonsense syllables containing alveolar fricative /sV/ and /VsV/ sequences where V was any one of three corner vowels (/i, a, and u/). Stimuli containing the speech materials used in a production experiment were presented randomly to 12 speech language pathologists (SLPs) for a perception test. The SLPs responded by selecting one of seven alternative sounds. Acoustic measures such as duration of frication noise, normalized intensity, skewness, and center of gravity were examined. There was significant difference in acoustic measures when comparing vowels. Comparison of syllable structures indicated statistically significant differences in duration of frication noise and normalized intensity. Acoustic parameters could account for the perceptual data. Relating the acoustic and perception data by means of logistic regression suggests that duration of frication noise and normalized intensity are the primary cues to perceiving Korean fricatives.

음성 에너지 최대화와 묵음 특징 정규화를 이용한 잡음 환경에 강인한 음성 검출 (Voice Activity Detection in Noisy Environment using Speech Energy Maximization and Silence Feature Normalization)

  • 안찬식;최기호
    • 디지털융복합연구
    • /
    • 제11권6호
    • /
    • pp.169-174
    • /
    • 2013
  • 음성 인식 성능 저하의 문제는 모델 훈련 환경과 인식 환경의 차이이다. 이러한 환경의 불일치를 줄이기 위한 방법으로 다양한 묵음 특징 정규화 방법을 사용하고 있다. 기존의 묵음 특징 정규화 방법은 낮은 신호 대 잡음비에서 묵음 구간의 에너지 레벨이 증가하여 음성과 비음성에 대한 분류의 정확도가 떨어짐으로 인해 인식 성능이 저하되는 문제점이 있다. 본 논문에서는 음성 에너지 최대화와 묵음 특징 정규화를 이용한 잡음 환경에 강인한 음성 검출 방법을 제안하였다. 제안한 방법은 높은 신호 대 잡음비에서는 음성 에너지를 최대화시켜 특징이 잡음의 영향을 적게 받는 특성을 이용하였고 낮은 신호 대 잡음비에서는 음성/비음성의 켑스트럼 특징 분포 특성을 이용하여 인식 성능을 향상시켰다. 인식 실험 결과 기존 방법에 비해 향상된 인식 성능을 확인할 수 있었다.

에너지와 인근 피치간에 유사도를 이용한 잡음레벨 검출에 관한 연구 (A Study on the Noise-Level Measurement Using the Energy and Relation of Closed Pitch)

  • 강인규;이기영;배명진
    • 음성과학
    • /
    • 제11권3호
    • /
    • pp.157-164
    • /
    • 2004
  • Human has average pitch-level when speak naturally. That is 'Habitual pitch level'. However, if noise added at speech, the pitch-wave is changed irregularly. We can estimate noise level of speech by using this point. This paper calculates energy level of the input speech, pitch period from of above limited energy level by NAMDF (Normalized Average Magnitude Difference Function) method, after cut each frame by pitch period unit, and propose a method that estimate noise level through closed pitch of input speech.

  • PDF

Effect of Double Noise-Barrier on Air Pollution Dispersion around Road, Using CFD

  • Jeong, Sang Jin
    • Asian Journal of Atmospheric Environment
    • /
    • 제8권2호
    • /
    • pp.81-88
    • /
    • 2014
  • Noise-barriers on both sides of the roadway (hereafter referred to as double noise-barriers), are a common feature along roads in Korea, and these are expected to have important effects on the near-road air pollution dispersion of vehicle emissions. This study evaluated the double noise-barrier impact on near-road air pollution dispersion, using a FLUENT computational fluid dynamics (CFD) model. The realizable k-${\varepsilon}$ model in FLUENT CFD code was used to simulate vehicle air pollutant dispersion, in around 11 cases of double noise-barriers. The simulated concentration profiles and surface concentrations under no barrier cases were compared with the experimental results. The results of the simulated flows show the following three regimes in this study: isolated roughness (H/W=0.05), wake interface (H/W=0.1), and skimming flow (H/W>0.15). The results also show that the normalized average concentrations at surface (z=1 m) between the barriers increase with increasing double noise-barrier height; however, normalized average concentrations at the top position between the barriers decrease with increasing barrier height. It was found that the double noise-barrier decreases normalized average concentrations of leeward positions, ranging from 0.8 (H/W=0.1, wake interface) to 0.1 (H/W=0.5, skimming flow) times lower than that of the no barrier case, at 10 x/h downwind position; and ranging from 1.0 (H/W=0.1) to 0.4 (H/W=0.5) times lower than that of the no barrier case, at 60 x/h downwind position.

EVRC 음성부호화기의 잡음억제단을 이용한 수중 천이신호 검출 (Detection of Underwater Transient Signals Using Noise Suppression Module of EVRC Speech Codec)

  • 김태환;배건성
    • 한국음향학회지
    • /
    • 제26권6호
    • /
    • pp.301-305
    • /
    • 2007
  • 본 연구에서 관심을 갖는 수중환경에서의 천이소음도 가청주파수 대역임에 착안하여 이동통신 시스템의 표준 음성부호화기인 EVRC의 전처리단을 이용하여 배경잡음 구간을 판별하고 이를 바탕으로 수중 천이신호를 탐지하는 알고리즘을 제안하였다. EVRC 전처리 모듈에 프레임 단위의 입력신호가 들어가면 모듈로부터 잡음 구간을 결정하는 플래그, 각 채널의 에너지, 잡음이 제거된 신호, 입력신호의 에너지, 배경잡음의 에너지, 잡음이 제거된 신호의 에너지에 해당하는 파라미터 값을 얻게 된다. 잡음이 제거된 신호의 에너지를 배경잡음의 에너지로 정규화하고 이를 문턱값과 비교함으로써 천이신호를 검출할 수 있다. 또한 문턱 값은 잡음 구간에서 구한 이전 값을 이용해서 갱신된다. 실험결과를 통해 제안한 알고리즘이 백색 또는 유색잡음 하에서도 4% 미만의 오차를 보여주는 것을 검증한다.

일반학생과 성악도를 대상으로 Dr. Speech의 음향학적 측정치와 EGG 측정치의 상관관계 비교 연구 (A Correlation Study between Acoustic and EGG Parameters in Ordinary College Students and Classical Singing Students)

  • 안종복;유재연;권도하;정옥란
    • 대한후두음성언어의학회지
    • /
    • 제13권1호
    • /
    • pp.28-32
    • /
    • 2002
  • Background and Objective : Classical singing students who have received in systematic voice training appeared distinctive voice characteristics compared to normal people who have not received in systematic voice training. The purpose of this study was to determine the correlation between acoustic parameters and Electroglottography(EGG) parameters in two groups(ordinary college students vs. classical singing students group). Materials and Methods : The 80 ordinary college students and 65 classical singing students participated in this study by utilizing Dr. speech program to obtain acoustic measurements and physiologic measurements simultaneously. The Pearson correlation coefficient was used to find the correlation between acoustic parameters and EGG parameters in two groups(ordinary college students group and classical singing students group). Results : The results of the study were as follows : First, there was no correlation between Jitter and EGG Jitter in ordinary college students group, but there was strong correlation between Jitter and EGG Jitter in classical singing students group. Second, there was no correlation between Shimmer and EGG Shimmer in ordinary college students group, but there was strong correlation between Shimmer and EGG Shimmer in classical singing students group. Third, there was no correlation between Harmonic to Noise Ratio(HNR) and EGG HNR in ordinary college students group, but there was strong correlation between HNR and EGG HNR in classical singing students group. Finally, there was no correlation between Normalized Noise Energy(NNE) and EGG NNE in two groups.

  • PDF

Improved Maximum Access Delay Time, Noise Variance, and Power Delay Profile Estimations for OFDM Systems

  • Wang, Hanho;Lim, Sungmook;Ko, Kyunbyoung
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제16권12호
    • /
    • pp.4099-4113
    • /
    • 2022
  • In this paper, we propose improved maximum access delay time, noise variance, and power delay profile (PDP) estimation schemes for orthogonal frequency division multiplexing (OFDM) system in multipath fading channels. To this end, we adopt the approximate maximum likelihood (ML) estimation strategy. For the first step, the log-likelihood function (LLF) of the received OFDM symbols is derived by utilizing only the cyclic redundancy induced by cyclic prefix (CP) without additional information. Then, the set of the initial path powers is sub-optimally obtained to maximize the derived LLF. In the second step, we can select a subset of the initial path power set, i.e. the maximum access delay time, so as to maximize the modified LLF. Through numerical simulations, the benefit of the proposed method is verified by comparison with the existing methods in terms of normalized mean square error, erroneous detection, and good detection probabilities.

Energy Detector based Time of Arrival Estimation using a Neural Network with Millimeter Wave Signals

  • Liang, Xiaolin;Zhang, Hao;Gulliver, T. Aaron
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제10권7호
    • /
    • pp.3050-3065
    • /
    • 2016
  • Neural networks (NNs) are extensively used in applications requiring signal classification and regression analysis. In this paper, a NN based threshold selection algorithm for 60 GHz millimeter wave (MMW) time of arrival (TOA) estimation using an energy detector (ED) is proposed which is based on the skewness, kurtosis, and curl of the received energy block values. The best normalized threshold for a given signal-to-noise ratio (SNR) is determined, and the influence of the integration period and channel on the performance is investigated. Results are presented which show that the proposed NN based algorithm provides superior precision and better robustness than other ED based algorithms over a wide range of SNR values. Further, it is independent of the integration period and channel model.

화자 확인을 위한 다중대역에 기반한 주성분 분석 공분산 모델 (PCA Covariance Model Based on Multiband for Speaker Verification)

  • 최민정;이윤정;서창우
    • 음성과학
    • /
    • 제14권2호
    • /
    • pp.127-135
    • /
    • 2007
  • Feature vectors of speech are generally extracted from whole frequency domain. The inherent character of a speaker is located in the low band or high band frequency. However, if the speech is corrupted by narrowband noise with concentrated energy, speaker verification performance is reduced as the individual characteristic is removed. In this paper, we propose a PCA Covariance Model based on the multiband to extract the robust feature vectors against the narrowband noise. First, we divide the overall frequency band into several subbands. Second, the correlation of feature vectors extracted independently from each subband is removed by PCA. The distance obtained from each subband has different distribution. To normalize against the different distribution, we moved the value into the normalized distribution through the mapping function. Finally, the represented value applying the weighting function is used for speaker verification. In the experiments, the proposed method shows better performance of the speaker verification and reduces the computation.

  • PDF

PZT actuator를 이용한 외팔보의 능동진동제어 (Active control of vibration of cantilever beams using PZT actuators)

  • 신창주;홍진숙;정의봉
    • 한국소음진동공학회:학술대회논문집
    • /
    • 한국소음진동공학회 2008년도 추계학술대회논문집
    • /
    • pp.247-252
    • /
    • 2008
  • This paper presents an active vibration control of cantilever beams under disturbances by a primary force. A direct velocity feedback control using a pair of PZT actuator and a velocity sensor is considered. Variation of the stability and performance with the locations of the sensor/actuator pair is investigated. It is found that the maximum gain varies with the locations of the sensor/actuator pair significantly. The maximum gain shows a symmetric distribution along the beam length with respect to the center point, although the boundary condition of the beam is unsymmetric. The control performance is affected by the location of the primary force as well as the location of the sensor/actuator pair. The active control system can more effectively reduce the vibration when the primary force is located close to the fixed boundary.

  • PDF