• Title/Summary/Keyword: Normalized noise energy


A perceptual and acoustical study of /ㅅ/ in children's speech (아동이 산출한 치조마찰음 /ㅅ/에 대한 청지각적·음향학적 연구)

  • Kim, Jiyoun;Seong, Cheoljae
    • Phonetics and Speech Sciences / v.10 no.3 / pp.41-48 / 2018
  • This study examined the acoustic characteristics of Korean alveolar fricatives produced by typically developing children. Children aged 3 and 7 produced two types of nonsense syllables containing the alveolar fricative, in /sV/ and /VsV/ sequences, where V was one of the three corner vowels (/i/, /a/, and /u/). Stimuli drawn from the production experiment were presented in random order to 12 speech-language pathologists (SLPs) for a perception test. The SLPs responded by selecting one of seven alternative sounds. Acoustic measures such as duration of frication noise, normalized intensity, skewness, and center of gravity were examined. The acoustic measures differed significantly across vowels. Comparison of syllable structures showed statistically significant differences in duration of frication noise and normalized intensity. The acoustic parameters could account for the perceptual data: relating the acoustic and perception data by logistic regression suggests that duration of frication noise and normalized intensity are the primary cues for perceiving Korean fricatives.

Voice Activity Detection in Noisy Environment using Speech Energy Maximization and Silence Feature Normalization (음성 에너지 최대화와 묵음 특징 정규화를 이용한 잡음 환경에 강인한 음성 검출)

  • Ahn, Chan-Shik;Choi, Ki-Ho
    • Journal of Digital Convergence / v.11 no.6 / pp.169-174 / 2013
  • In speech recognition, performance degrades because of the mismatch between the model training environment and the recognition environment. Silence feature normalization is one way to reduce this mismatch. At low signal-to-noise ratios, however, existing silence feature normalization raises the energy level of the silence interval, which lowers the accuracy of voice/non-voice classification and degrades recognition performance. This paper proposes a speech detection method that is robust in noisy environments, using silence feature normalization and speech energy maximization. At high signal-to-noise ratios, the proposed method maximizes speech energy so that the features are less affected by noise. At low signal-to-noise ratios, it improves the cepstral feature distribution of voice and non-voice and thereby improves recognition performance. In recognition experiments, performance improved compared with the conventional method.

A Study on the Noise-Level Measurement Using the Energy and Relation of Closed Pitch (에너지와 인근 피치간에 유사도를 이용한 잡음레벨 검출에 관한 연구)

  • Kang, In-Gyu;Lee, Ki-Young;Bae, Myung-Jin
    • Speech Sciences / v.11 no.3 / pp.157-164 / 2004
  • Humans have an average pitch level when speaking naturally, known as the 'habitual pitch level'. When noise is added to speech, however, the pitch waveform changes irregularly, and this can be used to estimate the noise level of the speech. This paper calculates the energy level of the input speech, obtains the pitch period of the portions above a limiting energy level using the NAMDF (Normalized Average Magnitude Difference Function) method, segments each frame in units of the pitch period, and proposes a method that estimates the noise level from the similarity between adjacent ("closed") pitch periods of the input speech.
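The pitch-period step in this abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact NAMDF definition, so the normalization below (the summed magnitude difference divided by the summed magnitudes of the two compared segments) is an assumption, and the names `namdf` and `pitch_period` are hypothetical.

```python
import math


def namdf(frame, min_lag, max_lag):
    """Return {lag: normalized AMDF value} for each lag in [min_lag, max_lag].

    Assumed normalization: sum|x[n] - x[n+lag]| / sum(|x[n]| + |x[n+lag]|),
    which keeps every value in [0, 1] regardless of signal level.
    """
    curve = {}
    for lag in range(min_lag, max_lag + 1):
        n = len(frame) - lag
        diff = sum(abs(frame[i] - frame[i + lag]) for i in range(n))
        scale = sum(abs(frame[i]) + abs(frame[i + lag]) for i in range(n))
        curve[lag] = diff / scale if scale > 0 else 1.0
    return curve


def pitch_period(frame, min_lag, max_lag):
    """Pick the lag with the deepest NAMDF valley as the pitch period (in samples)."""
    curve = namdf(frame, min_lag, max_lag)
    return min(curve, key=curve.get)


# Synthetic check: a 100 Hz tone sampled at 8 kHz repeats every 80 samples.
fs = 8000
tone = [math.sin(2 * math.pi * 100 * n / fs) for n in range(800)]
period = pitch_period(tone, 20, 120)
```

On clean periodic input the valley at the true period is close to zero. Added noise makes the valley shallower and less regular from one period to the next, which is the effect the abstract proposes to exploit.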


Effect of Double Noise-Barrier on Air Pollution Dispersion around Road, Using CFD

  • Jeong, Sang Jin
    • Asian Journal of Atmospheric Environment / v.8 no.2 / pp.81-88 / 2014
  • Noise-barriers on both sides of the roadway (hereafter referred to as double noise-barriers), are a common feature along roads in Korea, and these are expected to have important effects on the near-road air pollution dispersion of vehicle emissions. This study evaluated the double noise-barrier impact on near-road air pollution dispersion, using a FLUENT computational fluid dynamics (CFD) model. The realizable k-${\varepsilon}$ model in FLUENT CFD code was used to simulate vehicle air pollutant dispersion, in around 11 cases of double noise-barriers. The simulated concentration profiles and surface concentrations under no barrier cases were compared with the experimental results. The results of the simulated flows show the following three regimes in this study: isolated roughness (H/W=0.05), wake interface (H/W=0.1), and skimming flow (H/W>0.15). The results also show that the normalized average concentrations at surface (z=1 m) between the barriers increase with increasing double noise-barrier height; however, normalized average concentrations at the top position between the barriers decrease with increasing barrier height. It was found that the double noise-barrier decreases normalized average concentrations of leeward positions, ranging from 0.8 (H/W=0.1, wake interface) to 0.1 (H/W=0.5, skimming flow) times lower than that of the no barrier case, at 10 x/h downwind position; and ranging from 1.0 (H/W=0.1) to 0.4 (H/W=0.5) times lower than that of the no barrier case, at 60 x/h downwind position.

Detection of Underwater Transient Signals Using Noise Suppression Module of EVRC Speech Codec (EVRC 음성부호화기의 잡음억제단을 이용한 수중 천이신호 검출)

  • Kim, Tae-Hwan;Bae, Keun-Sung
    • The Journal of the Acoustical Society of Korea / v.26 no.6 / pp.301-305 / 2007
  • In this paper, we propose a simple algorithm for detecting underwater transient signals, based on the fact that their frequency range is similar to the audio frequency range. For this, we use the preprocessing module of the EVRC speech codec, the standard speech codec for mobile communications. When a signal is fed into the EVRC noise suppression module, it yields parameters such as the update flag, the energy of each channel, the noise-suppressed signal, the energy of the input signal, the energy of the background noise, and the energy of the enhanced signal. The energy of the enhanced signal, normalized by the energy of the background noise, is compared with a predefined detection threshold to detect the transient signal. The detection threshold is updated from its previous value during noisy periods. The experimental results show that the proposed algorithm has an error rate of $0{\sim}4$% in AWGN and colored-noise environments.
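The detection rule described in this abstract can be sketched as follows. The abstract only says that the threshold is updated from its previous value during noisy periods. The exponential smoothing rule, the `margin` and `alpha` parameters, and the function name `detect_transients` are therefore assumptions made for illustration, and the per-frame energies are taken as given rather than computed by an actual EVRC module.

```python
def detect_transients(enhanced_energy, noise_energy, initial_threshold,
                      margin=3.0, alpha=0.9):
    """Flag frames whose normalized energy exceeds an adaptive threshold.

    enhanced_energy, noise_energy: per-frame energies (hypothetical inputs,
    standing in for the outputs of a noise suppression stage).
    The threshold update rule below is an assumed smoothing scheme.
    """
    threshold = initial_threshold
    flags = []
    for e_enh, e_noise in zip(enhanced_energy, noise_energy):
        # Enhanced-signal energy normalized by the background-noise energy.
        ratio = e_enh / e_noise if e_noise > 0 else float("inf")
        detected = ratio > threshold
        flags.append(detected)
        if not detected:
            # Noise-only frame: move the threshold toward margin * ratio,
            # keeping most of its previous value.
            threshold = alpha * threshold + (1 - alpha) * margin * ratio
    return flags


enhanced = [1.0, 1.1, 0.9, 8.0, 1.0]
noise = [1.0, 1.0, 1.0, 1.0, 1.0]
flags = detect_transients(enhanced, noise, initial_threshold=3.0)
```

Freezing the threshold while a transient is detected keeps a loud event from raising the threshold and masking the frames that follow it.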

A Correlation Study between Acoustic and EGG Parameters in Ordinary College Students and Classical Singing Students (일반학생과 성악도를 대상으로 Dr. Speech의 음향학적 측정치와 EGG 측정치의 상관관계 비교 연구)

  • 안종복;유재연;권도하;정옥란
    • Journal of the Korean Society of Laryngology, Phoniatrics and Logopedics / v.13 no.1 / pp.28-32 / 2002
  • Background and Objective: Classical singing students who have received systematic voice training show distinctive voice characteristics compared with people who have not received such training. The purpose of this study was to determine the correlation between acoustic parameters and electroglottography (EGG) parameters in two groups (ordinary college students and classical singing students). Materials and Methods: Eighty ordinary college students and 65 classical singing students participated in this study; the Dr. Speech program was used to obtain acoustic and physiologic measurements simultaneously. The Pearson correlation coefficient was used to assess the correlation between acoustic parameters and EGG parameters in each group. Results: First, there was no correlation between Jitter and EGG Jitter in the ordinary college students group, but there was a strong correlation between them in the classical singing students group. Second, there was no correlation between Shimmer and EGG Shimmer in the ordinary college students group, but there was a strong correlation between them in the classical singing students group. Third, there was no correlation between Harmonic to Noise Ratio (HNR) and EGG HNR in the ordinary college students group, but there was a strong correlation between them in the classical singing students group. Finally, there was no correlation between Normalized Noise Energy (NNE) and EGG NNE in either group.


Improved Maximum Access Delay Time, Noise Variance, and Power Delay Profile Estimations for OFDM Systems

  • Wang, Hanho;Lim, Sungmook;Ko, Kyunbyoung
    • KSII Transactions on Internet and Information Systems (TIIS) / v.16 no.12 / pp.4099-4113 / 2022
  • In this paper, we propose improved maximum access delay time, noise variance, and power delay profile (PDP) estimation schemes for orthogonal frequency division multiplexing (OFDM) systems in multipath fading channels. To this end, we adopt an approximate maximum likelihood (ML) estimation strategy. In the first step, the log-likelihood function (LLF) of the received OFDM symbols is derived using only the cyclic redundancy induced by the cyclic prefix (CP), without additional information. The set of initial path powers is then obtained sub-optimally to maximize the derived LLF. In the second step, a subset of the initial path power set, i.e., the maximum access delay time, is selected so as to maximize the modified LLF. Numerical simulations verify the benefit of the proposed method by comparison with existing methods in terms of normalized mean square error, erroneous detection probability, and good detection probability.

Energy Detector based Time of Arrival Estimation using a Neural Network with Millimeter Wave Signals

  • Liang, Xiaolin;Zhang, Hao;Gulliver, T. Aaron
    • KSII Transactions on Internet and Information Systems (TIIS) / v.10 no.7 / pp.3050-3065 / 2016
  • Neural networks (NNs) are extensively used in applications requiring signal classification and regression analysis. In this paper, an NN-based threshold selection algorithm is proposed for 60 GHz millimeter wave (MMW) time of arrival (TOA) estimation using an energy detector (ED). The algorithm is based on the skewness, kurtosis, and curl of the received energy block values. The best normalized threshold for a given signal-to-noise ratio (SNR) is determined, and the influence of the integration period and channel on the performance is investigated. Results are presented which show that the proposed NN-based algorithm provides superior precision and better robustness than other ED-based algorithms over a wide range of SNR values. Further, it is independent of the integration period and channel model.
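As background for the normalized threshold mentioned in this abstract, a threshold-crossing energy detector can be sketched as below. This is not the paper's NN-based selection algorithm. It assumes a commonly used definition in which the normalized threshold scales linearly between the minimum and maximum block energies, and it reports the center of the first block that crosses the threshold. The function name and parameters are hypothetical.

```python
def toa_from_energy_blocks(samples, block_len, sample_period, norm_threshold):
    """Estimate time of arrival from the first energy block above threshold.

    block_len * sample_period plays the role of the integration period.
    Assumed threshold: min_energy + norm_threshold * (max_energy - min_energy),
    with norm_threshold in [0, 1].
    Returns the center time of the first crossing block, or None.
    """
    # Integrate squared samples over consecutive, non-overlapping blocks.
    energies = [
        sum(s * s for s in samples[i:i + block_len])
        for i in range(0, len(samples) - block_len + 1, block_len)
    ]
    if not energies:
        return None
    lo, hi = min(energies), max(energies)
    threshold = lo + norm_threshold * (hi - lo)
    for k, energy in enumerate(energies):
        if energy > threshold:
            return (k + 0.5) * block_len * sample_period
    return None


# Toy signal: low-level noise floor with a pulse starting at sample 40.
signal = [0.01] * 40 + [1.0] * 10 + [0.01] * 30
toa = toa_from_energy_blocks(signal, block_len=10, sample_period=1e-9,
                             norm_threshold=0.5)
```

The choice of `norm_threshold` trades early false alarms against late detections, which is why the abstract treats its selection as a function of SNR.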

PCA Covariance Model Based on Multiband for Speaker Verification (화자 확인을 위한 다중대역에 기반한 주성분 분석 공분산 모델)

  • Choi, Min-Jung;Lee, Youn-Jeong;Seo, Chang-Woo
    • Speech Sciences / v.14 no.2 / pp.127-135 / 2007
  • Feature vectors of speech are generally extracted from the whole frequency domain. The inherent character of a speaker is located in the low-band or high-band frequencies. However, if the speech is corrupted by narrowband noise with concentrated energy, speaker verification performance is reduced because the individual characteristic is removed. In this paper, we propose a PCA Covariance Model based on the multiband to extract feature vectors that are robust against narrowband noise. First, we divide the overall frequency band into several subbands. Second, the correlation of the feature vectors extracted independently from each subband is removed by PCA. The distance obtained from each subband has a different distribution. To normalize these different distributions, we map each value onto a normalized distribution through a mapping function. Finally, the value obtained by applying the weighting function is used for speaker verification. In the experiments, the proposed method shows better speaker verification performance and reduces the computation.


Active control of vibration of cantilever beams using PZT actuators (PZT actuator를 이용한 외팔보의 능동진동제어)

  • Shin, Chang-Joo;Hong, Chin-Suk;Jeong, Weui-Bong
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2008.11a / pp.247-252 / 2008
  • This paper presents active vibration control of cantilever beams subject to disturbance by a primary force. Direct velocity feedback control using a paired PZT actuator and velocity sensor is considered. The variation of stability and performance with the location of the sensor/actuator pair is investigated. The maximum gain is found to vary significantly with the location of the sensor/actuator pair. The maximum gain is distributed symmetrically along the beam length about the center point, even though the boundary conditions of the beam are asymmetric. The control performance is affected by the location of the primary force as well as the location of the sensor/actuator pair. The active control system reduces the vibration more effectively when the primary force is located close to the fixed boundary.
