• Title/Summary/Keyword: acoustic analyses

Search Result 158, Processing Time 0.031 seconds

Multi-dimensional Representation and Correlation Analyses of Acoustic Cues for Stops (폐쇄음 음향 단서의 다차원 표현과 상관관계 분석)

  • Yun, Weon-Hee
    • MALSORI
    • /
    • v.55
    • /
    • pp.45-60
    • /
    • 2005
  • The purpose of this paper is to represent values of acoustic cues for Korean oral stops in the multi-dimensional space, and to attempt to find possible relationships among acoustic cues through correlation analyses. The acoustic cues used for differentiation of 3 types of Korean stops are closure duration, voice onset time and fundamental frequency of a vowel after a stop. The values of these cues are plotted in the two and three dimensional space to see what the critical cues are for separation of different types of stops. Correlation coefficient analyses show that multi-variate approach to statistical analysis is legitimate, and that there are statistically significant relationships among acoustic cues but Oey are not strong enough to make the conjecture that there is a possible relationship among the articulatory or laryngeal mechanisms employed by the acoustic cues.

  • PDF

Speech recognition rates and acoustic analyses of English vowels produced by Korean students

  • Yang, Byunggon
    • Phonetics and Speech Sciences
    • /
    • v.14 no.2
    • /
    • pp.11-17
    • /
    • 2022
  • English vowels play an important role in verbal communication. However, Korean students tend to experience difficulty pronouncing a certain set of vowels despite extensive education in English. The aim of this study is to apply speech recognition software to evaluate Korean students' pronunciation of English vowels in minimal pair words and then to examine acoustic characteristics of the pairs in order to check their pronunciation problems. Thirty female Korean college students participated in the recording. Speech recognition rates were obtained to examine which English vowels were correctly pronounced. To compare and verify the recognition results, such acoustic analyses as the first and second formant trajectories and durations were also collected using Praat. The results showed an overall recognition rate of 54.7%. Some students incorrectly switched the tense and lax counterparts and produced the same vowel sounds for qualitatively different English vowels. From the acoustic analyses of the vowel formant trajectories, some of these vowel pairs were almost overlapped or exhibited slight acoustic differences at the majority of the measurement points. On the other hand, statistical analyses on the first formant trajectories of the three vowel pairs revealed significant differences throughout the measurement points, a finding that requires further investigation. Durational comparisons revealed a consistent pattern among the vowel pairs. The author concludes that speech recognition and analysis software can be useful to diagnose pronunciation problems of English-language learners.

Multi-dimenstional Representation of Acoustic Cues for Korean Stops (한국어 폐쇄음 음향단서의 다차원 표현)

  • Yun, Weon-Hee
    • Proceedings of the KSPS conference
    • /
    • 2005.04a
    • /
    • pp.25-28
    • /
    • 2005
  • The purpose of this paper is to represent values of acoustic cues for Korean oral stops in the multi-dimensional space, and to attempt to find possible relationships among acoustic cues through correlation coefficient analyses. The acoustic cues used for differentiation of 3 types of Korean stops are closure duration, voice onset time and fundamental frequency of a vowel after a stop. The values of these cues are plotted in the two and three dimensional space and see what the critical cues are for complete separation of different types of stops. Correlation coefficient analyses show that there are statistically significant relationships among acoustic cues but they are not strong enough to make a conjecture that there is a possible articulatory relationship among the mechanisms employed by the acoustic cues.

  • PDF

Gender difference in speech intelligibility using speech intelligibility tests and acoustic analyses

  • Kwon, Ho-Beom
    • The Journal of Advanced Prosthodontics
    • /
    • v.2 no.3
    • /
    • pp.71-76
    • /
    • 2010
  • PURPOSE. The purpose of this study was to compare men with women in terms of speech intelligibility, to investigate the validity of objective acoustic parameters related with speech intelligibility, and to try to set up the standard data for the future study in various field in prosthodontics. MATERIALS AND METHODS. Twenty men and women were served as subjects in the present study. After recording of sample sounds, speech intelligibility tests by three speech pathologists and acoustic analyses were performed. Comparison of the speech intelligibility test scores and acoustic parameters such as fundamental frequency, fundamental frequency range, formant frequency, formant ranges, vowel working space area, and vowel dispersion were done between men and women. In addition, the correlations between the speech intelligibility values and acoustic variables were analyzed. RESULTS. Women showed significantly higher speech intelligibility scores than men and there were significant difference between men and women in most of acoustic parameters used in the present study. However, the correlations between the speech intelligibility scores and acoustic parameters were low. CONCLUSION. Speech intelligibility test and acoustic parameters used in the present study were effective in differentiating male voice from female voice and their values might be used in the future studies related patients involved with maxillofacial prosthodontics. However, further studies are needed on the correlation between speech intelligibility tests and objective acoustic parameters.

A corpus-based study on the effects of voicing and gender on American English Fricatives (성대진동 및 성별이 미국영어 마찰음에 미치는 효과에 관한 코퍼스 기반 연구)

  • Yoon, Tae-Jin
    • Phonetics and Speech Sciences
    • /
    • v.10 no.2
    • /
    • pp.7-14
    • /
    • 2018
  • The paper investigates the acoustic characteristics of English fricatives in the TIMIT corpus, with a special focus on the role of voicing in rendering fricatives in American English. The TIMIT database includes 630 talkers and 2,342 different sentences, and comprises more than five hours of speech. Acoustic analyses are conducted in the domain of spectral and temporal properties by treating gender, voicing, and place of articulation as independent factors. The results of the acoustic analyses revealed that acoustic signals interact in a complex way to signal the gender, place, and voicing of fricatives. Classification experiments using a multiclass support vector machine (SVM) revealed that 78.7% of fricatives are correctly classified. The majority of errors stem from the misclassification of /θ/ as [f] and /ʒ/ as [z]. The average accuracy of gender classification is 78.7%. Most errors result from the classification of female speakers as male speakers. The paper contributes to the understanding of the effects of voicing and gender on fricatives in a large-scale speech corpus.

Numerical Analysis of Acoustic Characteristics in Gas Turbine Combustor with Spatial Non-homogeneity

  • Sohn, Chae-Hoon;Cho, Han-Chang
    • Journal of Mechanical Science and Technology
    • /
    • v.18 no.8
    • /
    • pp.1461-1469
    • /
    • 2004
  • Acoustic characteristics in an industrial gas-turbine combustor are numerically investigated by a linear acoustic analysis. Spatially non-homogeneous temperature field in the combustor is considered in the numerical calculation and the characteristics are analyzed in view of acoustic instability. Acoustic analyses are conducted in the combustors without and with acoustic resonator, which is one of the acoustic-damping devices or combustion stabilization devices. It has been reported that severe pressure fluctuation frequently occurs in the adopted combustor, and the measured signal of pressure oscillation is compared with the acoustic-pressure response from the numerical calculation. The numerical results are in good agreement with the measurement data. In this regard. the phenomenon of pressure fluctuation in the combustor could be caused by acoustic instability. From the numerical results for the combustor with present acoustic resonators installed, the acoustic effects of the resonators are analyzed in the viewpoints of both the frequency tuning and the damping capacity. It is found that the resonators with present specifications are not optimized and thus, the improved specification or design is required.

Acoustic Analyses of Vocal Vibrato of Korean Singers

  • Yoo, Jae-Yeon;Jeong, Ok-Ran;Kwon, Do-Ha
    • Speech Sciences
    • /
    • v.12 no.1
    • /
    • pp.37-43
    • /
    • 2005
  • The phenomenon of vocal vibrato may be regarded as an acoustic representation of one of the most rapid and continuous changes in pitch and intensity that the human vocal mechanism is capable of producing. Singers are likely to use vibrato effectively to enrich their voice. The purpose of this study was to obtain acoustic measurements (vF0 and vAm) of 45 subjects (15 trot and 15 ballad singers and 15 non-singers) and to compare acoustic measurements of the vowel /a/ produced by 3 groups on 2 voice sampling conditions (prolongation and singing of /a/). Thirty singers of trot and ballad were selected by a producer and a concert director working for the KBS (Korean Broadcasting System). The MDVP was used to measure the acoustic parameters. A two-way MANOVA was used for statistical analyses. The results were as follows; Firstly, there was no significant difference among the 3 groups in vF0 and vAm in prolongation of /a/, but in singing voice, there was a significant difference among 3 groups in vF0 and vAm. Secondly, there was an interaction between music genre and voice sampling condition in vF0, and vAm. Finally, trot singers sing with more vibrato than ballad singers. It was concluded that it is very important to analyze singers' voice including various voice conditions (prolongation, reading, conversation, and singing) and to identify differences of singing voice characteristics among music genre.

  • PDF

A Corpus-based study on the Effects of Gender on Voiceless Fricatives in American English

  • Yoon, Tae-Jin
    • Phonetics and Speech Sciences
    • /
    • v.7 no.1
    • /
    • pp.117-124
    • /
    • 2015
  • This paper investigates the acoustic characteristics of English fricatives in the TIMIT corpus, with a special focus on the role of gender in rendering fricatives in American English. The TIMIT database includes 630 talkers and 2342 different sentences, comprising over five hours of speech. Acoustic analyses are conducted in the domain of spectral and temporal properties by treating gender as an independent factor. The results of acoustic analyses revealed that the most acoustic properties of voiceless sibilants turned out to be different between male and female speakers, but those of voiceless non-sibilants did not show differences. A classification experiment using linear discriminant analysis (LDA) revealed that 85.73% of voiceless fricatives are correctly classified. The sibilants are 88.61% correctly classified, whereas the non-sibilants are only 57.91% correctly classified. The majority of the errors are from the misclassification of /ɵ/ as [f]. The average accuracy of gender classification is 77.67%. Most of the inaccuracy results are from the classification of female speakers in non-sibilants. The results are accounted for by resorting to biological differences as well as macro-social factors. The paper contributes to the understanding of the role of gender in a large-scale speech corpus.

An Acoustic Study of English Voiced Sibilants: Correct vs. Incorrect L2 Production

  • Seo, Misun;Lim, Jayeon
    • English Language & Literature Teaching
    • /
    • v.17 no.4
    • /
    • pp.251-271
    • /
    • 2011
  • The present study analyzed Korean learners' production of English /z/-/$d{\Box}$/ and /z/-/${\Box}$/ contrasts in terms of native speaker judgments and acoustic measurements. Korean learner's production was judged to be either correct or incorrect by native English speakers. Correct and incorrect productions were then compared with productions of native speakers' in terms of acoustic analyses. The results indicated that Korean speakers' correct production was more similar to that of native speakers by sharing more acoustic cues. Incorrect production by Korean speakers indicated patterns either different or opposite from that of native speakers, confirming native speaker judgments. The results also revealed acoustic cues on which native speakers rely in judging L2 speech, thereby implying that the more consistent along with more number of acoustic cues used by native speakers may facilitate the acquisition of segment contrasts by L2 learners.

  • PDF

Parametric and Wavelet Analyses of Acoustic Emission Signals for the Identification of Failure Modes in CFRP Composites Using PZT and PVDF Sensors

  • Prasopchaichana, Kritsada;Kwon, Oh-Yang
    • Journal of the Korean Society for Nondestructive Testing
    • /
    • v.27 no.6
    • /
    • pp.520-530
    • /
    • 2007
  • Combination of the parametric and the wavelet analyses of acoustic emission (AE) signals was applied to identify the failure modes in carbon fiber reinforced plastic (CFRP) composite laminates during tensile testing. AE signals detected by surface mounted lead-zirconate-titanate (PZT) and polyvinylidene fluoride (PVDF) sensors were analyzed by parametric analysis based on the time of occurrence which classifies AE signals corresponding to failure modes. The frequency band level-energy analysis can distinguish the dominant frequency band for each failure mode. It was observed that the same type of failure mechanism produced signals with different characteristics depending on the stacking sequences and the type of sensors. This indicates that the proposed method can identify the failure modes of the signals if the stacking sequences and the sensors used are known.