An acoustical analysis of speech of different speaking rates and genders using intonation curve stylization of English

영어의 억양 유형화를 이용한 발화 속도와 남녀 화자에 따른 음향 분석

  • 이서배 (창원대학교, 영어영문학과)
  • Received : 2014.11.05
  • Accepted : 2014.12.14
  • Published : 2014.12.31


An intonation curve stylization was used for an acoustical analysis of English speech. For the analysis, acoustical feature values were extracted from 1,848 utterances produced with normal and fast speech rate by 28 (12 women and 16 men) native speakers of English. Men are found to speak faster than women at normal speech rate but no difference is found between genders at fast speech rate. Analysis of pitch point features has it that fast speech has greater Pt (pitch point movement time), Pr (pitch point pitch range), and Pd (pitch point distance) but smaller Ps (pitch point slope) than normal speech. Men show greater Pt, Pr, and Pd than women. Analysis of sentence level features reveals that fast speech has smaller Sr (sentence level pitch range), Sd (sentence duration), and Max (maximum pitch) but greater Ss (sentence slope) than normal speech. Women show greater Sr, Ss, Sp (pitch difference between the first pitch point and the last), Sd, MaxNr (normalized Max), and MinNr (normalized Min) than men. As speech rate increases, women speak with greater Ss and Sr than men.


  1. Boersma, P. (2001). Praat, a system for doing phonetics by computer. Glot International. 5:9/10, 341-345.
  2. Byrd, D. (1992). Preliminary results on speaker-dependent variation in the TIMIT database. Journal of the Acoustical Society of America, 92(1), 593-596.
  3. Campione, E. (2001). Etiquetage prosodique semi-automatique de corpus oraux : algorithmes et methodologie. These de doctorat. Aix-en-Provence: Universite de Provence.
  4. Caspers, J. & Van Heuven, V. (1991). Phonetic and linguistic aspects of pitch movements in fast speech in Dutch. Proceedings of the International Congress of Phonetic Sciences, Aix-en-Provence, 5, 174-177.
  5. Cummins, F., Grimaldi, M., Leonard, T., & Simko, J. (2006).The CHAINS corpus: Characterizing individual speakers. Proceedings of SPECOM 2006, 431-435, St. Petersburg, RU.
  6. Daly, D. & Warren, P (2001). Pitching it differently in New Zealand English: Speaker sex and intonation patterns. Journal of Sociolinguistics, 5(1), 85-96
  7. Fitch, J., & Holbrook, A. (1970). Modal vocal fundamental frequency of young adults. Archives of Otolaryngology, 92, 379-382.
  8. Fitzsimons M, Sheahan N, & Staunton H. (2001) Gender and the integration of acoustic dimensions of prosody: implications for clinical studies. Brain Language, 78, 94-108.
  9. Forsell, M. (2007). Acoustic correlates of perceived emotions in speech. MS Thesis, KTH, Royal Institute of Technology, Stockholm, Sweden.
  10. Fougeron, C. & Jun, S. (1998). Rate effects on French intonation: prosodic organization and phonetic realization. Journal of Phonetics, 26, 45-69.
  11. Graddol, D., & Swann, J. (1983). Speaking fundamental frequency: Some physical and social correlates. Language & Speech, 26, 351-366.
  12. Hirst, D. & Robert E. (1993). Automatic modelling of fundamental frequency using a quadratic spline function. Travaux de l'Institut de Phonetique d'Aix, 15, 71-85.
  13. Hirst, D., Cristo, A. & Espesser, R. (2000). Levels of representation and levels of analysis for intonation. in M. Horne (ed) Prosody : Theory and Experiment. Kluwer Academic Publishers, Dordrecht. 51-87.
  14. Hudson, A., & Holbrook, A. (1981). A study of the reading fundamental vocal frequency of young black adults. Journal of Speech & Hearing Research, 24, 197-200.
  15. Jun, S. (1993) The phonetics and phonology of Korean prosody. Ph.D. dissertation, The Ohio State University.
  16. Kohler, K. (1983) F0 in speech timing. Arbeitsberichte des Instituts fur Phonetik des Universitat Kiel, 55-97.
  17. Kohler, K. (1986). Parameters of speech rate perception in German words and sentences: Duration, F0 movement and F0 level. Language & Speech, 29, 115-139.
  18. Ninness, B. & Henriksen, S. (2008). Time-scale modification of speech signals, Transactions on Signal Processing, IEEE, 56(4), 1479-1488.
  19. Oh, J. H. (2014a). A Study of methods of standardization for Korean intonation curve. Korean Linguistics, 62, 395-420. (오재혁 (2014a). 한국어 억양 곡선의 정규화 방안에 대한 연구.한국어학, 62, 395-420.)
  20. Oh, J. H. (2014b). A study of intonation curve slopes in Korean spontaneous speech. Phonetics and Speech Sciences, 6(1), 21-30. (오재혁, (2014b). 자유 발화 자료에서 나타나는 한국어 억양 곡선의 기울기 특성에 대한 연구.말소리와 음성과학, 6(1), 21-30.)
  21. Pallett, D., Fiscus, J., Fisher, W., Garofolo, J., Lund, B. & Przybocki, M. (1994). WSJ-CSR Benchmark Test Results, ARPA Spoken Language Systems and Technology Workshop, March 1994.
  22. Quene, H. (2007). On the just noticeable difference for tempo in speech. Journal of Phonetics. 35, 353-362.
  23. Sandra P. & Whiteside, S. (1996). Temporal-based acoustic-phonetic patterns in read speech: Some evidence for speaker sex differences. Journal of the International Phonetic Association, 26(1), 23-40.
  24. Shih, C., & Kochanski, G. (2003). Modeling intonation: Asking forconfirmation in English, Proceedings of the 15th International Congressof Phonetic Sciences, Barcelona, 551-554.
  25. Silverman, K., Beckman, M., Pitrelli, J., Ostendorf, M., Wightman, C., Price, P., Pierrehumbert, J. & Hirschberg, J. (1992). ToBI: A standard for labeling English prosody. Proceedings of the International Conference on Spoken Language Processing, 867-870.
  26. Stoicheff, M. (1981). Speaking fundamental frequency characteristics of nonsmoking female adults. Journal of Speech & Hearing Research, 24, 437-441.
  27. Trouvain, J. & Grice, M. (1999). The effect of tempo on prosodic structure. Proceedings of 14th International Congress of Phonetic Sciences, San Francisco, 2, 1067-1070.
  28. Vaissei're, J. (1983) Language-independent prosodic features. In Prosody: Models and measurements. Berlin: Springer, 53-66.
  29. Wang, D. & Narayanan, S. (2005). Piecewise linear stylization of pitch via wavelet analysis. Proceedings of Eurospeech 2005, Lisbon, 3277-3280.
  30. Whiteside, S. & Hodgson, C. (2000). Speech patterns of children and adults elicited via a picture-naming task: An acoustic study. Speech Communication, 32(4), 267-285.
  31. Whiteside, S. (1995). Temporal-based speaker sex differences in read speech: A sociophonetic approach. Proceedings of the XIIIth International Congress of Phonetic Sciences, 3, 516-519.
  32. Yi, S., & Kim, S. (2011). A study on low pitch accent produced in different locations in English sentences. Phonetics and Speech Sciences, 3(4), 63-70. (이서배, 김수정 (2011). 영어 문장 내 상이한 위치에 나타난 저성조 피치 액센트 연구, 말소리와 음성과학, 3(4), 63-70.)
  33. Yi, S. (2014). An acoustical analysis of emotional speech using close-copy stylization of intonation curve, Phonetics and Speech Sciences, 6(3), 131-138. (이서배, (2014). 억양의 근접복사 유형화를 이용한 감정음성의 음향분석, 말소리와 음성과학, 6(3), 131-138.)

Cited by

  1. An acoustical analysis of synchronous English speech using automatic intonation contour extraction vol.7, pp.1, 2015,