An evaluation of Korean students' pronunciation of an English passage by a speech recognition application and two human raters

  • Yang, Byunggon (Department of English Education, Pusan National University)
  • Received : 2020.10.20
  • Accepted : 2020.12.01
  • Published : 2020.12.31

Abstract

This study examined thirty-one Korean students' pronunciation of an English passage, the Rainbow Passage, using a speech recognition application, Speechnotes, together with two Canadian raters' evaluations of the same speech according to the International English Language Testing System (IELTS) band criteria. The aim was to assess whether the application could serve as a teaching aid for pronunciation education. The results showed that the grand average percentage of correctly recognized words was 77.7%. From this moderate recognition rate, the pronunciation level of the participants was construed as intermediate or higher. The recognition rate varied depending on the composition of content words and function words in each sentence. Frequency counts of unrecognized words by group level and word type revealed the participants' typical pronunciation problems, including difficulties with fricatives and nasals. The IELTS bands assigned by the two native-speaker raters had a moderately high correlation with each other. A moderate correlation was found between the number of correctly recognized content words and the raters' bands, while an almost negligible correlation was found between the number of correctly recognized function words and the raters' bands. From these results, the author concludes that the speech recognition application could serve as a partial aid for diagnosing an individual's or a group's pronunciation problems, but further studies are needed before it can match human raters.
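The measures named in the abstract (percentage of correctly recognized words, split by content and function words, and their correlation with rater bands) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration: the abstract does not say how recognized words were aligned with the passage, so the sketch uses `difflib.SequenceMatcher`; the `FUNCTION_WORDS` set is a tiny hypothetical list, not the one used in the study; and the function names `recognition_rates` and `pearson` are invented for this example.

```python
import re
from difflib import SequenceMatcher
from math import sqrt

# Hypothetical, deliberately small function-word list (assumption, not the study's list).
FUNCTION_WORDS = {"a", "an", "the", "of", "in", "to", "and", "is", "are",
                  "with", "they", "it", "into"}


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())


def recognition_rates(reference, transcript):
    """Return percentages of reference words found in the transcript,
    overall and separately for content and function words."""
    ref = tokenize(reference)
    hyp = tokenize(transcript)
    # Align the two token sequences; matching blocks are words recognized in order.
    matcher = SequenceMatcher(a=ref, b=hyp, autojunk=False)
    matched = []
    for block in matcher.get_matching_blocks():
        matched.extend(ref[block.a:block.a + block.size])

    def percent(hits, total):
        return 100.0 * hits / total if total else 0.0

    ref_function = [w for w in ref if w in FUNCTION_WORDS]
    ref_content = [w for w in ref if w not in FUNCTION_WORDS]
    hit_function = [w for w in matched if w in FUNCTION_WORDS]
    hit_content = [w for w in matched if w not in FUNCTION_WORDS]
    return {
        "all": percent(len(matched), len(ref)),
        "content": percent(len(hit_content), len(ref_content)),
        "function": percent(len(hit_function), len(ref_function)),
    }


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (at least 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


if __name__ == "__main__":
    # Made-up transcript for illustration only; not data from the study.
    reference = "The rainbow is a division of white light into many beautiful colors."
    transcript = "The rainbow is a division of white right into many beautiful colors."
    print(recognition_rates(reference, transcript))
```

In a sketch like this, each speaker's content-word and function-word counts could be passed to `pearson` together with the raters' bands. Note that sequence alignment is only one possible scoring rule, and a different rule (for example, order-insensitive matching) would give different rates.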
