Comparing English and Korean speakers' word-final /rl/ clusters using dynamic time warping

  • Cho, Hyesun (Department of Education, Graduate School of Education, Dankook University)
  • Received : 2022.01.31
  • Accepted : 2022.03.05
  • Published : 2022.03.31


The English word-final /rl/ cluster poses a particular problem for Korean learners of English because it is a sequence of two sounds, /r/ and /l/, that are not contrastive in Korean. This study used the dynamic time warping (DTW) algorithm to measure similarity distances between English and Korean speakers' /rl/ productions. Words with /rl/ (pearl, world) and without /rl/ (bird, word) were recorded by four English speakers and four Korean speakers, and the recordings were compared pairwise. Two measures were examined: F2-F1 trajectories, the acoustic correlate of velarized /l/, and F3 trajectories, the acoustic correlate of /r/. Formant analysis showed that English speakers, unlike Korean speakers, lowered F2-F1 values toward the end of a word, suggesting that /l/ was absent from the Korean speakers' productions. In contrast, there was no significant difference in F3 values between the two groups. Mixed-effects regression analyses of the DTW distances revealed that Korean speakers produced /r/ similarly to English speakers but failed to produce the velarized /l/ in /rl/ clusters.
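To illustrate how a DTW distance between two formant trajectories can be computed, the following is a minimal sketch in Python. It is not the study's own pipeline: the function name `dtw_distance`, the absolute-difference local cost, the symmetric step pattern, and the trajectory values (invented F2-F1 contours in Hz) are all assumptions made for illustration only.

```python
import math


def dtw_distance(a, b):
    """Cumulative DTW cost between two 1-D sequences.

    Assumed setup (not taken from the paper): local cost is the absolute
    difference, and each step may advance either sequence or both.
    """
    n, m = len(a), len(b)
    # cost[i][j] holds the minimal cumulative cost of aligning a[:i] with b[:j].
    cost = [[math.inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            local = abs(a[i - 1] - b[j - 1])
            cost[i][j] = local + min(
                cost[i - 1][j],      # advance a only
                cost[i][j - 1],      # advance b only
                cost[i - 1][j - 1],  # advance both
            )
    return cost[n][m]


# Hypothetical F2-F1 trajectories in Hz; the numbers are invented.
# The two "English-like" contours fall toward the end of the word,
# with the second one simply holding the first frame for longer.
english_1 = [900, 850, 700, 550, 450]
english_2 = [900, 900, 850, 700, 550, 450]
# A "Korean-like" contour that stays high, i.e. no lowering for /l/.
korean_1 = [900, 880, 870, 860, 850]

d_same = dtw_distance(english_1, english_2)
d_cross = dtw_distance(english_1, korean_1)
print("English vs. English:", d_same)
print("English vs. Korean: ", d_cross)
```

Because DTW stretches the time axis before summing differences, the two falling contours align at zero cost despite their different lengths, while the flat contour stays distant from the falling one. Pairwise distances of this kind are what a regression analysis such as the one described above would take as its dependent variable.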


