References
- L. R. Gottlieb and G. Friedland, "On the Use of Artificial Conversation Data for Speaker Recognition in Cars," IEEE International Conference on Semantic Computing, pp. 124-128, Sept. 2009.
- P. Day and A. K. Nandi, "Robust Text-Independent Speaker Verification Using Genetic Programming," IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 1, pp. 285-295, January 2007. https://doi.org/10.1109/TASL.2006.876765
- P. Song, Y. Jin, C. Zha and L. Zhao, "Speech emotion recognition method based on hidden factor analysis," Electronics Letters, vol. 51, no. 1, pp. 112-114, Jan. 2015. https://doi.org/10.1049/el.2014.3339
- T. Yamada, M. Kumakura and N. Kitawaki, "Performance Estimation of Speech Recognition System Under Noise Conditions Using Objective Quality Measures and Artificial Voice," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 6, pp. 2006-2013, October 2006. https://doi.org/10.1109/TASL.2006.883254
- J. L. Carmona, J. Barker, A. M. Gomez and Ning Ma, "Speech Spectral Envelope Enhancement by HMM-Based Analysis/Resynthesis," IEEE Signal Processing Letters, vol. 20, no. 6, pp. 563-566, June 2013. https://doi.org/10.1109/LSP.2013.2255125
- J. Chen, J. Benesty, Y. Huang and S. Doclo, "New insights into the noise reduction Wiener filter," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1218-1234, July 2006. https://doi.org/10.1109/TSA.2005.860851
- M. Krawczyk-Becker and T. Gerkmann, "On MMSE-Based Estimation of Amplitude and Complex Speech Spectral Coefficients Under Phase-Uncertainty," IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 24, no. 12, pp. 2251-2262, December 2016. https://doi.org/10.1109/TASLP.2016.2602549
- H. K. Kim, S. H. Choi and H. S. Lee, "On approximating line spectral frequencies to LPC cepstral coefficients," IEEE Transactions on Speech and Audio Processing, vol. 8, no. 2, pp. 195-199, March 2000. https://doi.org/10.1109/89.824705
- W. W. Hung and H. C. Wang, "On the use of weighted filter bank analysis for the derivation of robust MFCCs," IEEE Signal Processing Letters, vol. 8, no. 3, pp. 70-73, Mar. 2001. https://doi.org/10.1109/97.905943
- K. V. Veena and M. Dominic, "Speaker Identification and Verification of Noisy Speech Using Multitaper MFCC and Gaussian Models," IEEE International Conference on Power, Instrumentation, Control and Computing, pp. 1-4, Dec. 2015.
- M. Holmberg, D. Gelbart and W. Hemmert, "Automatic speech recognition with an adaptation model motivated by auditory processing," IEEE Trans. on Audio, Speech, and Language Processing, vol. 14, no. 1, pp. 43-49, Jan. 2006. https://doi.org/10.1109/TSA.2005.860349
- S. F. Boll, "Suppression of acoustic noise in speech using spectral subtraction," IEEE Trans. Acoust., Speech, Signal Processing, vol.27, no.2, pp. 113-120, April 1979. https://doi.org/10.1109/TASSP.1979.1163209
- S. K. Pal and S. Mitra, "Multilayer perceptron, fuzzy sets, and classification," IEEE Transaction on Neural Networks, vol. 3, no. 5, pp. 683-697, Sep. 1992. https://doi.org/10.1109/72.159058
- A. Kurematsu, K.Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis," Speech Communication, vol. 9, pp.357-363, 1990. https://doi.org/10.1016/0167-6393(90)90011-W
- D. Rumelhart, G. Hinton and R. Williams, "Learning representations by back-propagation errors," Nature, vol. 323, pp. 533-536, Oct. 1986. https://doi.org/10.1038/323533a0