DOI QR코드

DOI QR Code

An Automatic Method of Detecting Audio Signal Tampering in Forensic Phonetics

법음성학에서의 오디오 신호의 위변조 구간 자동 검출 방법 연구

  • Received : 2014.05.19
  • Accepted : 2014.06.16
  • Published : 2014.06.30

Abstract

We propose a novel scheme for digital audio authentication of given audio files which are edited by inserting small audio segments from different environmental sources. The purpose of this research is to detect inserted sections from given audio files. We expect that the proposed method will assist human investigators by notifying suspected audio section which considered to be recorded or transmitted on different environments. GMM-UBM and GSV-SVM are applied for modeling the dominant environment of a given audio file. Four kinds of likelihood ratio based scores and SVM score are used to measure the likelihood for a dominant environment model. We also use an ensemble score which is a combination of the aforementioned five kinds of scores. In the experimental results, the proposed method shows the lowest average equal error rate when we use the ensemble score. Even when dominant environments were unknown, the proposed method gives a similar accuracy.

Keywords

References

  1. Brixen, E. B. (2007). Techniques for the authentication of digital audio recording, Audio Engineering Society Convention 122.
  2. Ojowu, O. (2012). ENF extraction from digital recordings using adaptive techniques and frequency tracking, Information Forensics and Security. Vol. 7, 1330-1338. https://doi.org/10.1109/TIFS.2012.2197391
  3. Grigoras, C. (2009). Applications of ENF analysis in forensic authentication of digital audio and video recording, Journal of the Audio Engineering Society. Vol. 57, Issue 9, 643-661.
  4. Nicolade, D. P. & Apolinario, J. A. (2009). Evaluating digital audio authenticity with spectral distances and ENF phase change, in proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 1417-1420.
  5. Malik, H. (2013). Acoustic environment identification and its applications to audio forensics, IEEE Transactions on Information Forensics and Security. Vol. 8, No. 11, 1827-1837. https://doi.org/10.1109/TIFS.2013.2280888
  6. Kraetzer, C., Oermann, A., Dittmann, J. & Lang, A. (2007). Digital audio forensics: a first practical evaluation on microphone and environment classification, in proceedings of the 9th workshop on Multimedia & security, 63-74.
  7. Garcia-Romero, D. & Espy-Wilson, C. Y. (2010). Automatic acquisition device identification from speech recordings, in proceedings of the IEEE International Conference on Acoustics Speech and Signal Processing. 1806-1809.
  8. Luo, D., Luo, W., Yang, R. & Huang, J. (2012). Compression history identification for digital audio signal, in proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 1733-1736.
  9. Jenner, F. & Kwasinski, A. (2012). Highly accurate non-intrusive speech forensics for codec identifications from observed decoded signals, in proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing. 1737-1740.
  10. Reynolds, D. A., Quatieri, T. F. & Dunn, R. B. (2000). Speaker Verification Using Adapted Gaussian Mixture Models, Digital Signal Processing. Vol. 10, 19-41. https://doi.org/10.1006/dspr.1999.0361
  11. Campbell, W. M., Sturim, D. E. & Reynolds, D. A. (2006). Support Vector Machines Using GMM Supervectors for Speaker Verification, IEEE Signal Processing Letters, Vol. 13, No. 5, 308-311. https://doi.org/10.1109/LSP.2006.870086
  12. Scholkopf, B., Williamson, R. C., Smola, A. J., Shawe-Taylor, J. & Platt, J. C. (1999). Support Vector Method for Novelty Detection, Advances in Neural Information Processing Systems. Vol. 12, 582-588.
  13. Kim, M. J., Yang, I. H., So, B. M., Kim, M. S. & Yu, H. J. (2012). Histogram Equalization Using Background Speakers' Utterances for Speaker Identification, Journal of the Korean society of speech sciences. Vol. 4, No. 2, 79-86. (김명재, 양일호, 소병민, 김민석, 유하진. (2012). 화자 식별에서의 배경화자데이터를 이용한 히스토그램 등화 기법. 말소리와 음성과학, 4권 2호, 79-86.) https://doi.org/10.13064/KSSS.2012.4.2.079
  14. FFmpeg, http://www.ffmpeg.org/index.html.
  15. LAME MP3 Encoder, http://lame.sourceforge.net/.
  16. Opus Codec, http://www.opus-codec.org/downloads/.
  17. Young, S., Evermann, G., Gales, M.H.T., Kershaw, D., Liu, X., Moore, G., Odell, J., Ollason, D., Povey, D., Valtchev, V. & Woodland, P. (2009). The HTK Book (for HTK Version 3.4), Cambridge University Engineering Department.
  18. Chang, C. & Lin, C. (2014). LIBSVM -- A Library for Support Vector Machines, http://www.csie.ntu.edu.tw/-cjlin/libsvm/.