Extraction of Chord and Tempo from Polyphonic Music Using Sinusoidal Modeling

  • Published : 2003.12.01

Abstract

As music of digital form has been widely used, many people have been interested in the automatic extraction of natural information of music itself, such as key of a music, chord progression, melody progression, tempo, etc. Although some studies have been tried, consistent and reliable results of musical information extraction had not been achieved. In this paper, we propose a method to extract chord and tempo information from general polyphonic music signals. Chord can be expressed by combination of some musical notes and those notes also consist of some frequency components individually. Thus, it is necessary to analyze the frequency components included in musical signal for the extraction of chord information. In this study, we utilize a sinusoidal modeling, which uses sinusoids corresponding to frequencies of musical tones, and show reliable chord extraction results of sinusoidal modeling. We could also find that the tempo of music, which is the one of remarkable feature of music signal, interactively supports the chord extraction idea, if used together. The proposed scheme of musical feature extraction is able to be used in many application fields, such as digital music services using queries of musical features, the operation of music database, and music players mounting chord displaying function, etc.

Keywords

References

  1. Martinez, Jose M. (UPM-GTI, ES), 'SO/IEC JTC1/SC29/WG11 N4980: MPEG-7 Overview (version 8),' http://www.chiariglione. org/mpeg/standards/mpeg-7/mpeg-7. htm,Klangenfurt, July 2002
  2. Bergman, A. S., Auditory Scene Analysis, 4th Ed., The MIT Press, Cambridge, England, 2001
  3. McNab, R. J., and Smith, L. A. 'Evaluation of a melody transcription system,' IEEE International Conference on Multimedia and Expo, 2, 819-822, 2000
  4. Lee, T. W. and Ziehe A., 'Combining time-delayed decorrelation and ICA: towards solving the cocktail party problem,' ICASSP Proq., 1249-1252, 1998
  5. Rossing, Thomas D., The science of sound, AddisonWesley, 1982
  6. Su, B., and Jeng, S. K., 'Multi-timbre chord classification using wavelet transform and self-organized MAP neural networks,' ICASSP Proc., 3377-3380, 2001
  7. Nishi, K., Ando, S., and Aida, S., 'Optimum harmonics tracking filter for auditory scene analysis,' ICASSP Proc., 573-576, 1996
  8. Mallat, S., Zhang, Z., 'Matching pursuits with timefrequency dictionaries,' IEEE-SP, 41 (12), 3397-3415, Dec. 1993 https://doi.org/10.1109/78.258082
  9. Goodwin, M., 'Matching pursuits with damped sinusoids,' ICASSP Proc., 2037-2040, 1997