A Novel Speech/Music Discrimination Using Feature Dimensionality Reduction

Keum, Ji-Soo;Lee, Hyon-Soo;Hagiwara, Masafumi;

doi:10.5391/IJFIS.2010.10.1.007

International Journal of Fuzzy Logic and Intelligent Systems

제10권1호
/
Pages.7-11
/
2010
/
1598-2645(pISSN)
/
2093-744X(eISSN)

한국지능시스템학회 (Korean Institute of Intelligent Systems)

DOI QR Code

A Novel Speech/Music Discrimination Using Feature Dimensionality Reduction

Keum, Ji-Soo (Faculty of Science and Technology, Keio University) ;
Lee, Hyon-Soo (Dept. of Computer Engineering, Kyung Hee University) ;
Hagiwara, Masafumi (Faculty of Science and Technology, Keio University)

투고 : 2009.12.07
심사 : 2010.01.15
발행 : 2010.03.25

https://doi.org/10.5391/IJFIS.2010.10.1.007 인용 PDF KSCI

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

In this paper, we propose an improved speech/music discrimination method based on a feature combination and dimensionality reduction approach. To improve discrimination ability, we use a feature based on spectral duration analysis and employ the hierarchical dimensionality reduction (HDR) method to reduce the effect of correlated features. Through various kinds of experiments on speech and music, it is shown that the proposed method showed high discrimination results when compared with conventional methods.

키워드

참고문헌

J. Saunders, “Real-time discrimination of broadcast speech/ music,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pp. 993-996, 1996.
E. Scheirer and M. Slaney, “Construction and evaluation of a robust multifeature speech/music discriminator,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP), pp. 1331-1334, 1997.
T. Zhang and J. Kuo, “Audio content analysis for online audiovisual data segmentation and classification,” IEEE Trans. on Speech and Audio Processing, vol. 9, no. 4, pp. 441-457, 2001. https://doi.org/10.1109/89.917689
L. Lu, H. Zhang, and H. Jiang, “Content analysis for audio classification and segmentation,” IEEE Trans. on Speech and Audio Processing, vol. 10, no. 7, pp. 504-516, 2002. https://doi.org/10.1109/TSA.2002.804546
J.-S. Keum and H.-S. Lee, “Speech/music discrimination using spectral peak feature for speaker indexing,” Proc. IEEE Int. Sym. on Intelligent Signal Processing and Communication Systems (ISPACS), pp. 323-326, 2006.
J.-S. Keum, S.-K. Lim, and H.-S. Lee, “Speech/music discrimination using spectrum analysis and neural network,” The Journal of the Acoustical Society of Korea, vol. 26, no. 5, pp. 207-213, 2007.
J.-S. Keum, H.-S. Lee, and M. Hagiwara, “An improved speech/nonspeech classification based on feature combination for audio indexing,” IEICE Trans. on Fundamentals, vol. E93-A, no. 4, 2010.
J.R. Deller, J.G. Proakis, and J.H.L. Hansen, Discrete Time Processing of Speech Signals, Prentice Hall, 1987.
R.O. Duda, P.E. Hart, and D.G. Stork, Pattern Classification, John Wiley & Sons, 2001.
AR Abu-El-Quran and RA Goubran, “Security-monitoring using microphone arrays and audio classification,” IEEE Trans. on Instrumentation and Measurement, vol. 55, no. 4, pp. 1025-1032, 2006. https://doi.org/10.1109/TIM.2006.876394

피인용 문헌

A Classification Method Using Data Reduction vol.12, pp.1, 2012, https://doi.org/10.5391/IJFIS.2012.12.1.1

International Journal of Fuzzy Logic and Intelligent Systems

A Novel Speech/Music Discrimination Using Feature Dimensionality Reduction

초록

키워드

참고문헌

피인용 문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)