Search | Korea Science

Feature Parameter Extraction and Analysis in the Wavelet Domain for Discrimination of Music and Speech (음악과 음성 판별을 위한 웨이브렛 영역에서의 특징 파라미터)

Kim, Jung-Min;Bae, Keun-Sung
- MALSORI
- /
- no.61
- /
- pp.63-74
- /
- 2007
Discrimination of music and speech from the multimedia signal is an important task in audio coding and broadcast monitoring systems. This paper deals with the problem of feature parameter extraction for discrimination of music and speech. The wavelet transform is a multi-resolution analysis method that is useful for analysis of temporal and spectral properties of non-stationary signals such as speech and audio signals. We propose new feature parameters extracted from the wavelet transformed signal for discrimination of music and speech. First, wavelet coefficients are obtained on the frame-by-frame basis. The analysis frame size is set to 20 ms. A parameter $E_{sum}$ is then defined by adding the difference of magnitude between adjacent wavelet coefficients in each scale. The maximum and minimum values of $E_{sum}$ for period of 2 seconds, which corresponds to the discrimination duration, are used as feature parameters for discrimination of music and speech. To evaluate the performance of the proposed feature parameters for music and speech discrimination, the accuracy of music and speech discrimination is measured for various types of music and speech signals. In the experiment every 2-second data is discriminated as music or speech, and about 93% of music and speech segments have been successfully detected.
PDF

Performance Comparison of Feature Parameters and Classifiers for Speech/Music Discrimination (음성/음악 판별을 위한 특징 파라미터와 분류기의 성능비교)

Kim Hyung Soon;Kim Su Mi
- MALSORI
- /
- no.46
- /
- pp.37-50
- /
- 2003
In this paper, we evaluate and compare the performance of speech/music discrimination based on various feature parameters and classifiers. As for feature parameters, we consider High Zero Crossing Rate Ratio (HZCRR), Low Short Time Energy Ratio (LSTER), Spectral Flux (SF), Line Spectral Pair (LSP) distance, entropy and dynamism. We also examine three classifiers: k Nearest Neighbor (k-NN), Gaussian Mixure Model (GMM), and Hidden Markov Model (HMM). According to our experiments, LSP distance and phoneme-recognizer-based feature set (entropy and dunamism) show good performance, while performance differences due to different classifiers are not significant. When all the six feature parameters are employed, average speech/music discrimination accuracy up to 96.6% is achieved.
PDF

A Novel Speech/Music Discrimination Using Feature Dimensionality Reduction

Keum, Ji-Soo;Lee, Hyon-Soo;Hagiwara, Masafumi
- International Journal of Fuzzy Logic and Intelligent Systems
- /
- v.10 no.1
- /
- pp.7-11
- /
- 2010
In this paper, we propose an improved speech/music discrimination method based on a feature combination and dimensionality reduction approach. To improve discrimination ability, we use a feature based on spectral duration analysis and employ the hierarchical dimensionality reduction (HDR) method to reduce the effect of correlated features. Through various kinds of experiments on speech and music, it is shown that the proposed method showed high discrimination results when compared with conventional methods.
https://doi.org/10.5391/IJFIS.2010.10.1.007 인용 PDF KSCI

Performance Comparison of Feature Parameters and Classifiers for Speech/Music Discrimination (음성과 음악 분류를 위한 특징 파라미터와 분류 방법의 성능비교)

Kim Su Mi;Kim Hyung Soon
- Proceedings of the KSPS conference
- /
- 2003.05a
- /
- pp.149-152
- /
- 2003
In this paper, we present a performance comparison of feature parameters and classifiers for speech/music discrimination. Experiments were carried out on six feature parameters and three classifiers. It turns out that three classifiers shows similar performance. The feature set that captures the temporal and spectral structure of the signal yields good performance, while the phone-based feature set shows relatively inferior performance.
PDF

Discrimination of Cancer Cell by Fuzzy Logic in Medical Images

Na Cheol-Hun
- Journal of information and communication convergence engineering
- /
- v.4 no.1
- /
- pp.36-40
- /
- 2006
A new method of digital image analysis technique for medical images of cancer cell is presented. This paper deals with the cancer cell discrimination. The object images were the Thyroid Gland cell images that were diagnosed as normal and abnormal. This paper proposes a new discrimination method based on fuzzy logic algorithm. The focus of this paper is an automatic discrimination of cells into normal and abnormal of medical images by dominant feature parameters method with fuzzy algorithm. As a consequence of using fuzzy logic algorithm, the nucleus were successfully diagnosed as normal and abnormal. As for the experimental result, average recognition rate of 64.66% was obtained by applying single parameter of 16 feature parameters at a time. The discrimination rate of 93.08% was obtained by proposed method.
PDF KSCI

Analysis of the Robustness and Discrimination for Video Fingerprints in Video Copy Detection (복제 비디오 검출에서 비디오 지문의 강인함과 분별력 분석)

Kim, Semin;Ro, Yong Man
- Journal of Korea Multimedia Society
- /
- v.16 no.11
- /
- pp.1281-1287
- /
- 2013
In order to prevent illegal video copies, many video fingerprints have been developed. Video fingerprints should be robust from various video transformations and have high discriminative powers. In general, video fingerprints are generated from three feature spaces such as luminance, gradient, and DCT coefficients. However, there is a few study for the robustness and discrimination according to feature spaces. Thus, we analyzed the property of each feature space by video copy detion task with the robustness and the discrimination of video fingerprints. We generated three video fingerprints from these feature spaces using a same algorithm. In our test, a video fingerprint. based on DCT coefficient outperformed others because the discrimination of it was higher.
https://doi.org/10.9717/kmms.2013.16.11.1281 인용 PDF KSCI KPUBS HTML

Development of Web Based Mold Discrimination System using the Matching Process for Vision Information and CAD DB (비전정보와 캐드DB 매칭을 통한 웹 기반 금형 판별 시스템 개발)

Choi, Jin-Hwa;Jeon, Byung-Cheol;Cho, Myeong-Woo
- Transactions of the Korean Society of Machine Tool Engineers
- /
- v.15 no.5
- /
- pp.37-43
- /
- 2006
The target of this study is development of web based mold discrimination system by matching vision information with CAD database. The use of 2D vision image makes possible speedy mold discrimination from many databases. The image processing such as preprocessing, cleaning is done for obtaining vivid image with object information. The web-based system is a program which runs to exchange messages between a server and a client by making of ActiveX control and the result of mold discrimination is shown on web-browser. For effective feature classification and extraction, signature method is used to make sensible information from 2D data. As a result, the possibility of proposed system is shown as matching feature information from vision image with CAD database samples.
PDF KSCI

Speech/Music Discrimination Using Spectrum Analysis and Neural Network (스펙트럼 분석과 신경망을 이용한 음성/음악 분류)

Keum, Ji-Soo;Lim, Sung-Kil;Lee, Hyon-Soo
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.5
- /
- pp.207-213
- /
- 2007
In this research, we propose an efficient Speech/Music discrimination method that uses spectrum analysis and neural network. The proposed method extracts the duration feature parameter(MSDF) from a spectral peak track by analyzing the spectrum, and it was used as a feature for Speech/Music discriminator combined with the MFSC. The neural network was used as a Speech/Music discriminator, and we have reformed various experiments to evaluate the proposed method according to the training pattern selection, size and neural network architecture. From the results of Speech/Music discrimination, we found performance improvement and stability according to the training pattern selection and model composition in comparison to previous method. The MSDF and MFSC are used as a feature parameter which is over 50 seconds of training pattern, a discrimination rate of 94.97% for speech and 92.38% for music. Finally, we have achieved performance improvement 1.25% for speech and 1.69% for music compares to the use of MFSC.
https://doi.org/10.7776/ASK.2007.26.5.207 인용 PDF KSCI

Speech/Music Discrimination Using Spectral Peak Track Analysis (스펙트럴 피크 트랙 분석을 이용한 음성/음악 분류)

Keum, Ji-Soo;Lee, Hyon-Soo
- Proceedings of the IEEK Conference
- /
- 2006.06a
- /
- pp.243-244
- /
- 2006
In this study, we propose a speech/music discrimination method using spectral peak track analysis. The proposed method uses the spectral peak track's duration at the same frequency channel for feature parameter. And use the duration threshold to discriminate the speech/music. Experiment result, correct discrimination ratio varies according to threshold, but achieved a performance comparable to another method and has a computational efficient for discrimination.
PDF

의료영상진단기의 현황과 전망

조장희
- Journal of Biomedical Engineering Research
- /
- v.10 no.2
- /
- pp.106-108
- /
- 1989
A new method of digital image analysis technique for discrimination of cancer cell was presented in this paper. The object image was the Thyroid eland cells image that was diagnosed as normal and abnormal (two types of abnormal: follicular neoplastic cell, and papillary neoplastic cell), respectively. By using the proposed region segmentation algorithm, the cells were segmented into nucleus. The 16 feature parameters were used to calculate the features of each nucleus. A9 a consequence of using dominant feature parameters method proposed in this paper, discrimination rate of 91.11% was obtained for Thyroid Gland cells.
PDF

Search Result 172, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)