Search | Korea Science

Automatic Music Summarization Using Similarity Measure Based on Multi-Level Vector Quantization (다중레벨 벡터양자화 기반의 유사도를 이용한 자동 음악요약)

Kim, Sung-Tak;Kim, Sang-Ho;Kim, Hoi-Rin
- The Journal of the Acoustical Society of Korea
- /
- v.26 no.2E
- /
- pp.39-43
- /
- 2007
Music summarization refers to a technique which automatically extracts the most important and representative segments in music content. In this paper, we propose and evaluate a technique which provides the repeated part in music content as music summary. For extracting a repeated segment in music content, the proposed algorithm uses the weighted sum of similarity measures based on multi-level vector quantization for fixed-length summary or optimal-length summary. For similarity measures, count-based similarity measure and distance-based similarity measure are proposed. The number of the same codeword and the Mahalanobis distance of features which have same codeword at the same position in segments are used for count-based and distance-based similarity measure, respectively. Fixed-length music summary is evaluated by measuring the overlapping ratio between hand-made repeated parts and automatically generated ones. Optimal-length music summary is evaluated by calculating how much automatically generated music summary includes repeated parts of the music content. From experiments we observed that optimal-length summary could capture the repeated parts in music content more effectively in terms of summary length than fixed-length summary.
PDF KSCI

Style-Specific Language Model Adaptation using TF*IDF Similarity for Korean Conversational Speech Recognition

Park, Young-Hee;Chung, Min-Hwa
- The Journal of the Acoustical Society of Korea
- /
- v.23 no.2E
- /
- pp.51-55
- /
- 2004
In this paper, we propose a style-specific language model adaptation scheme using n-gram based tf*idf similarity for Korean spontaneous speech recognition. Korean spontaneous speech shows especially different style-specific characteristics such as filled pauses, word omission, and contraction, which are related to function words and depend on preceding or following words. To reflect these style-specific characteristics and overcome insufficient data for training language model, we estimate in-domain dependent n-gram model by relevance weighting of out-of-domain text data according to their n-. gram based tf*idf similarity, in which in-domain language model include disfluency model. Recognition results show that n-gram based tf*idf similarity weighting effectively reflects style difference.
PDF KSCI

The assessment of sound quality of loudspeaker system by using factor analysis and muliti-dimensional scaling (인자분석과 다효원척를 이용한 스피이커의 음질평가)

황영수;김영일;차일환
- The Journal of the Acoustical Society of Korea
- /
- v.3 no.1
- /
- pp.16-24
- /
- 1984
The objective data and subjective data correlated in order to rate sound quality of loudspeaker system and these data were analyzed by the Factor Analysis and Multi-Dimensioinal Scaling. The dimensions yielded Factor Analysis were interpreted as "Contrast", "Metallic", "Rich", "Present" and their relation to physical variables were explored by studying the positions of loudspeaker systems in the respective dimension. When the subjective similarity degree of loudspeaker systems was compared with the objective similarity degree of loudspeaker systems by Multi-Dimensional Scaling, the similarity degree of sound pressure response in the listening room closely coincided with the subjective similarity degree regardless of sound source. This result implies the necessity of measurements taken not only in an anechoic room but also in a listening room in order to rate sound quality of loudspeaker systems.
PDF

A code-based chromagram similarity for cover song identification (커버곡 검색을 위한 코드 기반 크로마그램 유사도)

Seo, Jin Soo
- The Journal of the Acoustical Society of Korea
- /
- v.38 no.3
- /
- pp.314-319
- /
- 2019
Computing chromagram similarity is indispensable in constructing cover song identification system. This paper proposes a code-based chromagram similarity to reduce the computational and the storage costs for cover song identification. By learning a song-specific codebook, a chromagram sequence is converted into a code sequence, which results in the reduction of the feature storage cost. We build a lookup table over the learned codebooks to compute chromagram similarity efficiently. Experiments on two music datasets were performed to compare the proposed code-based similarity with the conventional one in terms of cover song search accuracy, feature storage, and computational cost.
https://doi.org/10.7776/ASK.2019.38.3.314 인용 PDF KSCI HTML

A relevance-based pairwise chromagram similarity for improving cover song retrieval accuracy (커버곡 검색 정확도 향상을 위한 적합도 기반 크로마그램 쌍별 유사도)

Jin Soo Seo
- The Journal of the Acoustical Society of Korea
- /
- v.43 no.2
- /
- pp.200-206
- /
- 2024
Computing music similarity is an indispensable component in developing music search service. This paper proposes a relevance weight of each chromagram vector for cover song identification in computing a music similarity function in order to boost identification accuracy. We derive a music similarity function using the relevance weight based on the probabilistic relevance model, where higher relevance weights are assigned to less frequently-occurring discriminant chromagram vectors while lower weights to more frequently-occurring ones. Experimental results performed on two cover music datasets show that the proposed music similarity improves the cover song identification performance.
https://doi.org/10.7776/ASK.2024.43.2.200 인용 PDF

Measurement of Rhythmic Similarity for Auditory Memory Game (청각 기억 게임을 위한 리듬 유사도 측정 기술)

Kim, Ju-Wan;Lee, Se-Won;Park, Ho-Chong
- The Journal of the Acoustical Society of Korea
- /
- v.30 no.3
- /
- pp.136-141
- /
- 2011
In this paper, a method for measuring rhythmic similarity between two sound signals for auditory memory game is proposed. The proposed method analyzes energy fluctuation, the temporal duration of energy peak, the timbre of two signals, and detects beat positions for each signal. Then, it determines the rhythm vector after compensating a difference in tempo and the number of beats between two signals. Finally, a method for rhythmic similarity measurement is defined as a function of the dissimilarity between two rhythm vectors and a difference in the number of beats. The rhythmic similarity measured by the proposed method and that by the subjective listening test are compared, and the correlation of 0.86 between two results is achieved.
https://doi.org/10.7776/ASK.2011.30.3.136 인용 PDF KSCI

Automatic Music Summarization Using Vector Quantization and Segment Similarity

Kim, Sang-Ho;Kim, Sung-Tak;Kim, Hoi-Rin
- The Journal of the Acoustical Society of Korea
- /
- v.27 no.2E
- /
- pp.51-56
- /
- 2008
In this paper, we propose an effective method for music summarization which automatically extracts a representative part of the music by using signal processing technology. Proposed method uses a vector quantization technique to extract several segments which can be regarded as the most important contents in the music. In general, there is a repetitive pattern in music, and human usually recognizes the most important or catchy tune from the repetitive pattern. Thus the repetition which is extracted using segment similarity is considered to express a music summary. The segments extracted are again combined to generate a complete music summary. Experiments show the proposed method captures the main theme of the music more effectively than conventional methods. The experimental results also show that the proposed method could be used for real-time application since the processing time in generating music summary is much faster than other methods.
PDF KSCI

Analytical Study on Performance Evaluation of Large-Sized Silencer using Geometric Similarity Law (기하상사법을 이용한 대형 소음기의 성능평가에 관한 해석적 연구)

Yang, Jun-Hyuk;Lee, Boo-Youn;Kim, Won-Jin
- Journal of Advanced Marine Engineering and Technology
- /
- v.34 no.2
- /
- pp.275-281
- /
- 2010
In this paper, a geometric similarity law is introduced to the performance test of a large-sized silencer used in ship engine or plant system. A test of scale-down model enable to yield the cost and time saving in developing large-sized silencer considerably. Two types of silencer, resonator and expansion chamber, were analyzed by a theoretical method and an acoustical FEM(finite element method) in order to obtain geometric similarity variables. A method is proposed to estimate the transmission loss of prototype model using the test results of scale-down model. Two actual large-sized silencer, which consist of resonator and expansion chamber, were analysed by an acoustical FE analysis. Consequently, the proposed method predicts effectively the performance of prototype silencers using those of scale-down models.
https://doi.org/10.5916/jkosme.2010.34.2.275 인용 PDF KSCI

Acoustical Similarity for Small Cooling Fans Revisited (소형 송풍기 소음의 음향학적 상사성에 관한 연구)

김용철;진성훈;이승배
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 1995.04a
- /
- pp.196-201
- /
- 1995
The broadband and discrete sources of sound in small cooling fans of propeller type and centrifugal type were investigated to understand the turbulent vortex structures from many bladed fans using ANSI test plenum for small air-moving devices (AMDs). The noise measurement method uses the plenum as a test apparatus to determine the acoustic source spectral density function at each operating conditions similar to real engineering applications based on acoustic similarity laws. The characteristics of fans including the head rise vs. volumetric flow rate performance were measured using a performance test facility. The sound power spectrum is decomposed into two non-dimensional functions: an acoustic source spectral distribution function F(St,.phi.) and an acoustic system response function G(He,.phi.) where St, He, and .phi. are the Strouhal number, the Helmholtz number, and the volumetric flow rate coefficient, respectively. The autospectra of radiated noise measurements for the fan operating at several volumetric flow rates,.phi., are analyzed using acoustical similarity. The rotating stall in the small propeller fan with a bell-mouth guided is mainly due to a leading edge separation. It creates a blockage in the passage and the reduction in the flow rate. The sound power levels with respect to the rotational speeds were measured to reveal the mechanisms of stall and/or surge for different loading conditions and geometries, for example, fans installed with a impinging plate. Lee and Meecham (1993) studied the effect of the large-scale motions like impinging normally on a flat plate using Large-Eddy Simulation(LES) and Lighthill's analogy.[ASME Winter Annual Meeting 1993, 93-WA/NCA-22]. The dipole and quadrupole sources in the fans tested are shown closely related to the vortex structures involved using cross-correlations of the hot-wire and microphone signals.
PDF

A music similarity function based on probabilistic linear discriminant analysis for cover song identification (커버곡 검색을 위한 확률적 선형 판별 분석 기반 음악 유사도)

Jin Soo, Seo;Junghyun, Kim;Hyemi, Kim
- The Journal of the Acoustical Society of Korea
- /
- v.41 no.6
- /
- pp.662-667
- /
- 2022
Computing music similarity is an indispensable component in developing music search service. This paper focuses on learning a music similarity function in order to boost cover song identification performance. By using the probabilistic linear discriminant analysis, we construct a latent music space where the distances between cover song pairs reduces while the distances between the non-cover song pairs increases. We derive a music similarity function by testing hypothesis, whether two songs share the same latent variable or not, using the probabilistic models with the assumption that observed music features are generated from the learned latent music space. Experimental results performed on two cover music datasets show that the proposed music similarity improves the cover song identification performance.
https://doi.org/10.7776/ASK.2022.41.6.662 인용 PDF KSCI

Search Result 53, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)