• Title/Summary/Keyword: Music source

Search Result 146, Processing Time 0.027 seconds

Separation of Single Channel Mixture Using Time-domain Basis Functions

  • Jang, Gil-Jin;Oh, Yung-Hwan
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4E
    • /
    • pp.146-155
    • /
    • 2002
  • We present a new technique for achieving source separation when given only a single charmel recording. The main idea is based on exploiting the inherent time structure of sound sources by learning a priori sets of time-domain basis functions that encode the sources in a statistically efficient manner. We derive a learning algorithm using a maximum likelihood approach given the observed single charmel data and sets of basis functions. For each time point we infer the source parameters and their contribution factors. This inference is possible due to the prior knowledge of the basis functions and the associated coefficient densities. A flexible model for density estimation allows accurate modeling of the observation, and our experimental results exhibit a high level of separation performance for simulated mixtures as well as real environment recordings employing mixtures of two different sources. We show separation results of two music signals as well as the separation of two voice signals.

Separation of Single Channel Mixture Using Time-domain Basis Functions

  • 장길진;오영환
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.4
    • /
    • pp.146-146
    • /
    • 2002
  • We present a new technique for achieving source separation when given only a single channel recording. The main idea is based on exploiting the inherent time structure of sound sources by learning a priori sets of time-domain basis functions that encode the sources in a statistically efficient manner. We derive a learning algorithm using a maximum likelihood approach given the observed single channel data and sets of basis functions. For each time point we infer the source parameters and their contribution factors. This inference is possible due to the prior knowledge of the basis functions and the associated coefficient densities. A flexible model for density estimation allows accurate modeling of the observation, and our experimental results exhibit a high level of separation performance for simulated mixtures as well as real environment recordings employing mixtures of two different sources. We show separation results of two music signals as well as the separation of two voice signals.

A Study on Signal Estimation of Modified Beamformer Method using Perturbation Covariance Matrix (섭동공분산행렬을 이용한 수정 빔형성기 방법의 신호 추정에 대한 연구)

  • Lee, Kwan-Hyeong;Cho, Tae-Jun
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.10 no.4
    • /
    • pp.333-339
    • /
    • 2017
  • Transmission signal in wireless environment receives a signal in which a source signal, interference, and noise are mixed. The goal of this study is to estimate the desired signal from the received signal. In this paper, we have studied a method correctly estimating a target in spatial by modified beamformer method. The modified bemaformer uses an adaptive array antenna and perturbation matrix to obtain the optimal weight, and estimate the desired signal by radiating the beam in spatial. We estimate a desired signal of the target by improving resolution with the modified beamformer method which does not have complicated calculation amount. Through simulation, we compare and analyze the modified beamformer method and the MUSIC method with good resolution. In result of simulation, we showed that modified beamformer method has better resolution of 10degree than classical beamformer method and showed similar performance as the MUSIC method. The resolution of this paper was estimated to be about 5 degrees.

Blind Rhythmic Source Separation (블라인드 방식의 리듬 음원 분리)

  • Kim, Min-Je;Yoo, Ji-Ho;Kang, Kyeong-Ok;Choi, Seung-Jin
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.8
    • /
    • pp.697-705
    • /
    • 2009
  • An unsupervised (blind) method is proposed aiming at extracting rhythmic sources from commercial polyphonic music whose number of channels is limited to one. Commercial music signals are not usually provided with more than two channels while they often contain multiple instruments including singing voice. Therefore, instead of using conventional modeling of mixing environments or statistical characteristics, we should introduce other source-specific characteristics for separating or extracting sources in the under determined environments. In this paper, we concentrate on extracting rhythmic sources from the mixture with the other harmonic sources. An extension of nonnegative matrix factorization (NMF), which is called nonnegative matrix partial co-factorization (NMPCF), is used to analyze multiple relationships between spectral and temporal properties in the given input matrices. Moreover, temporal repeatability of the rhythmic sound sources is implicated as a common rhythmic property among segments of an input mixture signal. The proposed method shows acceptable, but not superior separation quality to referred prior knowledge-based drum source separation systems, but it has better applicability due to its blind manner in separation, for example, when there is no prior information or the target rhythmic source is irregular.

Harmony Arrangements using B-Spline Tension Curves (B-스플라인 텐션 곡선을 이용한 음악 편곡)

  • Yoo, Min-Joon;Lee, In-Kwon;Kwon, Dae-Hyun
    • Journal of the HCI Society of Korea
    • /
    • v.1 no.1
    • /
    • pp.1-8
    • /
    • 2006
  • We suggest a graphical representation of the tension flow in tonal music using a piecewise parametric curve, which is a function of time illustrating the changing degree of tension in a corresponding chord progression. The tension curve can be edited by using conventional curve editing techniques to reharmonize the original music with reflecting the user's demand to control the tension of music. We introduce three different methods to measure the tension of a chord in terms of a specific key, which can be used to represent the tension of the chord numerically. Then, by interpolating the series of numerical tension values, a tension curve is constructed. In this paper, we show the tension curve editing method can be effectively used in several interesting applications: enhancing or weakening the overall feeling of tension in a whole song, the local control of tension in a specific region of music, the progressive transition of tension flow from source to target chord progressions, and natural connection of two songs with maintaining the smoothness of the tension flow. Our work shows the possibility of controlling the perceptual factor (tension) in music by using numerical methods. Most of the computations used in this paper are not expensive so they can be calculated in real time. We think that an interesting application of our method is an interactive modification of tension in background music according to the user's emotion or current scenario in the interactive environments such as games.

  • PDF

Frequency Range Enhancement for Faster Convergence of Neural Music Source Separation Systems (신경망 기반 음원 분리 시스템의 학습 속도 향상을 위한 음역대 강조 기법)

  • Kim, Min-Seok;Choi, Woo-Sung;Jung, Soon-Young
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2020.05a
    • /
    • pp.567-569
    • /
    • 2020
  • 여러 악기가 섞여 있는 음원으로부터 원하는 악기 소리를 추출하는 음원 분리 기법 중 최근 신경망 기반 시스템이 활발히 연구되고 있다. 악기마다 고유의 음역대를 가진다는 사실에 감안하여, 연구진은 기존 음원 분리 신경망에 적은 수의 학습 파라미터를 추가하여 학습 속도를 대폭 향상시킬 수 있는 음역대 강조 기법을 제안한다.

Blind Audio Source Separation Based On High Exploration Particle Swarm Optimization

  • KHALFA, Ali;AMARDJIA, Nourredine;KENANE, Elhadi;CHIKOUCHE, Djamel;ATTIA, Abdelouahab
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.5
    • /
    • pp.2574-2587
    • /
    • 2019
  • Blind Source Separation (BSS) is a technique used to separate supposed independent sources of signals from a given set of observations. In this paper, the High Exploration Particle Swarm Optimization (HEPSO) algorithm, which is an enhancement of the Particle Swarm Optimization (PSO) algorithm, has been used to separate a set of source signals. Compared to PSO algorithm, HEPSO algorithm depends on two additional operators. The first operator is based on the multi-crossover mechanism of the genetic algorithm while the second one relies on the bee colony mechanism. Both operators have been employed to update the velocity and the position of the particles respectively. Thus, they are used to find the optimal separating matrix. The proposed method enhances the overall efficiency of the standard PSO in terms of good exploration and performance. Based on many tests realized on speech and music signals supplied by the BSS demo, experimental results confirm the robustness and the accuracy of the introduced BSS technique.

Matched Field Processing: Ocean Experimental Data Analysis Using Feature Extraction Method (실 해상 실험 데이터를 이용한 정합장 처리에서의 특성치 추출 기법 분석)

  • Kim Kyung Seop;Seong Woo Jae;Song Hee Chun
    • The Journal of the Acoustical Society of Korea
    • /
    • v.24 no.1E
    • /
    • pp.21-27
    • /
    • 2005
  • Environmental mismatch has been one of important issues discussed in matched field processing for underwater source detection problem. To overcome this mismatch many algorithms professing robustness have been suggested. Feature extraction method (FEM) [Seong and Byun, IEEE Journal of Oceanic Engineering, 27(3), 642-652 (2002)] is one of robust matched field processing algorithms, which is based on the eigenvector estimation. Excluding eigenvectors of replica covariance matrix corresponding to large eigenvalues and forming an incoherent subspace of the replica field, the processor is formulated similarly to MUSIC algorithm. In this paper, by using the ocean experimental data, processing results of FEM and MVDR with white noise constraint (WNC) are presented for two levels of multi-tone source. Analysis of eigen-space of CSDM and FEM performance are also presented.

Effect Analysis of OSMU on Entertainment Contents Export in East-Asia Market (아시아 시장에서 엔터테인먼트 콘텐츠 수출의 One Source Multi-Use(OSMU) 효과분석 - 일본.중국.대만.홍콩 시장을 중심으로 -)

  • Lee, Chan-Do
    • International Commerce and Information Review
    • /
    • v.9 no.1
    • /
    • pp.427-449
    • /
    • 2007
  • The question of what our cultural goods might have known in a major exporting market, has intrigued investigators since 2000 year. Actually, Maybe Korean cultural assets just didn't have time to get to know International or Asia market. But now, a new euphoria can be tasted, on the lips of the small but growing Korean Contents Mania, as New Korean Wave-Crust begin to welcome the priciest contents from korea. Given Asian's surging population for our entertainment contents-drama, movie, music, character, etc., and the sense of a positive response its newly international market, it is hardly surprising. Now, Korea Wave must play an important roles in our country- economy, business, specially. This paper is seeking in OSMU on Korean Contents in East-North Asian Market, and is developing about Korean Wave study model. and It also points to a different strategy for exporting cultural contents, suggesting it should be effected for model to OSMU.

  • PDF

An Experimental Study on the Establishment of Indoor Noise Criteria - Focused on Korean Complex Building- (실내 소음기준 설정에 관한 실험적 연구 -동일 건물내 사업장을 중심으로-)

  • Lee, Tai-Gang;Kook, Chan;Jang, Gil-Soo;Kim, Sun-Woo
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2008.04a
    • /
    • pp.9-12
    • /
    • 2008
  • There are many place of business in complex building, and recently claims of noise have increased in those buildings. It is most desirable reducing the noise problems to establish the noise criteria considering the noise source and the receiving place of business, which are derived from the dose-response of noise and results of the actual condition. The degree of response to the transmission noise could be changed with background noise level in the receiving stores. In this research, the subjective evaluation for three different background level in receiving place of business or rooms were investigated from subjective tests. The eight business sound source including aerobic music were used for the test.

  • PDF