Search | Korea Science

QRAS-based Algorithm for Omnidirectional Sound Source Determination Without Blind Spots (사각영역이 없는 전방향 음원인식을 위한 QRAS 기반의 알고리즘)

Kim, Youngeon;Park, Gooman
- Journal of Broadcast Engineering
- /
- v.27 no.1
- /
- pp.91-103
- /
- 2022
Determination of sound source characteristics such as: sound volume, direction and distance to the source is one of the important techniques for unmanned systems like autonomous vehicles, robot systems and AI speakers. There are multiple methods of determining the direction and distance to the sound source, e.g., using a radar, a rider, an ultrasonic wave and a RF signal with a sound. These methods require the transmission of signals and cannot accurately identify sound sources generated in the obstructed region due to obstacles. In this paper, we have implemented and evaluated a method of detecting and identifying the sound in the audible frequency band by a method of recognizing the volume, direction, and distance to the sound source that is generated in the periphery including the invisible region. A cross-shaped based sound source recognition algorithm, which is mainly used for identifying a sound source, can measure the volume and locate the direction of the sound source, but the method has a problem with "blind spots". In addition, a serious limitation for this type of algorithm is lack of capability to determine the distance to the sound source. In order to overcome the limitations of this existing method, we propose a QRAS-based algorithm that uses rectangular-shaped technology. This method can determine the volume, direction, and distance to the sound source, which is an improvement over the cross-shaped based algorithm. The QRAS-based algorithm for the OSSD uses 6 AITDs derived from four microphones which are deployed in a rectangular-shaped configuration. The QRAS-based algorithm can solve existing problems of the cross-shaped based algorithms like blind spots, and it can determine the distance to the sound source. Experiments have demonstrated that the proposed QRAS-based algorithm for OSSD can reliably determine sound volume along with direction and distance to the sound source, which avoiding blind spots.
https://doi.org/10.5909/JBE.2022.27.1.91 인용 PDF KSCI KPUBS

Independent Component Analysis Based on Frequency Domain Approach Model for Speech Source Signal Extraction (음원신호 추출을 위한 주파수영역 응용모델에 기초한 독립성분분석)

Choi, Jae-Seung
- The Journal of the Korea institute of electronic communication sciences
- /
- v.15 no.5
- /
- pp.807-812
- /
- 2020
This paper proposes a blind speech source separation algorithm using a microphone to separate only the target speech source signal in an environment in which various speech source signals are mixed. The proposed algorithm is a model of frequency domain representation based on independent component analysis method. Accordingly, for the purpose of verifying the validity of independent component analysis in the frequency domain for two speech sources, the proposed algorithm is executed by changing the type of speech sources to perform speech sources separation to verify the improvement effect. It was clarified from the experimental results by the waveform of this experiment that the two-channel speech source signals can be clearly separated compared to the original waveform. In addition, in this experiments, the proposed algorithm improves the speech source separation performance compared to the existing algorithms, from the experimental results using the target signal to interference energy ratio.
https://doi.org/10.13067/JKIECS.2020.15.5.807 인용 PDF KSCI

Audio Source Separation Method based on Beamspace-domain Multichannel Non-negative Matrix Factorization, Part II: A Study on the Beamspace Transform Algorithms (빔공간-영역 다채널 비음수 행렬 분해 알고리즘을 이용한 음원 분리 기법 Part II: 빔공간-변환 기법에 대한 고찰)

Lee, Seok-Jin;Park, Sang-Ha;Sung, Koeng-Mo
- The Journal of the Acoustical Society of Korea
- /
- v.31 no.5
- /
- pp.332-339
- /
- 2012
Beamspace transform algorithm transforms spatial-domain data - such as x, y, z dimension - into incidence-angle-domain data, which is called beamspace-domain data. The beamspace transform method is generally used in source localization and tracking, and adaptive beamforming problem. When the beamspace transform method is used in multichannel audio source separation, the inverse beamspace transform is also important because the source image have to be reconstructed. This paper studies the beamspace transform and inverse transform algorithms for multichannel audio source separation system, especially for the beamspace-domain multichannel NMF algorithm.
https://doi.org/10.7776/ASK.2012.31.5.332 인용 PDF KSCI

A Study on Respiratory-Reflected Music Play Using Skin Image (피부영상을 이용한 호흡 반영 음원 조율방법에 관한 연구)

KIM, Sung-Hyuck;Hong, Kwang_Seok
- Proceedings of the Korea Information Processing Society Conference
- /
- 2018.10a
- /
- pp.863-865
- /
- 2018
본 논문에서는 피부영상을 이용한 호흡 반영 음원 조율 방법을 제안한다. 얼굴 영상으로부터 호흡 신호를 추정하기 위해 ROI(Region of Interest)를 지정하고 지정된 영역의 색상 체계를 RGB에서 YCgCo로 변환한다. 피부 관심 영역으로부터 계산된 Cg색상 데이터 평균값에 필터링을 적용하여 호흡 신호를 검출한다. 검출된 호흡 신호를 통하여 사용자의 호흡 상태를 반영한 음원 조율방법을 제안하고, 이를 구현한 응용 프로그램을 소개한다. 구현한 응용프로그램의 성능평가를 위해 피험자 15명을 대상으로 블라인드 테스트와 MOS 평가방법을 사용하였으며, 실험 결과 9명의 피실험자가 호흡을 반영한 음원과 반영하지 않은 음원에 대한 차이를 느꼈다. 또한, MOS 평가방법으로 두 음원의 선호도를 조사한 결과 총 5점 만점 중 호흡을 반영한 음원이 4점, 원음이 3.6점을 얻었으며 이를 통해 피실험자들이 호흡이 반영된 음원을 선호한다는 결과를 확인하였다.
https://doi.org/10.3745/PKIPS.y2018m10a.863 인용 PDF

Frequency Domain Blind Source Seperation Using Cross-Correlation of Input Signals (입력신호 상호상관을 이용한 주파수 영역 블라인드 음원 분리)

Sung Chang Sook;Park Jang Sik;Son Kyung Sik;Park Keun-Soo
- Journal of Korea Multimedia Society
- /
- v.8 no.3
- /
- pp.328-335
- /
- 2005
This paper proposes a frequency domain independent component analysis (ICA) algorithm to separate the mixed speech signals using a multiple microphone array By estimating the delay timings using a input cross-correlation, even in the delayed mixture case, we propose a good initial value setting method which leads to optimal convergence. To reduce the calculation, separation process is performed at frequency domain. The results of simulations confirms the better performances of the proposed algorithm.
PDF

Prestack Reverse Time Migration for Seismic Reflection data in Block 5, Jeju Basin (제주분지 제 5광구 탄성파자료의 중합전 역시간 구조보정)

Ko, Chin-Surk;Jang, Seong-Hyung
- Economic and Environmental Geology
- /
- v.43 no.4
- /
- pp.349-358
- /
- 2010
For imaging complex subsurface structures such as salt dome, faults, thrust belt, and folds, seismic prestack reverse-time migration in depth domain is widely used, which is performed by the cross-correlation of shot-domain wavefield extrapolation with receiver-domain wavefield extrapolation. We apply the prestack reverse-time migration, which had been developed at KIGAM, to the seismic field data set of Block 5 in Jeju basin of Korea continental shelf in order to improve subsurface syncline stratigraphy image of the deep structures under the shot point 8km at the surface. We performed basic data processing for improving S/N ratio in the shot gathers, and constructed a velocity model from stack velocity which was calculated by the iterative velocity spectrum. The syncline structure of the stack image appears as disconnected interfaces due to the diffractions, but the result of the prestack migration shows that the syncline image is improved as seismic energy is concentrated on the geological interfaces.
PDF KSCI

Correlation Analyst of Music Frequence and Heart Rate Variability (음원의 주파수와 심박변화율의 상관관계 분석)

Kim, Jae-Kyung;Park, Min-Ho;Jang, Gye-Sun;Ko, Il-Ju
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2008.06a
- /
- pp.337-341
- /
- 2008
음원은 개인의 감정변화에 많은 영향을 주는 것으로 알려져 있다. 하지만 객관적인 근거가 없어 어떤 자극원이 영향을 주는지 알 수가 없다. 사람은 감정은 실시간적으로 변한다. 그렇기 때문에 이것을 측정 할 수 있는 심전도 센서를 가지고 심박변화율을 측정하여 변화되는 자극을 측정 할 수 있다. 음악은 주파수로 이루어져 있으며 음악을 들을 때 동시에 여러 대역에서 음악이 나온다. 이러한 주파수의 변화와 심박의 변화를 분석하면 감정변화의 기반하는 특징을 알 수 있을 것이다. 이것의 기초 단계로 안정감을 주는 음악은 주파수 영역이 저음 영역으로 규칙적이고 반복적이며 파형의 변화가 없다. 저음영역에서 고음영역으로 변화 되는 음원을 사용하여 심박변화율을 살펴봄으로써 자극음원이 사람에게 영향을 끼치는지를 분석하였다.
PDF

An efficient space dividing method for the two-dimensional sound source localization (2차원 상의 음원위치 추정을 위한 효율적인 영역분할방법)

Kim, Hwan-Yong;Choi, Hong-Sub
- The Journal of the Acoustical Society of Korea
- /
- v.35 no.5
- /
- pp.358-367
- /
- 2016
SSL (Sound Source Localization) has been applied to several applications such as man-machine interface, video conference system, smart car and so on. But in the process of sound source localization, angle estimation error is occurred mainly due to the non-linear characteristics of the sine inverse function. So an approach was proposed to decrease the effect of this non-linear characteristics, which divides the microphone's covering space into narrow regions. In this paper, we proposed an optimal space dividing way according to the pattern of microphone array. In addition, sound source's 2-dimensional position is estimated in order to evaluate the performance of this dividing method. In the experiment, GCC-PHAT (Generalized Cross Correlation PHAse Transform) method that is known to be robust with noisy environments is adopted and triangular pattern of 3 microphones and rectangular pattern of 4 microphones are tested with 100 speech data respectively. The experimental results show that triangular pattern can't estimate the correct position due to the lower space area resolution, but performance of rectangular pattern is dramatically improved with correct estimation rate of 67 %.
https://doi.org/10.7776/ASK.2016.35.5.358 인용 PDF KSCI

Efficient Sound Source Localization System Using Angle Division (영역 분할을 이용한 효율적인 음원 위치 추정 시스템)

Kim, Yong-Eun;Cho, Su-Hyun;Chung, Jin-Gyun
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.46 no.2
- /
- pp.114-119
- /
- 2009
Sound source localization systems in service robot applications estimate the direction of a human voice. Time delay information obtained from a few separate microphones is widely used for the estimation of the sound direction. Correlation is computed in order to calculate the time delay between two signals. Inverse cosine is used when the position of the maximum correlation value is converted to an angle. Because of nonlinear characteristic of inverse cosine, the accuracy of the computed angle is varied depending on the position of the specific sound source. In this paper, we propose an efficient sound source localization system using angle division. By the proposed approach, the region from $0^{\circ}$ to $180^{\circ}$ is divided into three regions and we consider only one of the three regions. Thus considerable amount of computation time is saved. Also, the accuracy of the computed angle is improved since the selected region corresponds to the linear part of the inverse cosine function. By simulations, it is shown that the error of the proposed algorithm is only 31% of that of the conventional a roach.
PDF KSCI

Audio Source Separation Method Based on Beamspace-domain Multichannel Non-negative Matrix Factorization, Part I: Beamspace-domain Multichannel Non-negative Matrix Factorization system (빔공간-영역 다채널 비음수 행렬 분해 알고리즘을 이용한 음원 분리 기법 Part I: 빔공간-영역 다채널 비음수 행렬 분해 시스템)

Lee, Seok-Jin;Park, Sang-Ha;Sung, Koeng-Mo
- The Journal of the Acoustical Society of Korea
- /
- v.31 no.5
- /
- pp.317-331
- /
- 2012
In this paper, we develop a multichannel blind source separation algorithm based on a beamspace transform and the multichannel non-negative matrix factorization (NMF) method. The NMF algorithm is a famous algorithm which is used to solve the source separation problems. In this paper, we consider a beamspace-time-frequency domain data model for multichannel NMF method, and enhance the conventional method using a beamspace transform. Our decomposition algorithm is applied to audio source separation, using a dataset from the international Signal Separation Evaluation Campaign 2010 (SiSEC 2010) for evaluation.
https://doi.org/10.7776/ASK.2012.31.5.317 인용 PDF KSCI

Search Result 111, Processing Time 0.026 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)