• Title/Summary/Keyword: Original Sound

Search Result 227, Processing Time 0.023 seconds

Real-Time Implementation for Vocal-Removal Algorithm (보컬 제거 알고리즘의 실시간 구현)

  • Kim, Hyun-Tae;Do, Jin-Gyu;Park, Jang-Sik
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2010.10a
    • /
    • pp.268-270
    • /
    • 2010
  • Recently, According to increasing interest to original sound Karaoke instrument, MIDI type karaoke manufacturer attempt to make more cheap method instead of original recoding method. In this paper, we developed how to create MR from AR, recorded in stereo, by using the energy difference in the frequency domain and how to implement in DSP(TMS320C6713) were developed. At the output of the DSP board, 6-channel audio output interface designed for real-time stereophonic generating original sound, vocals removed MR, and separated vocals simultaneously. Real-time listening test using DSP show vocal separating and removal task successfully.

  • PDF

EEG-based Analysis of Auditory Stimulations Generated from Watching Disgust-Eliciting Videos (혐오 영상 시청시 청각적 자극에 대한 EEG 기반의 분석)

  • Lee, Mi-Jin;Kim, Hae-Lin;Kang, Hang-Bong
    • Journal of Korea Multimedia Society
    • /
    • v.19 no.4
    • /
    • pp.756-764
    • /
    • 2016
  • In this paper, we present electroencephalography (EEG)-based power spectra analysis and auditory stimuli methods as coping mechanisms for disgust affection and phobia. Disgust affection is a negative emotion generated from trying to eliminate something harmful to one. It is usually related to mental illnesses such as obsessive-compulsive disorder, specifically phobia and depression. In our experiments, participants watched videos on horrible body mutilation and disgusting creatures, with either the original sound track or relaxing and exciting music as auditory stimulation. After watching the videos with original sound track, the participants watched the same video with a different audio background, such as soothing or cheerful music. We analyzed the EEG data utilizing relative power spectra and examined survey results of the participants. The results demonstrated that disgust affection is decreased when participants watched the video with relaxing or exciting music instead of the original soundtracks. Moreover, we confirmed that human's brainwave reacts according to types of audio and sources of disgust affection.

Age-related Deficits in Response Characteristics on Safety Warning of Intelligent Vehicle (지능형 자동차의 안전 경고음에 대한 고령운전자의 반응 특성)

  • Kim, Man-Ho;Lee, Yong-Tae;Son, Joon-Woo;Jang, Chee-Hwan
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.26 no.12
    • /
    • pp.131-137
    • /
    • 2009
  • Recent technological advances made a vehicle more intelligent to increase safety and comfort. An intelligent vehicle provides drivers with safety warning information through audible sounds, visual displays, and tactile devices. However, elderly drivers have been known to decrease the physical and cognitive abilities such as muscular strength, hearing, eyesight, short term memory, and spatial perception. Therefore, possible age-related deficits should be considered to design an effective warning system. This paper aims to evaluate the impact of advancing age on response performance on audible safety warnings which are widely used for alerting driving hazards. In order to understand the effect of age-related hearing loss and movement slowing, three sound characteristics (frequency, intensity, and period) and three age groups (younger, middle, and older) are considered. Data was drawn from 38 drivers who drove a simulated rural road in a driving simulator. Experimental results show that age influences driver's response performance. In conclusion, the appropriate range of a warning sound is suggested.

Sound Field Reconstruction Technology Using a Three Dimensional Loudspeaker Array (3차원 라우드스피커 어레이를 이용한 음장재현기술)

  • Seo, Jeong-Il;Kang, Kyeong-Ok;Fazi, Filippo M.;Nelson, Philip A.
    • The Journal of the Acoustical Society of Korea
    • /
    • v.28 no.8
    • /
    • pp.723-731
    • /
    • 2009
  • In this paper, we propose a novel sound field reconstruction algorithm using a three dimensional loudspeaker array for providing realistic sound field to multiple listeners. The proposed algorithm is based on minimization of the squared error between the original sound field and the reconstructed sound field by the loudspeaker array over a predefined three dimensional region of the space using a loudspeaker array surrounding the listening area. For evaluating the proposed algorithm, we constructed the three dimensional array composed of 40 loudspeakers and discuss the relevant experiment results.

Implementation of Active Sound Enrichment Control for Improving Engine Sound Quality Inside the Cabin of a Passenger Car (차량 실내공간의 가속 시 엔진음 음질 향상을 위한 실시간 능동음향증강 제어 구현)

  • Lee, Young-Sup;Kim, Jeakwan;Ryu, Seokhoon;Kim, Seonghyeon;Park, Dong Chul
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.26 no.2
    • /
    • pp.195-202
    • /
    • 2016
  • In this study, a concept of active sound enrichment (ASE) control system was implemented and demonstrated for improving engine sound quality inside the cabin of a passenger car during acceleration. Unlike the active noise control cancels the noise for disturbance rejection, the ASE adds additional sound to the noise for tracking control. This approach requires a new algorithm to provide additional artificial sound to the original engine sound using active control strategy to achieve a target sound profile, which is predefined to satisfy required interior sound quality. The ASE algorithm was implemented in a digital controller dSPACE DS1401 and real-time control experiment was accomplished in an actual car. The ASE control results show that the actively enriched sound of each engine order against RPM tracks the target profiles precisely and quickly and improves the discontinuity, the level ratios and the sound pressure level of each engine order. Thus it is anticipated the ASE system can be applied for the improvement of the engine sound quality inside the cabin during acceleration.

Optimum Image Compression Rate Maintaining Diagnostic Image Quality of Digital Intraoral Radiographs

  • Song Ju-Seop;Koh Kwang-Joon
    • Imaging Science in Dentistry
    • /
    • v.30 no.4
    • /
    • pp.265-274
    • /
    • 2000
  • Purpose: The aims of the present study are to determine the optimum compression rate in terms of file size reduction and diagnostic quality of the images after compression and evaluate the transmission speed of original or each compressed image. Materials and Methods: The material consisted of 24 extracted human premolars and molars. The occlusal surfaces and proximal surfaces of the teeth had a clinical disease spectrum that ranged from sound to varying degrees of fissure discoloration and cavitation. The images from Digora system were exported in TIFF and the images from conventional intraoral film were scanned and digitalized in TIFF by Nikon SF-200 scanner (Nikon, Japan). And six compression factors were chosen and applied on the basis of the results from a pilot study. The total number of images to be assessed were 336. Three radiologists assessed the occlusal and proximal surfaces of the teeth with 5-rank scale. Finally diagnosed as either sound or carious lesion by one expert oral pathologist. And sensitivity, specificity and k value for diagnostic agreement was calculated. Also the area (Az) values under the ROC curve were calculated and paired t-test and oneway ANOVA test was performed. Thereafter, transmission time of the image files of the each compression level was compared with that of the original image files. Results: No significant difference was found between original and the corresponding images up to 7% (1 : 14) compression ratio for both the occlusal and proximal caries (p<0.05). JPEG3 (1 : 14) image files are transmitted fast more than 10 times, maintained diagnostic information in image, compared with original image files. Conclusion: 1 : 14 compressed image file may be used instead of the original image and reduce storage needs and transmission time.

  • PDF

Comparison of Speech Intelligibility & Performance of Speech Recognition in Real Driving Environments (자동차 주행 환경에서의 음성 전달 명료도와 음성 인식 성능 비교)

  • Lee Kwang-Hyun;Choi Dae-Lim;Kim Young-Il;Kim Bong-Wan;Lee Yong-Ju
    • MALSORI
    • /
    • no.50
    • /
    • pp.99-110
    • /
    • 2004
  • The normal transmission characteristics of sound are hardly obtained due to the various noises and structural factors in a running car environment. It is due to the channel distortion of the original source sound recorded by microphones, and it seriously degrades the performance of the speech recognition in real driving environments. In this paper we analyze the degree of intelligibility under the various sound distortion environments by channels according to driving speed with respect to speech transmission index(STI) and compare the STI with rates of speech recognition. We examine the correlation between measures of intelligibility depending on sound pick-up patterns and performance in speech recognition. Thereby we consider the optimal location of a microphone in single channel environment. In experimentation we find that high correlation is obtained between STI and rates of speech recognition.

  • PDF

Study for Visualization of Rotating Sound Source Using Microphone Array (마이크로폰 어레이를 이용한 회전하는 소음원 가시화에 관한 연구)

  • Rhee, Wook;Park, Sung;Lee, Ja-Hyung;Kim, Jai-Moo;Choi, Jong-Soo
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.16 no.6 s.111
    • /
    • pp.565-573
    • /
    • 2006
  • Acoustic analysis of a moving sound source required that the measured sound signals be do-Dopplerized and restored as of the original emission signals. The purpose of this research is development of beamforming technique can be applied to the rotor noise source identification. For the do-Dopplerization and reconstruction of emitted sound wave, Forward Propagation Method is applied to the time domain beamforming technique. And validation test were performed using rotating sound source constructed by bended pipe and horn driver. In the validation test using sinusoidal sound wave, sufficient performance of signal processing can be seen, and the effect of measuring duration for accuracy was compared. In the prop-rotor measurements, the acoustic source locations were successfully verified in varying positions for different frequencies and collective pitch angle, in hover condition.

HRTF Interpolation Using a Spherical Head Model (원형 머리 모델을 이용한 머리 전달 함수의 보간)

  • Lee, Ki-Seung;Lee, Seok-Pil
    • The Journal of the Acoustical Society of Korea
    • /
    • v.27 no.7
    • /
    • pp.333-341
    • /
    • 2008
  • In this paper, a new interpolation model for the head related transfer function (HRTF) was proposed. In the method herein, we assume that the impulse response of the HRTF for each azimuth angle is given by linear interpolation of the time-delayed neighboring impulse responses of HRTFs. The time delay of the HRTF for each azimuth angle is given by sum of the sound wave propagation time from the ears to the sound source, which can be estimated by using azimuth angle, the physical shape of the underlying head and the distance between the head and sound source, and the refinement time yielding the minimum mean square error. Moreover, in the proposed model, the interpolation intervals were not fixed but varied, which were determined by minimizing the total number of HRTFs while the synthesized signals have no perceptual difference from the original signals in terms of sound location. To validate the usefulness of the proposed interpolation model, the proposed model was applied to the several HRTFs that were obtained from one dummy-head and three human heads. We used the HRTFs that have 5 degree azimuth angle resolution at 0 degree elevation (horizontal plane). The experimental results showed that using only $30\sim40%$ of the original HRTFs were sufficient for producing the signals that have no audible differences from the original ones in terms of sound location.