• Title/Summary/Keyword: audio signal processing

Search Result 156, Processing Time 0.028 seconds

Denoising Algorithm using Wavelet and Element Deviation-based Median Filter (웨이브렛과 원소 편차 기반의 중간값 필터를 이용한 잡음제거 알고리즘)

  • Bae, Sang-Bum;Kim, Nam-Ho
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.14 no.12
    • /
    • pp.2798-2804
    • /
    • 2010
  • The audio and image signal are corrupted by various noises in signal processing, many studies are being accomplished to restore those signals. In this paper, the algorithm is proposed to remove additive Gaussian noise and impulse noise at one dimension signal like an speech signal. The algorithm is composed to remove Gaussian noise after removing impulse noise. And the method using wavelet coefficient accumulation is used to remove the Gaussian noise, and the median filter based on element deviation is applied to remove the impulse noise. Also we compare existing methods using SNR(signal-to-noise ratio) as the standard of judgement of improvemental effect.

Development and Basic Experiment of Active Noise Control System for Reduction of Road Noise (도로 소음 저감을 위한 능동소음제어 시스템의 개발 및 기초실험)

  • Moon, Hak Ryong;Kang, Won Pyoung;Lim, You Jin
    • International Journal of Highway Engineering
    • /
    • v.15 no.6
    • /
    • pp.41-47
    • /
    • 2013
  • PURPOSES : The purpose of this study is about noise which is generated from roads and is consist of irregular frequency variation from low frequency to various band. The existing methods of noise reduction are sound barrier that uses insulation material and absorbing material or have applied passive technology of noise reduction by devices. The total frequency band is needed to apply active noise control. METHODS : In this study applies to the field of road traffic environment, signal processing controller and various analog signal input/output, the amplifier module is based on parallel-core embedded processor designed. DSP performs the control algorithm of the road traffic noise. Noise sources in the open space performance of evaluation were applied. In this study, controller of active signal processor was designed based on the module of audio input/output and main controller of embedded process. The controller of active signal processor operates noise reduction algorithm and performance tests of noise reduction in inside and outside environment were executed. RESULTS : The signal processing controller with OMAP-L137 parallel-core processors as the center, DSP processors in the active control operations dealt with quickly. To maximize the operation speed of an object and ARM processor is external function keys and display for functions and evaluating the performance management system was designed for the purpose of the interface. Therefore the reduction of road traffic noise has established an electronic controller-based noise reduction. CONCLUSIONS : It is shown that noise reduction is effective in the case of pour tonal sound and complex tonal sound below 500Hz by appling to Fx-LMS.

Realtime Stereo Sound Image Expansion System Using Hass Effect& Phase shifting (선착효과 및 위상처리를 이용한 실시간 스테레오 음상 확장 시스템 구현)

  • 이종철;이상훈
    • Proceedings of the IEEK Conference
    • /
    • 1998.10a
    • /
    • pp.1227-1230
    • /
    • 1998
  • Phase control methods are used to expand the sound image in general AV system. However, these methods are effective only to the signal under 1kHz, and the listener must be located in front center of the speaker system. In this paper, we realize the realtime processing system in which phase shifting method is dominant at low frequency and precedence effect is dominant at high frequency. Two sound cards are used to process the audio signal in realtime with 16 bits stereo channel of 44.1 kHz sampling frequency. And the analog circuit is designed to process the phase shifting. In experiments the usefulness of the proposed stereo system is confirmed.

  • PDF

A Study on Analysis and 3D Web Environment for the Treatment Alcoholism (알코중독 치료를 위한 Web 환경 시스템과 분석에 대한 연구)

  • Paek, Seung-Eun
    • The Journal of Information Technology
    • /
    • v.9 no.1
    • /
    • pp.9-19
    • /
    • 2006
  • Medications or conitive-behavior methods have been mainly used as a treatment of alcoholism. lately the virtualy technology has been applied to the kink of alcoholic disorders. A virtual environment makes him having avility to over come the drink. In this study, we were implemented by making panorama images and 3D object modules using 3D MAX, VRML, JAVA. And the BAR stimulator that composed with a position sensor, head mount display, and audio system, is suggested. To illustrate the physiological difference between a person who has a alcoholism and without a liquor bottle, heart rate was measured during experiment, and also measured a person's HR after the virtual reality training. The system measures the Physiological signals such as ECG, Temperature, analyzes those data automatically. The system has two parts, one is physiological data acquisition part which gets the body signal, and the other one is mobile nuit which includes signal processing and transmission functions, And Bluetooth allows two parts to communicate with each other. we demonstrated the subjective effectiveness of virtual reality psychotherapy through the clinical experiment.

  • PDF

A Real-Time Sound Recognition System with a Decision Logic of Random Forest for Robots (Random Forest를 결정로직으로 활용한 로봇의 실시간 음향인식 시스템 개발)

  • Song, Ju-man;Kim, Changmin;Kim, Minook;Park, Yongjin;Lee, Seoyoung;Son, Jungkwan
    • The Journal of Korea Robotics Society
    • /
    • v.17 no.3
    • /
    • pp.273-281
    • /
    • 2022
  • In this paper, we propose a robot sound recognition system that detects various sound events. The proposed system is designed to detect various sound events in real-time by using a microphone on a robot. To get real-time performance, we use a VGG11 model which includes several convolutional neural networks with real-time normalization scheme. The VGG11 model is trained on augmented DB through 24 kinds of various environments (12 reverberation times and 2 signal to noise ratios). Additionally, based on random forest algorithm, a decision logic is also designed to generate event signals for robot applications. This logic can be used for specific classes of acoustic events with better performance than just using outputs of network model. With some experimental results, the performance of proposed sound recognition system is shown on real-time device for robots.

Musical Instrument Recognition for the Categorization of UCC Music Source (UCC 음원분류를 위한 연주악기 분류에 대한 연구)

  • Kwon, Soon-Il;Park, Wan-Joo
    • The KIPS Transactions:PartB
    • /
    • v.17B no.2
    • /
    • pp.107-114
    • /
    • 2010
  • A guitar, a piano, and a violin are popular musical instruments for User Created Contents(UCC). However the patterns of audio signal generated by a guitar and a piano are too similar to differentiate. The difference between two musical instruments can be found by analyzing the frequency variation per each band near signal peaks. The distribution of probability on the existence of signal peaks based on Cumulative Histogram were applied to musical instrument recognition. Experiments with statistical models of the frequency variation per each band near signal peaks showed the 14% improvement of musical instrument recognition.

Threshold Based Buffer Management Algorithm for Fairness Improvement between Input Channels in ATM Networks (ATM 망에서 채널간 공평성 향상을 위한 문턱값 기반 버퍼 관리 알고리즘)

  • 고유신;강은성;고성택
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.5 no.1
    • /
    • pp.79-83
    • /
    • 2004
  • The purpose of ATM traffic management is to protect the network using minimum resources and guarantee the requred QoS and also it is desirable to provide fairness between input channels. In this paper, we propose the TBBM(threshold based buffer management) algorithm to improve fairness between input channels and utilization of ATM networks. TBBM algorithm controls output cell rate dynamically based on threshold. The result shows that the required bandwidth of the TBBM algorithm is 14.3% lower in audio traffic and 41.8% lower in video traffic than that of theoretically calculated equivalent capacity method. and also reveals that the TBBM algorithm provide improved CLR fairness between input channels

  • PDF

Collaborative Filtering and Genre Classification for Music Recommendation

  • Byun, Jeong-Yong;Nasridinov, Aziz
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.11a
    • /
    • pp.693-694
    • /
    • 2014
  • This short paper briefly describes the proposed music recommendation method that provides suitable music pieces to a listener depending on both listeners' ratings and content of music pieces. The proposed method consists of two methods. First, listeners' ratings prediction method is a combination the traditional user-based and item-based collaborative filtering methods. Second, genre classification method is a combination of feature extraction and classification procedures. The feature extraction step obtains audio signal information and stores it in data structure, while the second one classifies the music pieces into various genres using decision tree algorithm.

Design and Construction of a FFT Analyzer Using a Microcomputer (마이크로컴퓨터를 이용한 FFT 분석기의 설계 및 제작)

  • Lee, Hyeun Tae;Kim, Jung Gyu;Lee, Sang Bae
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.23 no.6
    • /
    • pp.944-949
    • /
    • 1986
  • By improving the ability of arithmatic processing with an arithmatic processor in a microcomputer and realizing the data input system for real time analysis, an FFT analyzer that is usable within the range of audio frequency is designed and constructed. The input signal passes through a gain programmable pre-amplifier and anti-aliasing lowpass filter into an analogditital converter to be converted into digital form. The converted input data is processed by an Apple II microcomputer. The results of the processing are displayed using a microcomputer display unit and can be copied on a printer or stored in a floppy disk.

  • PDF

MPEG-H 3D Audio Decoder Structure and Complexity Analysis (MPEG-H 3D 오디오 표준 복호화기 구조 및 연산량 분석)

  • Moon, Hyeongi;Park, Young-cheol;Lee, Yong Ju;Whang, Young-soo
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.42 no.2
    • /
    • pp.432-443
    • /
    • 2017
  • The primary goal of the MPEG-H 3D Audio standard is to provide immersive audio environments for high-resolution broadcasting services such as UHDTV. This standard incorporates a wide range of technologies such as encoding/decoding technology for multi-channel/object/scene-based signal, rendering technology for providing 3D audio in various playback environments, and post-processing technology. The reference software decoder of this standard is a structure combining several modules and can operate in various modes. Each module is composed of independent executable files and executed sequentially, real time decoding is impossible. In this paper, we make DLL library of the core decoder, format converter, object renderer, and binaural renderer of the standard and integrate them to enable frame-based decoding. In addition, by measuring the computation complexity of each mode of the MPEG-H 3D-Audio decoder, this paper also provides a reference for selecting the appropriate decoding mode for various hardware platforms. As a result of the computational complexity measurement, the low complexity profiles included in Korean broadcasting standard has a computation complexity of 2.8 times to 12.4 times that of the QMF synthesis operation in case of rendering as a channel signals, and it has a computation complexity of 4.1 times to 15.3 times of the QMF synthesis operation in case of rendering as a binaural signals.