Search | Korea Science

Speech Processing System Using a Noise Reduction Neural Network Based on FFT Spectrums

Choi, Jae-Seung
- Journal of information and communication convergence engineering
- /
- v.10 no.2
- /
- pp.162-167
- /
- 2012
This paper proposes a speech processing system based on a model of the human auditory system and a noise reduction neural network with fast Fourier transform (FFT) amplitude and phase spectrums for noise reduction under background noise environments. The proposed system reduces noise signals by using the proposed neural network based on FFT amplitude spectrums and phase spectrums, then implements auditory processing frame by frame after detecting voiced and transitional sections for each frame. The results of the proposed system are compared with the results of a conventional spectral subtraction method and minimum mean-square error log-spectral amplitude estimator at different noise levels. The effectiveness of the proposed system is experimentally confirmed based on measuring the signal-to-noise ratio (SNR). In this experiment, the maximal improvement in the output SNR values with the proposed method is approximately 11.5 dB better for car noise, and 11.0 dB better for street noise, when compared with a conventional spectral subtraction method.
https://doi.org/10.6109/jicce.2012.10.2.162 인용 PDF KSCI

Adaptive Noise Subtraction in Auditory Evoked Field (적응 필터를 이용한 청각 자극에 의한 뇌자도 신호에서 노이즈 제거)

이동훈;안창범
- The Transactions of the Korean Institute of Electrical Engineers D
- /
- v.52 no.10
- /
- pp.606-610
- /
- 2003
Noise subtraction using reference channel data has been used to improve signal-to-noise ratio in magnetoencephalography. In this paper, an adaptive noise subtraction model is proposed and parameters for the model are optimized. A criterion to determine an optimal update period for the filter coefficients is proposed based on the ratio of peak amplitude of evoked field (N100m) divided by the output standard deviation. Experiments are carried out using a 40 channel MEG system. From the experiments, the proposed noise subtraction method shows superior performances over existing non-adaptive methods. Two-dimensional topographic map is shown for a diagnosis with a cubic spline interpolation.
PDF KSCI

32-Channel EEG and Evoked Potential Mapping System (32채널 뇌파 및 뇌유전발전위 Mapping 시스템)

안창범;박대준
- Journal of Biomedical Engineering Research
- /
- v.17 no.2
- /
- pp.179-188
- /
- 1996
A clinically oriented 32-channel electroencephalogram (EEG) and evoked potential (EP) mapping system has been developed EEG and EP signals acquired from 32-channel electrodes attached on the heroid surface are amplified by a pre-amplifier which is separated from main amplifier and is located near the patient to reduce signal attenuation and noise contamination between electrodes and the amplifier. The amplified signals are further amplified by a main amplifier where various filtering and gain contr61 are achieved An automatic artifact rejection scheme is employed using neural network-based EEG and artifact classifier, by which examination time is substantially reduce4 The continuously measured EEG sigrlals are used for spectral mapping, and auditory and visual evoked potentials measured in synchronous to the auditory and visual stimuli are used for temporal evoked potential mapping. A user-friendly graphical interface based on the Microsoft Window 3.1 is developed for the operation of the system. Statistical databases for comparisons of group and individual are included to support a statistically-based diagnosis.
PDF

A Study on Speech Recognition Using Auditory Model and Recurrent Network (청각모델과 회귀회로망을 이용한 음성인식에 관한 연구)

김동준;이재혁
- Journal of Biomedical Engineering Research
- /
- v.11 no.1
- /
- pp.157-162
- /
- 1990
In this study, a peripheral auditory model is used as a frequency feature extractor and a recurrent network which has recurrent links on input nodes is constructed in order to show the reliability of the recurrent network as a recognizer by executing recognition tests for 4 Korean place names and syllables. In the case of using the general learning rule, it is found that the weights are diverged for a long sequence because of the characteristics of the node function in the hidden and output layers. So, a refined weight compensation method is proposed and, using this method, it is possible to improve the system operation and to use long data. The recognition results are considerably good, even if time worping and endpoint detection are omitted and learning patterns and test patterns are made of average length of data. The recurrent network used in this study reflects well time information of temporal speech signal.
PDF

A SPECTRAL SUBTRACTION USING PHONEMIC AND AUDITORY PROPERTIES

Kang, Sun-Mee;Kim, Woo-Il;Ko, Han-Seok
- Speech Sciences
- /
- v.4 no.2
- /
- pp.5-15
- /
- 1998
This paper proposes a speech state-dependent spectral subtraction method to regulate the blind spectral subtraction for improved enhancement. In the proposed method, a modified subtraction rule is applied over the speech selectively contingent to the speech state being voiced or unvoiced, in an effort to incorporate the acoustic characteristics of phonemes. In particular, the objective of the proposed method is to remedy the subtraction induced signal distortion attained by two state-dependent procedures, spectrum sharpening and minimum spectral bound. In order to remove the residual noise, the proposed method employs a procedure utilizing the masking effect. Proposed spectral subtraction including state-dependent subtraction and residual noise reduction using the masking threshold shows effectiveness in compensation of spectral distortion in the unvoiced region and residual noise reduction.
PDF

The Hearing Ability of the Dusky spinefoot Siganus fuscescens(Houttuyn)to Audible Sound 2. The Auditory Critical Ratio (가청음에 의한 독가시치의 청각 능력 2. 청각 임계비)

Lee, Chang-Heon;Moon, Jong-Wook;Seo, Du-Ok
- Journal of Fisheries and Marine Sciences Education
- /
- v.12 no.2
- /
- pp.191-198
- /
- 2000
An experiment was carried out to obtain the fundamental data on the auditory thresholds of fishes for catching method using audible frequency sound, the auditory thresholds of dusky spinefoot Siganus fuscescens(Houttuyn) were measured in the presence of masking noise in the spectrum level range of 74 - 83dB re $1{\mu}Pa/{\sqrt{Hz}}$ by heartbeat conditioning technique using pure tones coupled with a delayed electric shock. The auditory critical ratios were about 23 - 34dB at measurement frequency range. The ratio increased almost linearly with increasing frequency from 200 to 500Hz. The noise spectrum level at the start of masking was about 61 - 73dB within the measurement frequency range. This suggests that hearing of dusky spinefoot is masked in the natural environment with the noise spectrum level above 70dB. The sound pressure level of which the signal sound of 100Hz is recognized by dusky spinefoot under the white noise of 70dB is above 98dB and the critical ratio of them is above 23dB.
PDF

A Study on Auditory Perception Characteristics of Directional Tonal Noise (방향성을 가진 회전체 소음의 청각계 인지 특성에 관한 연구)

Seo, Kang-Won;Kim, Eui-Youl;Kim, Sung-Ki
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 2012.04a
- /
- pp.348-353
- /
- 2012
This paper presents the HRTF based experimental approach to figure out why the human auditory perception on the interior noise source including the directional tonal components does not well match with the dominant features extracted from recorded acoustic signals in terms of psycho-acoustics. Since the general objective evaluation models for tonalness among various sound attributes are a function of width, frequency, excessive level of tonal components respectively, the directional tonal components cannot be properly evaluated without considering the effects of head-related transfer function on the binaural auditory perception. Thus, the directivity of source is additionally considered to prevent the erroneous conclusions from the same sound source in the process of source identification. The signal synthesis technique is used to solve a little difficulty in measuring all of desired acoustic signals for jury evaluation. The sound attributes of synthetic acoustics signals are analyzed to roughly predict the results of jury evaluation in advance by using sound quality factors such as loudness, sharpness, roughness, fluctuation strength and tonality. The jury evaluation is carefully conducted based on the recommended guideline suggested by N. Ottoet al. Each sound is respectively evaluated by selecting a value between -2 and 2 in intervals of 0.2 point. Through above procedure, based on the results of jury evaluation, it is confirmed that serious problems can be caused in the process of analyzing the dominant sound attributes in terms of psycho-acoustics according to the type of a microphone and a playback system.
PDF

The Amplification of the Morse Codes, which Cho Ji-Hoon's Poem Silent Night 1 Leaves in the Human Body

Park, In-Kwa
- International Journal of Advanced Culture Technology
- /
- v.6 no.1
- /
- pp.42-49
- /
- 2018
In this study, we tried to reveal the state of stillness of Cho Ji-Hoon's poem "Silent Night 1" as a healing modifier. The language of poem is synaptically linked to the calmness emotion of the human body, seeking a principle that leads to a state of healing. Therefore, this study was carried out for the purpose of applying the principle to literary therapy program. The silent signal embedded in the poem is encoded into the signals of the sound as it is synapsed to the human body. Encoding of auditory nerves by poem lines is like a Morse code that word and word leave in the human body. The action potential of the auditory nerve is further activated by the potential difference between the word and the word represented by the neural network, such as a Morse code, which is accessed to the human body by such a path. There is worked as amplified potential difference between the words perceived by a sound which is synapsed to the human body and by a silence which is synapsed to the human body. The phenomenon of the words approaching the human body and setting the absence of sound and amplifying the sound is because the words amplifies the Morse codes in the human neural network. At this time, the signals overlap each other. Thereby this poem is increasing the amplitude of the sound. This overlapping of auditory signals appears and amplifies the catharsis. If this Cho Ji-Hoon Poem's principle is applied to literary therapy program in the future, more effective treatment will be done.
https://doi.org/10.17703/IJACT.2018.6.1.42 인용 PDF KSCI

Development of Processing Program for Audio-vision System Based on Auditory Input (청각을 이용한 시각 재현장치의 분석프로그램 개발)

Heo, Se-Jin;Bang, Sung-Sik;Seo, Jee-Hye;Choi, Hyun-Woo;Kim, Tae-Ho;Lee, Na-Hee;Lee, Yu-Jin;Park, Ji-Won;Lee, Hui-Joong;Won, Chul-Ho;Lee, Jong-Min
- Journal of Korea Multimedia Society
- /
- v.13 no.1
- /
- pp.58-65
- /
- 2010
The final goal of our research is developing not a simple collision a1ann equipment for the blinded walkers, but the apparatus (Audio- Vision System) which can simulate vision based on auditory information so that the blinds can figure the three dimensional space in front of them. On the way to the final goal, in this study, simulation software was developed and verified. Thirty normal volunteers were included in the subject group and the average age Was 25.8 years old. After being accustomed to the system by evaluating 10 blinded virtual spaces, the volunteers performed test using another set of 10 blinded virtual spaces. The results of test were scored by shape, center, margin, and gradient surface of objects in virtual space. The score of each checking point ranged from 1 to 5, and the full score was converted to 100. As results of this study, the total score ranged from 77 to 97 with the average of 88.7. In this study, a simulation software was developed and verified to have acceptable success rale. By combining to visual sensors, the vision-reconstruction system based on auditory signal (Audio-vision System) may be developed.
PDF KSCI

Pattern classification of the synchronized EEG records by an auditory stimulus for human-computer interface (인간-컴퓨터 인터페이스를 위한 청각 동기방식 뇌파신호의 패턴 분류)

Lee, Yong-Hee;Choi, Chun-Ho
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.12 no.12
- /
- pp.2349-2356
- /
- 2008
In this paper, we present the method to effectively extract and classify the EEG caused by only brain activity when a normal subject is in a state of mental activity. We measure the synchronous EEG on the auditory event when a subject who is in a normal state thinks of a specific task, and then shift the baseline and reduce the effect of biological artifacts on the measured EEG. Finally we extract only the mental task signal by averaging method, and then perform the recognition of the extracted mental task signal by computing the AR coefficients. In the experiment, the auditory stimulus is used as an event and the EEG was recorded from the three channel $C_3-A_1$, $C_4-A_2$ and $P_Z-A_1$. After averaging 16 times for each channel output, we extracted the features of specific mental tasks by modeling the output as 12th order AR coefficients. We used total 36th order coefficient as an input parameter of the neural network and measured the training data 50 times per each task. With data not used for training, the rate of task recognition is 34-92 percent on the two tasks, and 38-54 percent on the four tasks.
https://doi.org/10.6109/jkiice.2008.12.12.2349 인용 PDF KSCI

Search Result 176, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)