Search | Korea Science

On a Performance Improvement of Speaker Recognition by using the Auditory Characteristics of Speech (음성의 청각특성을 이용한 화자식별시스템의 성능향상에 관한 연구)

이윤주;오세영배재옥배명진
- Proceedings of the IEEK Conference
- /
- 1998.10a
- /
- pp.1223-1226
- /
- 1998
The pre-emephasis filter as the conventional method emphasizes all components of high frequency that reflects the speaker characteristics. However this filter don't show the auditory characteristics of speaker's speech. In order to emphasize the perceptual characteristics, we propose the speaker recognition system that uses the perceptual weighting as the preprocessor because the Auditory characteristic of human is sensitive to the formant peaks. This filter has the characteristcs that both deemphasizes the low-formants and emphasizes the high formants. As a result of the proposed method, we improve the total recognition rate 1.7% better than the conventional method.
PDF

The Study of the Sensorineural Hearing Loss Compensation Algorithm using Psychoacoustics Model (심리음향모델을 적용한 난청 보정 알고리즘의 연구)

노형철;김헌중;한헌수;차형태
- Proceedings of the IEEK Conference
- /
- 2000.09a
- /
- pp.189-192
- /
- 2000
본 논문에서는 청각 장애인의 보다 향상된 보청 환경을 조성하고자 청각손실을 심리음향 모델을 적용하여 감음 신경성 난청을 보정하는 알고리즘을 제안한다. 제안한 알고리즘에서는 난청의 유형은 내이에서부터 중추 뇌에 걸친 감음계와 신경계의 장애에서 비롯되는 감음신경성 난청(sensorineural hearing loss)으로 주파수 영역상에서 MTH(minimum hearing threshold)가 균일하지 않게 상승하게되어 가청영역이 좁아지는 문제점을 해결하기 위한 방법으로 각각의 주파수 밴드마다 멀티밴드 압축 알고리즘을 적용하였다. 그러나 이 경우 각각의 주파수 밴드에 따른 서로 다른 가청 영역의 영향에 의한 변형된 스펙트럼 모양으로 인해 spectral contrast reduction과 변형된 마스킹 특성으로 인해 음성 변별력에 제한을 가하게 된다. 이것은 주변 주파수 성분들에 의한 마스킹 효과에 의한 것으로, 신호에 대한 난청인이 느끼는 지각 영역(perceptual domain)에서의 해석과 심리음향 모델 파라미터를 통한 보청기의 개발이 이루어져야 하며, 본 논문에서 그 알고리즘을 적용하였다.
PDF

Auto fitting Parameter Extraction for Digital Hearing Aids (디지털 보청기의 자동 보정 파라미터 추출)

석수영;정호열;정현열
- Journal of Korea Multimedia Society
- /
- v.3 no.5
- /
- pp.495-505
- /
- 2000
In this paper, we propose an efficient auto-fitting system for digital hearing-aids which automatically adjusts the fitting parameters according to the auditory characteristics of hearing handicapped person. The fitting parameters are extracted from audiogram of hearing handicapped and are applied to digital hearing-aid purposed GM3036 chip. The characteristics of each parameter are compared with those from theoretical 2cc graph. The purposed system has applied to 50 patients and their satisfaction ratios show to the very high. As results, it shows effectiveness of proposed system.
PDF

The Development of Auditory Emotion Recognition Function on Human Sounds for Psycho-acoustic Parameters (소리의 심리음향특성을 활용한 사람 소리에 대한 청각-감성 인식 함수 개발)

Choi, Y.I.;Kim, M.H.;Jung, S.S.;Lee, S.;Choi, I.M.;Park, Y.K.
- Proceedings of the Korean Society of Precision Engineering Conference
- /
- 2013.05a
- /
- pp.1039-1040
- /
- 2013
PDF

A Study on the Spectrum Variation of Korean Speech (한국어 음성의 스펙트럼 변화에 관한 연구)

Lee Sou-Kil;Song Jeong-Young
- Journal of Internet Computing and Services
- /
- v.6 no.6
- /
- pp.179-186
- /
- 2005
We can extract spectrum of the voices and analyze those, after employing features of frequency that voices have. In the spectrum of the voices monophthongs are thought to be stable, but when a consonant(s) meet a vowel(s) in a syllable or a word, there is a lot of changes. This becomes the biggest obstacle to phoneme speech recognition. In this study, using Mel Cepstrum and Mel Band that count Frequency Band and auditory information, we analyze the spectrums that each and every consonant and vowel has and the changes in the voices reftects auditory features and make it a system. Finally we are going to present the basis that can segment the voices by an unit of phoneme.
PDF

Isolated-Word Speech Recognition in Telephone Environment Using Perceptual Auditory Characteristic (인지적 청각 특성을 이용한 고립 단어 전화 음성 인식)

Choi, Hyung-Ki;Park, Ki-Young;Kim, Chong-Kyo
- Journal of the Institute of Electronics Engineers of Korea TE
- /
- v.39 no.2
- /
- pp.60-65
- /
- 2002
In this paper, we propose GFCC(gammatone filter frequency cepstrum coefficient) parameter which was based on the auditory characteristic for accomplishing better speech recognition rate. And it is performed the experiment of speech recognition for isolated word acquired from telephone network. For the purpose of comparing GFCC parameter with other parameter, the experiment of speech recognition are carried out using MFCC and LPCC parameter. Also, for each parameter, we are implemented CMS(cepstral mean subtraction)which was applied or not in order to compensate channel distortion in telephone network. Accordingly, we found that the recognition rate using GFCC parameter is better than other parameter in the experimental result.
PDF KSCI

Antioxidative Activity of Mustard Leaf Kimchi with Optional Ingredients (부재료 첨가에 따른 갓김치의 항산화성)

최영숙;황정희;김재이;전영수;최홍식
- Journal of the Korean Society of Food Science and Nutrition
- /
- v.29 no.6
- /
- pp.1003-1008
- /
- 2000
Antioxidative activities (AA) of mustard leaf kimchi (MLK) by the addition of optional ingredients among selected minor materials were studied. In order to determine AA of MLK with different spices, the model systems of ground cooked beef with green onion, garlic, and red pepper powder were prepared and stored for 4 weeks at 4$^{\circ}C$. AA of red pepper added group was stronger than those of others. AA of red MLK was relatively higher than that of (green) MLK. For the enhancement of AA of MLK, another model systems were prepared with the selected antioxidative optional ingredients, which were bonnet bellflower root, leek, burdock, sea tangle, sea mustard, seastaghorn at the level of 2% or 4%. The extracts of water, 75% methanol and hexane of MLK, bonnet bellflower root added MLK, and seastaghorn added MLK had a considerable AA with the inhibition of peroxide formation during the autioxidation of linoleic acid mixtures in aqueous model systems at 37$^{\circ}C$. Therefore, AA was more effective in MLK containing specific optional ingredients than that of MLK alone significantly.
PDF

Phoneme Recognition using Temporal and Spectral Features based on Spikegram (스파이크그램 기반의 주파수 및 시간 특성을 이용한 음소 인식)

Han, Seokhyeon;Kim, Jaewon;An, Soonho;Shin, Seonghyeon;Park, Hochong
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2019.06a
- /
- pp.156-157
- /
- 2019
본 논문에서는 스파이크그램 기반의 주파수 및 시간 특성을 이용한 음속 인식 방법을 제안한다. 기존의 MFCC 특성은 프레임 단위의 평균 특성이기 때문에 시간 해상도가 낮고, 짧은 음소의 특성을 반영하기에는 어렴움이 있다. 반면, 스파이크그램은 청각 모델을 기반으로 샘플 단위로 계산하기 때문에높은 시간 해상도를 가진다. 고 해상도의 스파이크그램을 분석하면 음소 인식에 특화된 특성 벡터를 추출할 수 있다. 추출된 특성으로 심층 신경망을 학습시켜 음소 인식기를 구현하였고, TMIT 테이터 세트로 성능을 평가하였다. 성능 평가를 통하여 스파이크그램 기반의 새로운 시간-주파수 특성을 사용하여 MFCC 특성과 유사한 성능의 음소인식이 가능한 것을 확인하였다.
PDF

Emotion of People with Visual Disability for Enhancing Web Accessibility (웹 접근성 향상을 위한 시각장애인과 일반인의 감성 비교)

Park, Joo-Hyun;Ryoo, Han-Young
- Science of Emotion and Sensibility
- /
- v.11 no.4
- /
- pp.589-598
- /
- 2008
The purpose of this study was to compare the emotional responses of people with visual disability with those of normal people and to understand their similarity or differences in order to apply the new understandings into the future research on Web Accessibility Guidelines. For this purpose, a Web survey system was developed using 15 auditorial stimuli prepared based on the Media Taxonomy and 11 emotion measuring criteria selected from the literature review. After developing the system, emotional responses of 31 people with visual disability and 53 normal people were collected through the Web. The results of the survey showed that the emotional responses of people with visual disability were similar to those of normal people, although there were some exceptional cases. Therefore, it is clear that emotional needs of people with disability should be taken count of in the Web accessibility discussions and further in-depth studies on the emotional characteristics of people with disability are necessary.
PDF

Attentional Effects of Crossmodal Spatial Display using HRTF in Target Detection Tasks (항공 목표물 탐지과제 수행에서 머리전달함수(HRTF)를 이용한 이중감각적 공간 디스플레이의 주의효과)

Lee, Ju-Hwan
- Journal of Advanced Navigation Technology
- /
- v.14 no.4
- /
- pp.571-577
- /
- 2010
Driving aircraft requires extremely complicated and detailed information processing. Pilots perform their tasks by selecting the information relevant to them. In this processing, spatial information presented simultaneously through crossmodal link is advantageous over the one provided in singular sensory mode. In this paper, probability to apply providing visual spatial information along with auditory information to enemy tracking system in aircraft navigation is empirically investigated. The result shows that auditory spatial information, which is virtually created through HRTF is advantageous to visual spatial information alone in attention processing. The findings suggest auditory spatial information along with visual one can be presented through crossmodal link by utilizing stereophonic sound such as HRTF. which is available in the existing simple stereo system.
PDF KSCI

Search Result 331, Processing Time 0.034 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)