Integrated Search | Korea Science

CASA-based Front-end Using Two-channel Speech for the Performance Improvement of Speech Recognition in Noisy Environments

박지훈;윤재삼;김홍국
- 대한전자공학회:학술대회논문집 / 대한전자공학회 2007년도 하계종합학술대회 논문집 / pp.289-290 / 2007
To improve the performance of a speech recognition system in the presence of noise, we propose a noise-robust front-end that uses two-channel speech signals and separates speech from noise based on computational auditory scene analysis (CASA). The main cues for the separation are the interaural time difference (ITD) and the interaural level difference (ILD) between the two channel signals. As a result, 39 cepstral coefficients are extracted from the separated speech components. Speech recognition experiments show that the proposed front-end outperforms the ETSI front-end with single-channel speech.
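The two cues named in this abstract can be illustrated with a minimal sketch. It assumes two equal-length time-domain frames, estimates the ITD from the peak of the cross-correlation, and estimates the ILD from the energy ratio. The function name `estimate_itd_ild` and the broadband (non-filterbank) processing are assumptions made for illustration, not details taken from the paper.

```python
import numpy as np

def estimate_itd_ild(left, right, sample_rate):
    """Estimate ITD (seconds) and ILD (dB) for one pair of equal-length frames.

    Sign convention (an assumption of this sketch): a positive ITD means the
    left channel lags the right channel; a positive ILD means the left channel
    carries more energy.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)

    # Full cross-correlation; the lag of the peak is the inter-channel delay.
    xcorr = np.correlate(left, right, mode="full")
    lags = np.arange(-(len(right) - 1), len(left))
    lag = lags[np.argmax(xcorr)]
    itd = lag / float(sample_rate)

    # Level difference as an energy ratio in dB (eps avoids log of zero).
    eps = 1e-12
    ild = 10.0 * np.log10((np.sum(left ** 2) + eps) / (np.sum(right ** 2) + eps))
    return itd, ild
```

In a CASA-style front-end these cues would typically be computed per frequency channel and per frame, and time-frequency units whose cues match the target direction would be kept; that grouping step is not shown here.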

User Needs for Haptic Communication of VR Fashion Product Shopping

Kim, Jongsun;Ha, Jisoo
- 한국의류산업학회지 / Vol. 21, No. 4 / pp.401-411 / 2019
Rapid changes in the fashion environment are increasing the need to judge and evaluate products without contact, which makes it urgent to implement services that let users judge and experience the tactile qualities of a fashion product without touching it. Technological development is required to provide users with synesthetic experiences that integrate the visual, tactile, and auditory senses. Research is also needed to increase the immersion of ICT-related experiences communicated through fashion images. This study analyzed Korean users' demands for haptic communication technology in immersive VR fashion product shopping. It defined haptic communication through a literature review, investigated immersion in the VR environment, and conducted in-depth interviews on haptic communication applicable to VR shopping. The findings show that hedonic reactions involving fantasy, emotion, and fun are an important motive for choosing VR shopping. Based on the offline store shopping experience, VR fashion product shopping was divided into four steps: moving to the store, searching in the store, searching for a product, and purchasing. The study defined haptic communication for each step and analyzed the types of haptic feedback to be implemented. The results provide basic data for developing haptic communication technology that can enhance the sense of presence and immersion, and they can help lay the groundwork for pilot studies on the convergence of the virtual and the real.
https://doi.org/10.5805/SFTI.2019.21.4.401

Communication of Young Black-Tailed Gulls, Larus crassirostris, in Response to Parents' Behavior

Chung, Hoon;Cheong, Seok-Wan;Park, Shi-Ryong
- Animal cells and systems / Vol. 8, No. 4 / pp.295-300 / 2004
In the breeding colony of the black-tailed gull, the nests of conspecific neighbors are located very close together, so chicks are constantly exposed to sound and visual stimuli produced by adult conspecifics approaching their nests. The chicks may therefore need to learn how to respond appropriately to their parents' approach. In this study we experimentally manipulated the sensory stimulation that parents potentially provide to their offspring. Chicks incubated in the laboratory were exposed to a mew call of a conspecific adult. They were then tested in three situations differing in sensory stimulation: 1) visual stimulation only, 2) auditory stimulation only, and 3) simultaneous visual and auditory stimulation. We observed the chicks' responses, which were categorized into three behaviors (begging call response, chirirah call, and pecking behavior). We also investigated the intensity of the chicks' calls in response to the different stimulations and the degree of response with age. The chicks exposed to auditory stimulation only made significantly more chirirah calls. The intensities (dB) of the mew call and the chicks' chirirah call were directly correlated. On the other hand, when chicks only saw the stuffed adult gull, they responded significantly more with a begging call and pecking behavior. Under costimulation, the chicks responded with a begging call and pecking, but less frequently than under visual stimulation only. The results suggest that young black-tailed gulls use call repertoires to respond properly to their parents' behavior. Such results suggest an evolutionary process for increasing their survival rate in a group breeding site.

The Effect of Parent Involvement Auditory Training Program on Communication Ability of Children with Hearing Impairments

채정희;허명진;박찬희
- 수산해양교육연구 / Vol. 28, No. 3 / pp.818-830 / 2016
The purpose of this study is to examine how a parent listening guidance program, which helps parents understand their hearing-impaired children and how to practice listening at home, affects the communication skills of hearing-impaired children. The subjects were three hearing-impaired children without accompanying intellectual, emotional, or behavioral problems, and their parents received listening guidance through the program for 3 months. Changes in the children's communication skills were observed by comparing them before and after the guidance. First, the receptive language skills of the hearing-impaired children improved after the parent listening guidance. Second, their expressive language skills also improved after the guidance. Third, in the children's communication behavior, phonation and speech production increased together with gesture after the guidance. In conclusion, the parent listening guidance program appears to have a positive influence on the communication behavior of hearing-impaired children.
https://doi.org/10.13000/JFMSE.2016.28.3.818

Speech Enhancement in Noisy Speech Using Neural Network

최재승
- 대한전자공학회논문지SP / Vol. 42, No. 5 / pp.165-172 / 2005
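When speech recognition is performed in a noisy environment, a system that removes the noise and enhances the speech is required. Simulating the human auditory system, which is an excellent spectral analysis mechanism, is therefore effective for speech enhancement. As one way of realizing this, we propose a method that adaptively applies the auditory mechanism known as lateral inhibition. In this method, a neural network estimates the noise level, and for each frame the lateral inhibition coefficient and the amplitude component adjustment coefficient are adjusted adaptively according to that level in order to enhance the speech. Evaluation with a spectral distortion measure confirms that the method is effective not only for white noise but also for colored noise and the driving noise of a car.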

Isolated-Word Speech Recognition in Telephone Environment Using Perceptual Auditory Characteristic

최형기;박기영;김종교
- 대한전자공학회논문지TE / Vol. 39, No. 2 / pp.60-65 / 2002
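To improve the speech recognition rate, this paper proposes GFCC (gammatone filter frequency cepstrum coefficients), a parameter based on auditory characteristics, as a speech feature parameter. Recognition experiments were performed on isolated words obtained over the telephone network. For performance comparison, recognition experiments were also carried out using MFCC (mel frequency cepstrum coefficients) and LPCC (linear predictive cepstrum coefficient). In addition, for each parameter, experiments were run both with and without CMS (cepstral mean subtraction), which was introduced as a compensation technique for channel distortion in the telephone network. The experiments showed that recognition using GFCC gave improved results compared with the methods using the other parameters.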
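The channel compensation step mentioned in this abstract, CMS, is simple enough to sketch. A time-invariant channel multiplies the spectrum, which becomes an additive constant in the cepstral domain, so subtracting the per-utterance mean of each coefficient removes it. The function name and the `(num_frames, num_coeffs)` array layout below are assumptions for illustration; the same operation applies whether the features are GFCC, MFCC, or LPCC.

```python
import numpy as np

def cepstral_mean_subtraction(cepstra):
    """Subtract the per-utterance mean from each cepstral coefficient.

    cepstra: array of shape (num_frames, num_coeffs) for one utterance.
    Returns a new array with zero mean along the frame axis.
    """
    cepstra = np.asarray(cepstra, dtype=float)
    return cepstra - cepstra.mean(axis=0, keepdims=True)
```

Because the mean is removed per utterance, adding any constant offset to all frames (the cepstral-domain effect of a fixed channel) leaves the normalized features unchanged.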

Analysis of Nonlinear Time Series by Bispectrum Methods and its Applications

김응수;이유정
- 한국정보처리학회논문지 / Vol. 6, No. 5 / pp.1312-1322 / 1999
Linear behavior, which is regular, predictable, and independent of temporal order, accounts for only a very small part of natural phenomena. In fact, the signals generated by the natural phenomena we encounter show only slight linearity. It is therefore very difficult to understand and analyze natural phenomena with predictable, regular linear systems alone. For these reasons, research on nonlinear signals, which used to be excluded from analysis as noise, is being actively carried out. The many signals generated by a nonlinear system carry information about that system, and the information obtained by analyzing them can be used effectively in many fields. In this paper we therefore use higher-order spectra, in particular the bispectrum. We first verify the validity of the approach by applying the bispectrum to the logistic map, a typical chaotic signal. We then apply it to actual EEG signals recorded under auditory stimuli and show that higher-order spectra are a very useful tool for analyzing nonlinear signals.
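As a rough illustration of the first experiment described in this abstract, the sketch below generates a logistic-map sequence and computes a direct (segment-averaged FFT) bispectrum estimate, B(f1, f2) = E[X(f1) X(f2) X*(f1 + f2)]. The map parameter `r=4.0`, the initial value, the segment length, and the Hanning window are all assumptions chosen for illustration; the abstract does not state which estimator or settings the paper used.

```python
import numpy as np

def logistic_map(n, r=4.0, x0=0.3):
    """Generate n samples of the logistic map x[i] = r * x[i-1] * (1 - x[i-1])."""
    x = np.empty(n)
    x[0] = x0
    for i in range(1, n):
        x[i] = r * x[i - 1] * (1.0 - x[i - 1])
    return x

def bispectrum(signal, seg_len=64):
    """Direct bispectrum estimate averaged over non-overlapping segments.

    Returns a complex array of shape (seg_len // 2, seg_len // 2) whose
    entry [f1, f2] estimates E[X(f1) * X(f2) * conj(X(f1 + f2))].
    """
    signal = np.asarray(signal, dtype=float)
    num_segs = len(signal) // seg_len
    segs = signal[: num_segs * seg_len].reshape(num_segs, seg_len)
    segs = segs - segs.mean(axis=1, keepdims=True)  # remove the DC component
    spectra = np.fft.fft(segs * np.hanning(seg_len), axis=1)

    half = seg_len // 2
    f = np.arange(half)
    idx_sum = (f[:, None] + f[None, :]) % seg_len   # bin index of f1 + f2
    triple = (spectra[:, f][:, :, None]
              * spectra[:, f][:, None, :]
              * np.conj(spectra[:, idx_sum]))
    return triple.mean(axis=0)
```

A typical use would be `b = bispectrum(logistic_map(4096))` followed by inspecting `np.abs(b)`. For a linear Gaussian process the bispectrum is theoretically zero, so pronounced structure in the estimate is taken as evidence of nonlinearity.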

A Research on the Characteristics of Virtual Reality Stores -Focused on Hyundai VR Store and eBay VR Department Store-

장주연;전재훈
- 한국의류학회지 / Vol. 42, No. 4 / pp.671-688 / 2018
This study investigates the characteristics of VR stores that have emerged as new fashion communication media. Two case studies, on the Hyundai VR store and the eBay VR Department store, were conducted along with a discussion of the function and meaning of the fashion VR store. The results showed that both stores provide novel shopping experiences; however, the two differed in production method and level of technology implementation. Functional aspects such as shopping efficiency and purchasing service were insufficient in both stores. Instead, the stores compensated through product rotation, a recommendation system, voice guidance, or linkage with an online shopping mall. In experiential aspects, both stores provided a strong sense of immersion. The Hyundai VR store enhanced immersion with a high-resolution image of a real offline store; however, it lacked the ability to provide multisensory stimulation such as kinetic sense or auditory stimulation. The eBay VR Department store intensified the immersion experience by providing auditory stimulation as well as visual stimulation that enhanced the sense of speed and distance through animation. However, the extent of the experience was limited in terms of agency and transformation because of the low interactivity found in both store systems.
https://doi.org/10.5850/JKSCT.2018.42.4.671

Speech Segmentation using Weighted Cross-correlation in CASA System

김정호;강철호
- 전자공학회논문지 / Vol. 51, No. 5 / pp.188-194 / 2014
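Feature extraction in a computational auditory scene analysis (CASA) system builds a correlation map (correlogram) of auditory elements using temporal continuity and the similarity between frequency channels. Segmentation builds a binary mask using the cross-correlation function, and components with mask value 1 (speech) share the same periodicity and synchrony. However, when channels have similar periodicity but a delay between them, they are wrongly classified as speech. This paper proposes a method that uses a weighted cross-correlation in segmentation to make the inter-channel similarity more discriminative. To evaluate the speech separation performance of the CASA system, experiments were carried out in background noise (siren, machine, white, car, crowd) at signal-to-noise ratios of 5 dB and 0 dB. Compared with the conventional method, the proposed method improved performance by 2.75 dB at an SNR of 5 dB and by 4.84 dB at an SNR of 0 dB.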
https://doi.org/10.5573/ieie.2014.51.5.188
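The failure case described in this abstract, two channels with the same periodicity but a relative delay, can be reproduced in a small sketch. It assumes the common CASA formulation in which adjacent channels are compared through their normalized autocorrelations, which discard phase and therefore cannot see a delay. The weighting shown (`weighted_correlation`, based on zero-lag waveform similarity) is purely hypothetical and is not the weighting proposed in the paper; it only illustrates how a weight can restore discrimination.

```python
import numpy as np

def normalized_autocorr(x, max_lag):
    """Autocorrelation of x for lags 0..max_lag-1, normalized by the lag-0 value."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    acf = np.array([np.dot(x[: len(x) - k], x[k:]) for k in range(max_lag)])
    return acf / (acf[0] + 1e-12)

def cross_channel_correlation(a, b, max_lag=200):
    """Baseline similarity: correlation between the two channels' autocorrelations."""
    ra = normalized_autocorr(a, max_lag)
    rb = normalized_autocorr(b, max_lag)
    ra = ra - ra.mean()
    rb = rb - rb.mean()
    return float(np.dot(ra, rb) / (np.linalg.norm(ra) * np.linalg.norm(rb) + 1e-12))

def zero_lag_similarity(a, b):
    """Normalized zero-lag correlation of the waveforms (sensitive to delay)."""
    a = np.asarray(a, dtype=float) - np.mean(a)
    b = np.asarray(b, dtype=float) - np.mean(b)
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b) + 1e-12))

def weighted_correlation(a, b, max_lag=200):
    """Hypothetical weighting (not from the paper): scale the baseline
    similarity by the non-negative part of the zero-lag waveform similarity."""
    weight = max(0.0, zero_lag_similarity(a, b))
    return weight * cross_channel_correlation(a, b, max_lag)
```

With two 200 Hz sinusoids offset by a quarter period, the baseline similarity stays close to 1 while the weighted value drops toward 0, so a binary mask thresholded on the weighted value would no longer group the delayed channel with the speech.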

Pattern classification of the synchronized EEG records by an auditory stimulus for human-computer interface

이용희;최천호
- 한국정보통신학회논문지 / Vol. 12, No. 12 / pp.2349-2356 / 2008
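This paper presents a method for measuring and effectively classifying the pure EEG produced by the mental brain activity of healthy subjects. The subject is asked to think about a specific task, the EEG is recorded in synchrony with an auditory event, and the effects of baseline drift and physiological artifacts in the recorded EEG are reduced. Finally, only the signal related to the mental task is extracted by averaging over trials, and recognition is performed using AR coefficients. In the experiments, an auditory stimulus was used as the event, and the EEG was recorded from three channels: $C_3-A_1$, $C_4-A_2$, and $P_Z-A_1$. After averaging 16 trials per channel, features for the specific mental task were extracted as 12th-order AR coefficients. The resulting 36 feature coefficients were used as the input to a neural network, with 50 trials per task used as training data. The proposed method achieved recognition rates of 34-92% for two types of task and 38-54% for four types of task.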
https://doi.org/10.6109/jkiice.2008.12.12.2349
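The feature pipeline in this abstract (trial averaging per channel, then 12th-order AR coefficients, giving 3 x 12 = 36 features) can be sketched as follows. The Yule-Walker estimator, the function names, and the `(num_channels, num_trials, num_samples)` array layout are assumptions for illustration; the abstract does not say which AR estimation method was used, and the baseline and artifact reduction steps are omitted.

```python
import numpy as np

def average_trials(trials):
    """Average stimulus-locked trials: (num_trials, num_samples) -> (num_samples,)."""
    return np.asarray(trials, dtype=float).mean(axis=0)

def ar_coefficients(x, order=12):
    """Yule-Walker estimate of a[1..order] in x[n] = sum_k a[k] * x[n-k] + e[n]."""
    x = np.asarray(x, dtype=float)
    x = x - x.mean()
    n = len(x)
    # Biased autocorrelation estimates for lags 0..order.
    r = np.array([np.dot(x[: n - k], x[k:]) / n for k in range(order + 1)])
    # Toeplitz autocorrelation matrix of the Yule-Walker equations.
    toeplitz = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    return np.linalg.solve(toeplitz, r[1:])

def feature_vector(channel_trials, order=12):
    """Concatenate AR coefficients of the averaged signal of every channel.

    channel_trials: array of shape (num_channels, num_trials, num_samples).
    With 3 channels and order 12 this yields the 36 features in the abstract.
    """
    return np.concatenate(
        [ar_coefficients(average_trials(trials), order) for trials in channel_trials]
    )
```

The resulting vector would then be fed to a classifier such as the neural network mentioned in the abstract, which is not shown here.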
