Search | Korea Science

Study of the Noise Processing to Technique Speech Recognition System (음성인식 시스템에서의 잡음 제거 개선에 관한 연구)

이창윤;이영훈
- Journal of the Korea Society of Computer and Information
- /
- v.7 no.2
- /
- pp.73-78
- /
- 2002
Recognition system of noise processing technique. A method combining SNR normalization with RAS is considered as a noise Processing and the performance of the speech recognition system can be improved using other noise processing technique. Experiment of recognition system is the internal organs that using a general digital signal processor(TMS320C31). Recognition word set is composed of 60 command words for of Rce environment and order of computer. Simulation is considered as a colored noise of general environment. The results of experiment showed that the recognition word set gives 94.61% of efficiency of recognition at maximum in case of the combination of SNR normalization and spectral subtraction.
PDF

Signal Processing for Speech Recognition in Noisy Environment (잡음 환경에서 음성 인식을 위한 신호처리)

Kim, Weon-Goo;Lim, Yong-Hoon;Cha, Il-Whan;Youn, Dae-Hee
- The Journal of the Acoustical Society of Korea
- /
- v.11 no.2
- /
- pp.73-84
- /
- 1992
This paper studies noise subtraction methods and distance measures for speech recognition in a noisy environment, and investigates noise robustness of the distance measures applied to the problem of isolated word recognition in white Gaussian and colored noise (vehicle noise) environments. Noise subtraction methods which can be used as a pre-processor for the speech recognition system, such as the spectral subtraction method, autocorrelation subtraction method, adaptive noise cancellation and acoustic beamforming are studied, and distance measures such and Log Likelihood Ratio ($d_{LLR}$), cepstral distance measure ($d_{CEP}$), weighted cepstral distance measure ($d_{WCEP}$), spectral slope distance measure ($d_{RPS}$) and cepstral projection distance measure ($d_{CP},\;d_{BCP},\;d_{WCP},\;d_{BWCP}$) are also investigated. Testing of the distance measures for speaker-dependent isolated word recognition in a noisy environment indicate that $d_{RPS}\;and\;d_{WCEP}$ which weigh higher order cepstral coefficients more heavily give considerable performance improvement over $d_{CEP}and\;d_{LLR}$. In addition, when no pre-emphasis is performed, the recognizer can maintain higher performance under high noise conditions.
PDF

Performance Model and Analysis for Improving Efficient Packet Service of GGSN in CPRS Network (GPRS 망에서 GGSN 노드의 패킷 처리 향상을 위한 성능 모델 및 분석)

Kwak, Yong-Won;Min, Jae-Hong;Jeong, Young-Sic;Park, Wung
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2002.11a
- /
- pp.826-834
- /
- 2002
Asynchronous third generation mobile communication system is able to service Packet Switching through adding GPRS Network to the second generation system GSM. Therefore, it is necessary to study packet traffic service of GGSN node which is due to perform gateway role that GPRS Network is enable to inter-connect with Internet in order to optimize the capability and performance of GGSN. In this paper, the Internet packet traffic model that it is arrived to GGSN node from the Internet is studied and In order to process the Inter traffic efficiently, performance analysis model in GGSN is proposed to optimize packet processing capability of each processor. In order to guarantee QoS requirement of the real time traffic Speech and Video, several scheduling algorithm is applied to performance model and each mechanism is compared with several performance parameters.
PDF

Real-time implementation of the G.728 speech codec using the Vincent6 DSP core (Vincent6 DSP코어를 이용한 G.728 음성 부호화기의 실시간 구현)

성호상
- Proceedings of the IEEK Conference
- /
- 2000.09a
- /
- pp.131-135
- /
- 2000
본 논문에서는 고성능 고정 소수점 DSP (Digital Signal Processor) 코어인 Vincent6 코어 [1]를 이용하여 ITU-T C.728 음성 부호화기를 실시간으로 구현하였다 G.728 은 16 kb/s전송률의 ITU-T표준 음성 부호화기이며, 입력신호는 8 kHz로 샘플링되며 샘플 당 16 bit 로 양자화된 PCM 신호이다. G.728 은 LD-CELP(Low Delay Code Excited Linear Prediction)라고도 하며, 알고리 듬 delay는 0.625ms 이다. Vincent6 DSP core 는 VLIW (Very-Long Instruction Word) 특성을 가지므로 다중 명령 (multiple instruction)을 수행할 수 있다 이를 위해서 G.728 annex G를 이용하여 고정 소숫점 연산으로 코드를 작성한 후, 이를 vincent6 어셈블리 코드로 구현하였다. 최종적으로 구현된 코드는 ITU-T 의 test vector 에 대 해 bit exact 한 결과를 보이며 34 MCPS (Million Cycles Per Second)의 계산량을 가지며 사용 메모리크기는 데이터 메모리가 약 9KByte, 프로그램 메모리가 약 57 KByte 이다.
PDF

Implementation of adaptive speech enhancement system using TMS320C6413 DSP processor (TMS320C6413 DSP프로세서를 이용한 적응 음질개선 시스템의 구현에 관한 연구)

Lee Young-Il;Lee Soon-Reyo;Shin Yoon-Ki;Choi Hong-Sub
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.101-104
- /
- 2004
본 논문에서는 보상기를 채용하여 안정성을 확보한 적응순환필터인 ACHARF(Adaptive Compensated Hyperstable Adaptive Recursive Filter)를 사용하여 잡음제거를 통한 음성의 음질개선을 DSP 프로세서를 통하여 구현하였다. 실험에서는 TI사의 최신 DSP 프로세서인 TMS320C6413와 스테레오 오디오 코덱인 TLV320AIC23을 탑재한 Evaluation board를 사용하였다. 2개의 입력마이크를 이용하여 음성신호와 기준 잡음신호를 별도로 수집하여 알고리즘을 수행하였으며, 실험 결과로 음질개선 효과를 확인할 수 있었다. 본 연구를 통해서 시스템의 성능개선의 핵심은 입력으로 들어오는 음성신호와의 상관도가 가능한 적은 잡음신호를 수집하는 방법이라 생각되며 앞으로 이에 대한 연구가 필요하겠다.
PDF

Implementation and Performance Analysis of a Speaker Verification System (화자 확인 시스템의 설계 제작 및 성능 분석)

권석규;이병기
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.30B no.3
- /
- pp.1-9
- /
- 1993
This paper discusses issues on the disign and implementation of real-time automatic speaker verification system, as well as the performance analysis of the implemented system. The system employs TI's TMS320C25 digital signal processor TMS320C25 and high speed SRAMs. The system is designed to be used stand-alone as well as via hand-shaking with IBM-PC. The speech parameters used for speaker verification are PARCOR and LPC-cepstrum coefficients, and the employed decision logics are those based on the generalized weighted distance comcept. The implemented system showed the performance of 5.3% error rate for the PARCOR coefficient, and 4.7% error rate for the LPG-cepstrum coefficient.
PDF

Usage of the Korean Phonetic Alphabet on PC Wordprocessing (컴퓨터를 이용한 한글음성문자(KPA)의 활용)

Lee H. B.;Jung I. J.;Joh W. I.
- Proceedings of the KSPS conference
- /
- 1996.10a
- /
- pp.320-322
- /
- 1996
The Korean Phonetic Alphabet(KPA) as devised by H. B. Lee on the basis of Han-geul, the Korean Alphabet, was incorporated into the Hangul Word Processor(HWP) 1. $^{*}$ to be used on personal computers. With the upgrading of the HWP software from $1.^{*}$ to more sophisticated versions of $2.^{*}$, $3.^{*}$, etc., it became necessary to convert the HWP $1.^{*}$ KPA into upgraded version. This paper traces the history of the computerized KPA software from the initial version of HWP $1.^{*}$ to the latest one.
PDF

Fixed-point Implementation of LPD Decoder in MPEG-D USAC (MPEG-D USAC : LPD 복호화기의 고정 소수점 알고리즘 구현)

Song, Eunwoo;Song, Jeongook;Kang, Hong-Goo
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2012.07a
- /
- pp.254-256
- /
- 2012
본 논문에서는 MPEG-D 오디오 서브그룹에서 진행 중인 Unified Speech and Audio Coding (USAC) 표준의 Linear Prediction Domain (LPD) 복호화기 모듈을 고정소수점 알고리즘으로 제안한다. USAC 부호화기는 두 개의 최신 음성-오디오 부호화기가 융합된 형태로, 음성 및 오디오 신호에 대하여 우수한 성능을 갖는 부호화기이다. USAC의 표준 완료와 본격적인 서비스화에 앞서서 USAC LPD 복호화기의 구조적인 특성을 분석하고, Digital Signal Processor (DSP)구현을 위한 LPD 복호화기의 고정소수점 알고리즘을 구축하는 동시에 모듈의 복잡도를 측정하고자 한다. 또한 고정소수점 알고리즘으로 구현된 LPD 복호화기와 기존의 부동소수점 복호화기의 성능을 비교하고, LPD 복호화기의 두 가지 부호화 모드에 따른 복잡도 이슈를 다루도록 한다.
PDF

Speech Recognition System for Home Automation Using DSP (DSP를 이용한 홈 오토메이션용 음성인식 시스템의 실시간 구현)

Kim I-Jae;Kim Jun-sung;Yang Sung-il;Kwon Y.
- Proceedings of the Acoustical Society of Korea Conference
- /
- autumn
- /
- pp.171-174
- /
- 2000
본 논문에서는 홈 오토메이션 시스템을 음성인식을 도입하여 설계하였다. 많은 계산량과 방대한 양의 데이터의 처리를 요구하는 음성인식을 DSP(Digital Signal Processor)를 통하여 구현해 보고자 본 연구를 수행하였다. 이를 위해 실시간 끝점검출기를 이용하여 추가의 입력장치가 필요하지 않도록 시스템을 구성하였다. 특징벡터로는 LPC로부터 유도한 10차의 cepstrum과 log 스케일 에너지를 이용하였고, 음소수에 따라 상태의 수를 다르게 구성한 DHMM(Discrete Hidden Marcov Model)을 인식기로 사용하였다. 인식단어는 가정 자동화를 위하여 많이 쓰일 수 있는 10개의 단어를 선택하여 화자 독립으로 인식을 수행하였다. 또한 단어가 인식이 되면 인식된 단어에 대해서 현재의 상태를 음성으로 알려주고 이에 대해 자동으로 실행하도록 시스템을 구성하였다.
PDF

Real-Time Implementation of a SBC Codec Using a NEC 7720 DSP (NEC 7720 DSP를 이용한 SBC codec의 실시간 구현)

Oh, Soo Hwan;Lee, Sang Uk
- Journal of the Korean Institute of Telematics and Electronics
- /
- v.23 no.4
- /
- pp.429-438
- /
- 1986
In this paper we have designed and implemented a real-time, full-duplex SBC (sub-band coding) codec at 16kbps using a high speed digital signal processor, NEC 7720. The SBC codec employs a QMF(quadrature mirror filter) filter bank based on the tree structures of two-band analysis-synthesis pairs to partition speech signal into 4 octabe bands. Computer simulation has been done to investigate the effect of fixed-point computation of the NEC 7720. Three different performance measures, the conventional signal-to-noise ratio, the informal listening test, and an LPC(linear predictive coding)distance measure, have been used in this simulation. The necessary parameters have been optimized through the simulation. The developed hardware and software have been tested in real-time operation using a hardware emulator.
PDF

Search Result 94, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)