Search | Korea Science

Enhaced 2.4 kbps Harmonic Stochastic Excitation Coding for Time/Frequency Transitional Speech (시간/주파수 전이신호를 위한 향상된 2.4 kbps 하모닉 스토케스틱 여기 음성 부호화 방법)

김종학;이인성
- The Journal of the Acoustical Society of Korea
- /
- v.19 no.7
- /
- pp.53-58
- /
- 2000
본 논문은 주파수 전이신호와 시간 전이 신호에 대해서 고조파 잡음 여기 방법과 시간 분리 여기 방법을 적용한 2.4 kbps 음성부호화 방법을 제안한다. 혼합 여기 부호화 방법은 주기 신호와 비 주기 신호를 효과적으로 표현하기 위해 하모닉 잡음 모델을 사용한다. 혼합신호에 대한 잡음 성분은 캡스트럴 분석 방법을 사용함으로써 추출되고, AR (Autoregressive Model) 모델에 의해 표현된다. 시간 전이구간 신호에서의 모호한 음성을 효과적으로 제거하기 위한 또 다른 방법이 제안된다. 제안된 시간 분리 방법은 시간 에너지 변화정도를 관찰함으로써 전이 시점을 감지하고 다른 시간 길이를 가지는 두 블록으로 분리하여 분석한다. 시간 분리 방법은 분석을 위한 비대칭 윈도우와 합성에서의 위상 합성 방법을 포함한다. 제안된 방법을 사용한 2.4 kbps 음성부호화 방법은 주관적 음질 평가에서 전이구간에서의 지각적 음질의 향상을 보여주었으며, 원본 음성 스펙트럼과의 고조파 비 매칭에 의한 윙윙거리는 기계적인 잡음을 감소시킨다.
PDF

Ultrasonic Image Reconstruction using Mode-Converted Rayleigh Wave (파형 변환된 레이리파를 이용한 초음파영상복원)

Suh Dong-Man
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.403-408
- /
- 1999
In this paper, ultrasonic tomography by the Mode-Converted Rayleigh wave (MCRW) in the back-scattered direction is presented. When a beam with a short pulse and narrow beam width enters a reflector with smooth surface, in general, two major arrivals can be observed in the output waveform: the specular reflection and the radiation of the MCRW from the reflector surface. The time-delay between the two waves is relatively large and thus can be measured easily. This large time-delay is due to the fact that the MCRW is slower than incident wave. In our method, this large time- delay is used for ultrasonic image reconstruction. To effectively detect the MCRW, the arrayed-receiving transducers are circularly arranged around the transmitter. In addition, a deconvolution method is employed to remove specular echo signals for reconstructing the MCRW image.
PDF

Speech Unit Concatenation by Phase Succession in an ABS/OLA Sinusoidal Model (ABS/OLA Sinusoidal 모델에서 위상계승을 이용한 단위음성의 연결)

Bae Jae-Hyun;Byeon Heo-Jin;Oh Yung-Hwan
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.11-14
- /
- 1999
본 논문에서는 중첩가산 Sinusoidal 합성방식에서 매칭된 정현파별로 위상을 계승하는 단위음성 연결방법을 제안한다. 선행 단위음의 마지막 프레임, 후행 단위음의 첫 프레임, 후행 단위음의 나머지 프레임의 단계로 나누어 각 단계마다 제안한 방식으로 선행 프레임의 위상을 계승하였다. 실험결과 후행 단위음의 연결 위치를 이동하는 기존의 방식을 사용한 연결음에 비해 연결부분에서 음성파형의 급격한 변화가 줄었다.
PDF

A Study on Standing Wave Type Ultrasonic Linear Motors (정재파형 초음파 리니어 모터에 관한 연구)

권재화;이수성;강국진;노용래
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.8
- /
- pp.38-43
- /
- 2001
We developed a new standing wave type ultrasonic linear motor that can be driven bi-directionally. The operation principle of the motor was derived in an analytical form and the detailed structure was designed by the finite element method. Based on the design, a motor sample and a driving circuit were fabricated, and validity of the structure was verified through experiments.
PDF

Emproving the resolution of the finite-length sinusoids burried in noise (잡음에 의해 손상된 유한 구간 정현파 추정의 해상도 개선책)

Shin, Yoon-Ki
- The Journal of the Acoustical Society of Korea
- /
- v.16 no.5
- /
- pp.122-129
- /
- 1997
In the signal processing fields, sinusoidal wave is of much meaning because it may carry other important informations. But in reality due to the finite number of sensors along with the noise detected by the sensors, the resolution of frequency detection is in general much degraded. In this paper, new method is proposed to embrove the frequency resolution of the finite-length sinusoids burried in noise.
PDF

An Experimental Speech Translation System for Hotel Reservation (호텔예약을 위한 자동통역 시스템)

구명완;김웅인;김재인;도삼주;강용범;박상규;손일현;김우성;장두성
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1995.06a
- /
- pp.105-108
- /
- 1995
한국에 있는 손님이 한국어 만을 사용하여 일본 호텔을 예약할 수 있도록 해 주는 한일간 자동통역 시연 시스템에 관해 기술하였다. 이 시스템은 한국어 음성인식부, 한일 기계번역부, 한국어 음성합성부로 구성되어 있다. 한국어 음성인식부는 기본적으로 HMM을 이용하는 화자독립, 약 300단어급 연속음성인식 시스템으로서 전향 언어 모델로 바이그램 언어 모델, 후향 언어 모델로는 의존 문법을 사용하여 N-BEST 문장을 생성해낸다. 실험결과, 단어 인식률은 top1 문장에 대해 약 94.5%, top5 문장에 대해 약 94.7%의 인식률을 얻었다. 인식 시간은 길이가 다른 여러 문장들에 대해 약 0.1~3초가 걸렸다. 기계번역부에서는 음성인식에서 의존 문법을 사용하여 분석된 파싱 결과를 이용, 직접 번역 방식을 채택하여 일본어를 생성한다. 음성 합성부는 반음소를 합서의 기본단위로 하고, 합성방식으로는 주기 파형 분해 및 재배치 방식으로 하였다. 실험 환경은 2 CPU를 장착한 SPARC 20 workstation 이었으며 실시간 특징 추출을 위해 TMS320C30 DSP 보드 1개를 이용하였다.
PDF

The Study on the Expential Smoothing Method of the Concatenation Parts in the Speech Waveform (음성 파형분절의 지수함수 스므딩 기법에 관한 연구)

박찬수
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1991.06a
- /
- pp.7-10
- /
- 1991
In a text-to-speech system, sound units (phonemes, words, or phrases, etc.) can be concatenated together to produce required utterance. The quality of the resulting speech is dependent on factors including the phonological/prosodic contour, the quality of basic concatenation units, and how well the units join together. Thus although the quality of each basic sound unit is high, if occur the discontinuity in the concatenation part then the quality of synthesis speech is decrease. To solve this problem, a smoothing operation should be carried out in concatenation parts. But a major problem is that, as yet, no method of parameter smoothing is available for joining the segment together. Thus in this paper, we proposed a new aigorithm that smoothing the unnatural discountinuous parts which can be occured in speech waveform editing. This algorithm used the exponential smoothing method.
PDF

The Smoothing Method of the Concatenation Parts in Speech Waveform by using the Forward/Backward LPC Technique (전, 후방향 LPC법에 의한 음성 파형분절의 연결부분 스므딩법)

이미숙
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1991.06a
- /
- pp.15-20
- /
- 1991
In a text-to-speech system, sound units (e. q., phonemes, words, or phrases) can be concatenated together to produce required utterance. The quality of the resulting speech is dependent on factors including the phonological/prosodic contour, the quality of basic concatenation units, and how well the units join together. Thus although the quality of each basic sound unit is high, if occur the discontinuity in the concatenation part then the quality of synthesis speech is decrease. To solve this problem, a smoothing operation should be carried out in concatenation parts. But a major problem is that, as yet, no method of parameter smoothing is availalbe for joining the segment together.
PDF

On Altering the Pitch of Speech Signals in Waveform Coding -(Altering Method by the LPC and the Pitch Halving)- (음성 파형코딩의 음원피치 변경에 관한 연구 - LPC와 주기반분법에 의한 피치변경법 -)

민경중
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1991.06a
- /
- pp.45-49
- /
- 1991
In area of the speech synthesis, the waveform coding with high quality are mainly used to the synthesis by analysis. However, it is difficult to applying the waveform coding to the synthesis by rule, because the parameters of this coding are not classified as either excitation parameters and vocal tract parameters. In this paper, we proposed a new pitch change method that can alter the pitch periods in the waveform coding. The proposed method expands the pitch period by the LPC synthesis method, and then the period is compressed by the waveform halving technique. Thus, it is possible that the waveform coding is carried out the synthesis by rule in speech processing.
PDF

An End Point Detection Technique Using the LSP Distance in EVRC Packets (EVRC 패킷에서 LSP 거리를 이용한 음성 끝점 검출)

민병준;강명수
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.6
- /
- pp.44-48
- /
- 1999
This paper presents a simple and fast method for end point detection under low-level noisy environment. The proposed algorithm uses a threshold logic with LSP distances and takes vocoded packets as input to the recognition system. The results from the proposed method are compared with those manually checked in decoded speeches. From the result it exhibits acceptable accuracy.
PDF

Search Result 191, Processing Time 0.025 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)