Search | Korea Science

A study on the phonemic feature changes according to Korean speech waveform edition (한국어 음성 파형의 편집에 의한 한국어 음운 변화에 관한 연구)

Kim, Seon-Il;Hong, Ki-Won;Lee, Haing-Sei
- The Journal of the Acoustical Society of Korea
- /
- v.13 no.6
- /
- pp.60-65
- /
- 1994
A study on phonemic feature changes is accomplished by human perception of the discrimination of the phonemic feature of Korean edited speech waveform which is partially elimination or exchange. We found that speech waveforms has tarnsitional, stationary. equivalent and critical phonemic parts.
PDF

On the Reduction of Pitch Search Time for G.723.1 Using the Skipping Technique (G.723.1에서 Skipping Technique을 이용한 피치검색시간 단축에 관한 연구)

김정진
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1998.06e
- /
- pp.285-288
- /
- 1998
G.723.1은 저 전송률 환경에서 고음질을 제공하여 주고 있으나 CELP형 부호화기가 갖는 합성에 의한 분석(analysis by synthesis) 방식의 구조로 인해 많은 처리 시간과 계산량을 요구하게 된다. 본 논문에서는 G.723.1에 대해 skipping 기법을 이용하여 피치 검색과정이 계산량을 줄여 부호화기의 전체 처리 시간을 감소시키는 방법을 제안하였다. 예측 피치를 찾기 위한 개회로 피치 예측(open loop pitch estimation) 과정에서 계산량을 줄이기 위해 skipping 기법을 사용하였다. 피치 예측 과정시 상관관계를 파형은 양과 음의 파형이 교대로 나타나는 특징을 가지고 있기 때문에 계산시 음의 파형을 생략하는 방법을 사용하였다. 실제 음성시료에 대해 제안한 피치 검색법을 적용하였을 때 부호화시 평균 처리시간은 약 10%정도 감소하였으며 기존 G.723.1과 제안한 방법을 적용한 G.723.1의 음질 비교를 위하여 MOS 평가를 했을 때 기존의 방법이 평균 3.76인데 비해 제안한 방법의 평균 MOS는 3.73으로 주관적인 음질 저하는 거의 나타나지 않았다.
PDF

The Development of Speech Synthesizer In Korean TTS System (한국어 문어변환 시스템 내에서의 음성 합성기 개발)

강찬희;진용옥
- The Journal of the Acoustical Society of Korea
- /
- v.12 no.2
- /
- pp.14-27
- /
- 1993
본 논문은 매 40ms 정도의 음성파형으로부터 추출된 6내지 9ms 정도의 1피치주기 파형을 합성단위로 사용하여 합성시킨 시간영역에서의합성방식을 한국어 문어 변환 시스템내에서의 음성합성기에 적용시킨 연구결과이다. 시험 결과, 4가지 유형의 한국어 음절 합성이 가능하고, 장단강약과 같은 운율요소의 제어가 용이하고, 또한 합성 알고리즘이 간단하여 실시간 처리가 가능하였으나, 문장 단위의 음성을 합성하기 위하여는 문장내에서의 다양한 피치 패턴에 대한 연구와 이의 효율적인 제어에 관한 연구가 이루어져야 할 것이다. 합성음에 대한 평가방법으로는 원음과 합성음에 대한 시간영역에서의 파형비교, 주파수 영역에서의 스펙트럼 포락선 유사성 비교 및 합성음에 대한 청취도 실험을 행하였다.
PDF

On the Use of Pre=-and Post-Filters in Speech Waveform Coding (PRE-FILTER와 POST-FILTER를 사용하여 음성파형 부호화 방법에 관하여)

조동호;은종관;김제우
- The Journal of the Acoustical Society of Korea
- /
- v.4 no.3
- /
- pp.33-41
- /
- 1985
이 논문에서는 frequency-weighted MSE를 최소화하는 적응 pre-filter와 post-filter를 음성파형 부호화기에 적용했을 때의 성능을 분석한다. 먼저 여러 다양한 pre-filter와 post-filter에 의한 noise shaping 효과를 이론적으로 보여준다. 그리고 frequency-weighted SNR 척도를 사용하여 적응 pre-filter 와 post filter에 의한 성능면에서의 이득을 이론적으로 유도한다. 적응 pre-filter와 post-filter를 ADM과 ADPCM 부호화기에 적용해본 결과에 의하면 음성파형 부호화기의 성능을 FWSNR\sub SEG\ 척도로 약 3dB 정도 개선할 수 있음을 알 수있다. 또한 pre-filter와 post-filter를 사용하면 청각적으로 중요한 영향을 미치는 1kHz에서 3kHz 사이의 양자화 잡음을 효과적으로 줄일 수 있다.
PDF

On a Pitch Alteration Method by Time-axis Scaling Compensated with the Spectrum for High Quality Speech Synthesis (고음질 합성용 스펙트럼 보상된 시간축조절 피치 변경법)

Bae, Myung-Jin;Lee, Won-Cheol;Im, Sung-Bin
- The Journal of the Acoustical Society of Korea
- /
- v.14 no.4
- /
- pp.89-95
- /
- 1995
The waveform coding technique has concerned with simply preserving the waveform shape of speech signal through a redundancy reduction process. In the case of speech synthesis, the waveform coding with high sound quality is mainly used to the synthesis by analysis. However, since the parameters of this coding are not classified into either excitation or vocal tract parameters, it is difficult to applying the waveform coding to the synthesis by rule. In order to apply the waveform coding to the synthesis by rule, the pitch alteration technique is required in prosody control. In this paper, we propose a new pitch alteration method that can change the pitch period in waveform coding by scaling the time-axis and compensating the spectrum. This is relevant to the time-frequency domain method were the phase components of the waveform is preserved with a little spectrum distortion of 2.5 % and less for 50% pitch change.
PDF

Acoustic Full-waveform Inversion using Adam Optimizer (Adam Optimizer를 이용한 음향매질 탄성파 완전파형역산)

Kim, Sooyoon;Chung, Wookeen;Shin, Sungryul
- Geophysics and Geophysical Exploration
- /
- v.22 no.4
- /
- pp.202-209
- /
- 2019
In this study, an acoustic full-waveform inversion using Adam optimizer was proposed. The steepest descent method, which is commonly used for the optimization of seismic waveform inversion, is fast and easy to apply, but the inverse problem does not converge correctly. Various optimization methods suggested as alternative solutions require large calculation time though they were much more accurate than the steepest descent method. The Adam optimizer is widely used in deep learning for the optimization of learning model. It is considered as one of the most effective optimization method for diverse models. Thus, we proposed seismic full-waveform inversion algorithm using the Adam optimizer for fast and accurate convergence. To prove the performance of the suggested inversion algorithm, we compared the updated P-wave velocity model obtained using the Adam optimizer with the inversion results from the steepest descent method. As a result, we confirmed that the proposed algorithm can provide fast error convergence and precise inversion results.
https://doi.org/10.7582/GGE.2019.22.4.202 인용 PDF KSCI

Transmission waveform design for compressive sensing active sonar using the matrix projection from Gram matrix to identity matrix and a constraint for bandwidth (대역폭 제한 조건과 Gram 행렬의 단위행렬로의 사영을 이용한 압축센싱 능동소나 송신파형 설계)

Lee, Sehyun;Lee, Keunhwa;Lim, Jun-Seok;Cheong, Myoung-Jun
- The Journal of the Acoustical Society of Korea
- /
- v.38 no.5
- /
- pp.522-533
- /
- 2019
The compressive sensing model for range-Doppler estimation can be expressed as an under-determined linear system y = Ax. To find the solution of the linear system with the compressive sensing method, matrix A should be sufficiently incoherent and x to be sparse. In this paper, we propose a transmission waveform design method that maintains the bandwidth required by the sonar system while lowering the mutual coherence of the matrix A so that the matrix A is incoherent. The proposed method combines two methods of optimizing the sensing matrix with the alternating projection and suppressing unwanted frequency bands using the DFT (Discrete Fourier Transform) matrix. We compare range-Doppler estimation performance of existing waveform LFM(Linear Frequency Modulated) and designed waveform using the matched filter and the compressive sensing method. Simulation shows that the designed transmission waveform has better detection performance than the existing waveform LFM.
https://doi.org/10.7776/ASK.2019.38.5.522 인용 PDF KSCI

On a Pitch Alteration Technique in the V/UV Spectrum for High Quality Speech Synthesis Technique (고음질 합성방식용 V/UV 스펙트럼상의 피치변경법에 관한 연구)

Jo, Wang-Rae;Bae, Myung-Jin;Kim, Dong-Sung
- The Journal of the Acoustical Society of Korea
- /
- v.15 no.6
- /
- pp.99-103
- /
- 1996
Most waveform coding techniques attempt to reduce redundancy of speech signal while preserving the shape of the waveform. In speech synthesis, wavefrom coding methods are used to the synthesis by rule for high quality speech. However, it is difficult to apply the waveform coding to the synthesis by rule because the parameters of the wavefrom coding cannot be classified as either the excitation or the vocal tract parameters. The proposed method shows little spectrum distortion of 2.7% or less for 50% pitch changes. It also achieves smooth connection of wavefrom magnitudes among the frames by compensating the phase in time domain.
PDF

Analysis of the Performance of Ultrasonic Transducers (초음파 탐촉자의 성능 평가법)

노용래
- The Journal of the Acoustical Society of Korea
- /
- v.10 no.6
- /
- pp.12-22
- /
- 1991
초음파 탐촉자 성능의 이론적 평가법을 개발해 이중소자 탐촉자에 적용하였다. 본 방법은 탐촉 자의 발신, 회절, 수신특성을 주파수, 시간 영역에서 평가할 수 있으며, 특히 시간영역에서는 통상 사용 하는 역푸리에 변환법에 의존하지 않고 음파의 투과, 반사 특성을 고려해 파형을 직접 구할 수 있다.
PDF

On Detecting the Steady State Segments of Phonemes by Using the Magnitude Distribution of Speech Waveforms (음성파형의 진폭분포를 이용한 음소의 정상상태 구간 검출)

정덕조;배명진;안수길
- The Journal of the Acoustical Society of Korea
- /
- v.10 no.6
- /
- pp.5-11
- /
- 1991
연속음 인식을 위하여 연결된 음향 신호를 음소단위로 분할하는 것이 필요하다. 본 논문에서는 연속 음성에서의 정상상태 구간 검출을 위한 파라미터로서 진폭분포를 이용하는 방법을 제안하였다. 제 안된 진폭분포는 음성신호의 변화특성을 정확히 나타내며 이러한 프레임사이의 진폭분포를 이용하는 방 법을 제안하였다. 제안된 지폭분포는 음성 신호의 변화특성을 정확히 나타내며 이러한 프레임사이의 진 폭 분포 차이값을 비교하여 프레임의 안정구간과 천이구간을 구분할 수 있었다.
PDF

Search Result 191, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)