Search | Korea Science

A Spectral Smoothing Algorithm for Unit Concatenating Speech Synthesis (코퍼스 기반 음성합성기를 위한 합성단위 경계 스펙트럼 평탄화 알고리즘)

Kim Sang-Jin;Jang Kyung Ae;Hahn Minsoo
- MALSORI
- /
- no.56
- /
- pp.225-235
- /
- 2005
Speech unit concatenation with a large database is presently the most popular method for speech synthesis. In this approach, the mismatches at the unit boundaries are unavoidable and become one of the reasons for quality degradation. This paper proposes an algorithm to reduce undesired discontinuities between the subsequent units. Optimal matching points are calculated in two steps. Firstly, the fullback-Leibler distance measurement is utilized for the spectral matching, then the unit sliding and the overlap windowing are used for the waveform matching. The proposed algorithm is implemented for the corpus-based unit concatenating Korean text-to-speech system that has an automatically labeled database. Experimental results show that our algorithm is fairly better than the raw concatenation or the overlap smoothing method.
PDF

Improvement of Synthetic Speech Quality using a New Spectral Smoothing Technique (새로운 스펙트럼 완만화에 의한 합성 음질 개선)

장효종;최형일
- Journal of KIISE:Software and Applications
- /
- v.30 no.11
- /
- pp.1037-1043
- /
- 2003
This paper describes a speech synthesis technique using a diphone as an unit phoneme. Speech synthesis is basically accomplished by concatenating unit phonemes, and it's major problem is discontinuity at the connection part between unit phonemes. To solve this problem, this paper proposes a new spectral smoothing technique which reflects not only formant trajectories but also distribution characteristics of spectrum and human's acoustic characteristics. That is, the proposed technique decides the quantity and extent of smoothing by considering human's acoustic characteristics at the connection part of unit phonemes, and then performs spectral smoothing using weights calculated along a time axis at the border of two diphones. The proposed technique reduces the discontinuity and minimizes the distortion which is caused by spectral smoothing. For the purpose of performance evaluation, we tested on five hundred diphones which are extracted from twenty sentences using ETRI Voice DB samples and individually self-recorded samples.
PDF KSCI

Speech Synthesis using Diphone Clustering and Improved Spectral Smoothing (다이폰 군집화와 개선된 스펙트럼 완만화에 의한 음성합성)

Jang, Hyo-Jong;Kim, Kwan-Jung;Kim, Gye-Young;Choi, Hyung-Il
- The KIPS Transactions:PartB
- /
- v.10B no.6
- /
- pp.665-672
- /
- 2003
This paper describes a speech synthesis technique by concatenating unit phoneme. At that time, a major problem is that discontinuity is happened from connection part between unit phonemes, especially from connection part between unit phonemes recorded by different persons. To solve the problem, this paper uses clustered diphone, and proposes a spectral smoothing technique, not only using formant trajectory and distribution characteristic of spectrum but also reflecting human's acoustic characteristic. That is, the proposed technique performs unit phoneme clustering using distribution characteristic of spectrum at connection part between unit phonemes and decides a quantity and a scope for the smoothing by considering human's acoustic characteristic at the connection part of unit phonemes, and then performs the spectral smoothing using weights calculated along a time axes at the border of two diphones. The proposed technique removes the discontinuity and minimizes the distortion which can be occurred by spectrum smoothing. For the purpose of the performance evaluation, we test on five hundred diphones which are extracted from twenty sentences recorded by five persons, and show the experimental results.
https://doi.org/10.3745/KIPSTB.2003.10B.6.665 인용 PDF KSCI

A Study on the Improvement of Image Fusion Accuracy Using Smoothing Filter-based Replacement Method (SFR 기법을 이용한 영상 융합의 정확도 향상에 관한 연구)

Yun Kong-Hyun;Sohn Hong-Gyoo
- Proceedings of the Korean Society of Surveying, Geodesy, Photogrammetry, and Cartography Conference
- /
- 2006.04a
- /
- pp.187-192
- /
- 2006
Image fusion techniques are widely used to integrate a lower spatial resolution multispectral image with a higher spatial resolution panchromatic image. However, the existing techniques either cannot avoid distorting the image spectral properties or involve complicated and time-consuming decomposition and reconstruction processing in the case of wavelet transform-based fusion. In this study a simple spectral preserve fusion technique: the Smoothing Filter-based Replacement(SFR) is proposed based on a simplified solar radiation and land surface reflection model. By using a ratio between a higher resolution image and its low pass filtered (with a smoothing filter) image, spatial details can be injected to a co-registered lower resolution multispectral image minimizing its spectral properties and contrast. The technique can be applied to improve spatial resolution for either colour composites or individual bands. The fidelity to spectral property and the spatial quality of SFM are convincingly demonstrated by an image fusion experiment using IKONOS panchromatic and multispectral images. The visual evaluation and statistical analysis compared with other image fusion techniques confirmed that SFR is a better fusion technique for preserving spectral information.
PDF

The Development of Gamma Energy Identifying Algorithm for Compact Radiation Sensors Using Stepwise Refinement Technique

Yoo, Hyunjun;Kim, Yewon;Kim, Hyunduk;Yi, Yun;Cho, Gyuseong
- Journal of Radiation Protection and Research
- /
- v.42 no.2
- /
- pp.91-97
- /
- 2017
Background: A gamma energy identifying algorithm using spectral decomposition combined with smoothing method was suggested to confirm the existence of the artificial radio isotopes. The algorithm is composed by original pattern recognition method and smoothing method to enhance the performance to identify gamma energy of radiation sensors that have low energy resolution. Materials and Methods: The gamma energy identifying algorithm for the compact radiation sensor is a three-step of refinement process. Firstly, the magnitude set is calculated by the original spectral decomposition. Secondly, the magnitude of modeling error in the magnitude set is reduced by the smoothing method. Thirdly, the expected gamma energy is finally decided based on the enhanced magnitude set as a result of the spectral decomposition with the smoothing method. The algorithm was optimized for the designed radiation sensor composed of a CsI (Tl) scintillator and a silicon pin diode. Results and Discussion: The two performance parameters used to estimate the algorithm are the accuracy of expected gamma energy and the number of repeated calculations. The original gamma energy was accurately identified with the single energy of gamma radiation by adapting this modeling error reduction method. Also the average error decreased by half with the multi energies of gamma radiation in comparison to the original spectral decomposition. In addition, the number of repeated calculations also decreased by half even in low fluence conditions under $10^4$ ($/0.09cm^2$ of the scintillator surface). Conclusion: Through the development of this algorithm, we have confirmed the possibility of developing a product that can identify artificial radionuclides nearby using inexpensive radiation sensors that are easy to use by the public. Therefore, it can contribute to reduce the anxiety of the public exposure by determining the presence of artificial radionuclides in the vicinity.
https://doi.org/10.14407/jrpr.2017.42.2.91 인용 PDF KSCI

A Study on the Improvement of Image Fusion Accuracy Using Smoothing Filter-based Replacement Method (SFR기법을 이용한 영상 융합의 정확도 향상에 관한 연구)

Yun Kong-Hyun
- Spatial Information Research
- /
- v.14 no.1 s.36
- /
- pp.85-94
- /
- 2006
Image fusion techniques are widely used to integrate a lower spatial resolution multispectral image with a higher spatial resolution panchromatic image. However, the existing techniques either cannot avoid distorting the image spectral properties or involve complicated and time-consuming decomposition and reconstruction processing in the case of wavelet transform-based fusion. In this study a simple spectral preserve fusion technique: the Smoothing Filter-based Replacement(SFR) is proposed based on a simplified solar radiation and land surface reflection model. By using a ratio between a higher resolution image and its low pass filtered (with a smoothing filter) image, spatial details can be injected to a co-registered lower resolution multispectral image minimizing its spectral properties and contrast. The technique can be applied to improve spatial resolution for either colour composites or individual bands. The fidelity to spectral property and the spatial quality of SFM are convincingly demonstrated by an image fusion experiment using IKONOS panchromatic and multispectral images. The visual evaluation and statistical analysis compared with other image fusion techniques confirmed that SFR is a better fusion technique for preserving spectral information.
PDF

Smoothing Parameter Selection in Nonparametric Spectral Density Estimation

Kang, Kee-Hoon;Park, Byeong-U;Cho, Sin-Sup;Kim, Woo-Chul
- Communications for Statistical Applications and Methods
- /
- v.2 no.2
- /
- pp.231-242
- /
- 1995
In this paper we consider kernel type estimator of the spectral density at a point in the analysis of stationary time series data. The kernel entails choice of smoothing parameter called bandwidth. A data-based bandwidth choice is proposed, and it is obtained by solving an equation similar to Sheather(1986) which relates to the probability density estimation. A Monte Carlo study is done. It reveals that the spectral density estimates using the data-based bandwidths show comparatively good performance.
PDF

The Use of The Spectral Properties of Basis Splines in Problems of Signal Processing

Nasiritdinovich, Zaynidinov Hakim;Egamberdievich, MirzayevAvaz;Panjievich, Khalilov Sirojiddin
- Journal of Multimedia Information System
- /
- v.5 no.1
- /
- pp.63-66
- /
- 2018
In this work, the smoothing and the interpolation basis splines are analyzed. As well as the possibility of using the spectral properties of the basis splines for digital signal processing are shown. This takes into account the fact that basic splines represent finite, piecewise polynomial functions defined on compact media.
https://doi.org/10.9717/JMIS.2018.5.1.63 인용 PDF KSCI HTML

A Recognition Time Reduction Algorithm for Large-Vocabulary Speech Recognition (대용량 음성인식을 위한 인식기간 감축 알고리즘)

Koo, Jun-Mo;Un, Chong-Kwan;,
- The Journal of the Acoustical Society of Korea
- /
- v.10 no.3
- /
- pp.31-36
- /
- 1991
We propose an efficient pre-classification algorithm extracting candidate words to reduce the recognition time in a large-vocabulary recognition system and also propose the use of spectral and temporal smoothing of the observation probability to improve its classification performance. The proposed algorithm computes the coarse likelihood score for each word in a lexicon using the observation probabilities of speech spectra and duration information of recognition units. With the proposed approach we could reduce the computational amount by 74% with slight degradation of recognition accuracy in 1160-word recognition system based on the phoneme-level HMM. Also, we observed that the proposed coarse likelihood score computation algorithm is a good estimator of the likelihood score computed by the Viterbi algorithm.
PDF

Isolated Digit and Command Recognition in Car Environment (자동차 환경에서의 단독 숫자음 및 명령어 인식)

양태영;신원호;김지성;안동순;이충용;윤대희;차일환
- The Journal of the Acoustical Society of Korea
- /
- v.18 no.2
- /
- pp.11-17
- /
- 1999
This paper proposes an observation probability smoothing technique for the robustness of a discrete hidden Markov(DHMM) model based speech recognizer. Also, an appropriate noise robust processing in car environment is suggested from experimental results. The noisy speech is often mislabeled during the vector quantization process. To reduce the effects of such mislabelings, the proposed technique increases the observation probability of similar codewords. For the noise robust processing in car environment, the liftering on the distance measure of feature vectors, the high pass filtering, and the spectral subtraction methods are examined. Recognition experiments on the 14-isolated words consists of the Korean digits and command words were performed. The database was recorded in a stopping car and a running car environments. The recognition rates of the baseline recognizer were 97.4% in a stopping situation and 59.1% in a running situation. Using the proposed observation probability smoothing technique, the liftering, the high pass filtering, and the spectral subtraction the recognition rates were enhanced to 98.3% in a stopping situation and to 88.6% in a running situation.
PDF

Search Result 52, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)