• 제목/요약/키워드: linear spectral transformation

검색결과 28건 처리시간 0.028초

Maximum mutual information estimation을 이용한 linear spectral transformation 기반의 adaptation (Maximum mutual information estimation linear spectral transform based adaptation)

  • 유봉수;김동현;육동석
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 춘계 학술대회 발표논문집
    • /
    • pp.53-56
    • /
    • 2005
  • In this paper, we propose a transformation based robust adaptation technique that uses the maximum mutual information(MMI) estimation for the objective function and the linear spectral transformation(LST) for adaptation. LST is an adaptation method that deals with environmental noises in the linear spectral domain, so that a small number of parameters can be used for fast adaptation. The proposed technique is called MMI-LST, and evaluated on TIMIT and FFMTIMIT corpora to show that it is advantageous when only a small amount of adaptation speech is used.

  • PDF

A Closed-Form Solution of Linear Spectral Transformation for Robust Speech Recognition

  • Kim, Dong-Hyun;Yook, Dong-Suk
    • ETRI Journal
    • /
    • 제31권4호
    • /
    • pp.454-456
    • /
    • 2009
  • The maximum likelihood linear spectral transformation (ML-LST) using a numerical iteration method has been previously proposed for robust speech recognition. The numerical iteration method is not appropriate for real-time applications due to its computational complexity. In order to reduce the computational cost, the objective function of the ML-LST is approximated and a closed-form solution is proposed in this paper. It is shown experimentally that the proposed closed-form solution for the ML-LST can provide rapid speaker and environment adaptation for robust speech recognition.

GMM based Nonlinear Transformation Methods for Voice Conversion

  • Vu, Hoang-Gia;Bae, Jae-Hyun;Oh, Yung-Hwan
    • 대한음성학회:학술대회논문집
    • /
    • 대한음성학회 2005년도 추계 학술대회 발표논문집
    • /
    • pp.67-70
    • /
    • 2005
  • Voice conversion (VC) is a technique for modifying the speech signal of a source speaker so that it sounds as if it is spoken by a target speaker. Most previous VC approaches used a linear transformation function based on GMM to convert the source spectral envelope to the target spectral envelope. In this paper, we propose several nonlinear GMM-based transformation functions in an attempt to deal with the over-smoothing effect of linear transformation. In order to obtain high-quality modifications of speech signals our VC system is implemented using the Harmonic plus Noise Model (HNM)analysis/synthesis framework. Experimental results are reported on the English corpus, MOCHA-TlMlT.

  • PDF

기하학적 기법을 이용한 하이퍼스펙트럴 영상의 Linear Spectral Mixing모델에 관한 연구 (A Study on Linear Spectral Mixing Model for Hyperspectral Imagery with Geometric Method)

  • 장은석;김대성;김용일
    • 한국GIS학회:학술대회논문집
    • /
    • 한국GIS학회 2003년도 추계학술대회논문집
    • /
    • pp.23-29
    • /
    • 2003
  • Detection in remotely sensed images can be conducted spatially, spectrally or both [2]. If the images have high spatial resolution, materials can be detected by using spatial and spectral information, unless we can't see the object embedded in a pixel. In this paper, we intend to solve the limit of spatial resolution by using the hyperspectral image which has high spectral resolution. Therefore, the Linear Spectral Mixing(LSM) Model which is sub-pixel detection algorithm is used to solve this problem. To find class Endmembers, we applied Geometric Model with MNF(Minimum Noise Fraction) transformation. From the result of sub-pixel detection algorithm, we can see the detection of water is satisfied and the object shape cannot be extracted but the possibility of material existence can be identified.

  • PDF

Spectral Feature Transformation for Compensation of Microphone Mismatches

  • Jeong, So-Young;Oh, Sang-Hoon;Lee, Soo-Young
    • The Journal of the Acoustical Society of Korea
    • /
    • 제22권4E호
    • /
    • pp.150-154
    • /
    • 2003
  • The distortion effects of microphones have been analyzed and compensated at mel-frequency feature domain. Unlike popular bias removal algorithms a linear transformation of mel-frequency spectrum is incorporated. Although a diagonal matrix transformation is sufficient for medium-quality microphones, a full-matrix transform is required for low-quality microphones with severe nonlinearity. Proposed compensation algorithms are tested with HTIMIT database, which resulted in about 5 percents improvements in recognition rate over conventional CMS algorithm.

3차원 공간에서 바닥의 움직임에 의한 규칙파의 생성을 모의할 수 있는 선형 스펙트럼법 (Linear Spectral Method for Simulating the Generation of Regular Waves by a Moving Bottom in a 3-dimensional Space)

  • 정재상;이창훈
    • 한국해안·해양공학회논문집
    • /
    • 제36권2호
    • /
    • pp.70-79
    • /
    • 2024
  • 본 연구에서는 3차원 공간에서 바닥의 움직임에 따른 선형파의 생성을 모의할 수 있는 스펙트럼 법을 소개한다. 지배방정식은 선형의 동역학적 및 운동학적 자유수면 경계조건이며, 두 식은 Fourier 공간에서 해석된다. 해석된 속도포텐셜 및 자유수면변위는 연속방정식과 운동학적 바닥경계조건을 항상 만족해야 한다. 수치해석에서 시간 적분은 4차 Runge-Kutta 법을 이용하여 해석하였다. Fourier 공간에서 해석한 결과는 Fourier 역변환을 통해 실제 공간에서의 속도포텐셜과 자유수면변위로 표현된다. 본 수치모델을 이용하여 다양한 형상의 바닥이 규칙적으로 움직이는 경우 생성되는 규칙파에 대해 모의하였다. 또한 바닥의 움직임을 이용하여 비스듬히 전파하는 규칙파의 생성도 모의하였다. 수치모델의 결과는 해석해와 비교하였으며, 거의 일치하는 결과를 보였다.

ON CLENSHAW-CURTIS SPECTRAL COLLOCATION METHOD FOR VOLTERRA INTEGRAL EQUATIONS

  • CHAOLAN, HUANG;CHUNHUA, FANG;JIANYU, WANG;ZHENGSU, WAN
    • Journal of applied mathematics & informatics
    • /
    • 제40권5_6호
    • /
    • pp.983-993
    • /
    • 2022
  • The main purpose of this paper is to solve the second kind Volterra integral equations by Clenshaw-Curtis spectral collocation method. First of all, we can transform the integral interval from [-1, x] to [-1, 1] through a simple linear transformation, and discretize the integral term in the equation by Clenshaw-Curtis quadrature formula to obtain the collocation equations. Then we provide a rigorous error analysis for the proposed method. At last, several numerical example are used to verify the results of theoretical analysis.

Partial Spectrum Detection and Super-Gaussian Window Function for Ultrahigh-resolution Spectral-domain Optical Coherence Tomography with a Linear-k Spectrometer

  • Hyun-Ji, Lee;Sang-Won, Lee
    • Current Optics and Photonics
    • /
    • 제7권1호
    • /
    • pp.73-82
    • /
    • 2023
  • In this study, we demonstrate ultrahigh-resolution spectral-domain optical coherence tomography with a 200-kHz line rate using a superluminescent diode with a -3-dB bandwidth of 100 nm at 849 nm. To increase the line rate, a subset of the total number of camera pixels is used. In addition, a partial-spectrum detection method is used to obtain OCT images within an imaging depth of 2.1 mm while maintaining ultrahigh axial resolution. The partially detected spectrum has a flat-topped intensity profile, and side lobes occur after fast Fourier transformation. Consequently, we propose and apply the super-Gaussian window function as a new window function, to reduce the side lobes and obtain a result that is close to that of the axial-resolution condition with no window function applied. Upon application of the super-Gaussian window function, the result is close to the ultrahigh axial resolution of 4.2 ㎛ in air, corresponding to 3.1 ㎛ in tissue (n = 1.35).

롬바드 효과의 보정을 위한 스펙트럼 크기의 정규화와 켑스트럼 변환 (Normalization of Spectral Magnitude and Cepstral Transformation for Compensation of Lombard Effect)

  • 지상문;오영환
    • 한국음향학회지
    • /
    • 제15권4호
    • /
    • pp.83-92
    • /
    • 1996
  • 본 연구에서는 음성인식기의 성능이 잡음환경하에서 급격히 저하되는 것을 완화하기 위해, 성능저하의 원인인 롬바드효과의 보정과 잡음의 제거방법을 제안하였다. 롬바드 효과는 조용한 환경에서 발성된 음성에 비해, 스펙트럼 포락과 발성음의 세기를 변이 시키는 것으로 모델링하였고, 변이의 제거를 위해 스펙트럼 크기의 정규화와 켑스트럼 변환을 사용하였다. 주변 잡음의 첨가에 의한 음성신호의 왜곡은 스펙트럼 차감법을 사용하여 완화하였고, 음성의 동적인 특성을 강조하기 위해 대역통과 필터링을 하였다. 잡음환경에서 발성된 롬바드 음성의 분석 및 잡음처리 기술의 개발과 평가를 위해, 음성인식 기술의 적용이 예상되는 자동차, 전시장, 시내 공중전화 부스, 거리, 전산실 잡음을 이용하여 롬바드 음성을 수집하여 실험하였다. 제안한 방법을 여러 가지 잡음환경하에서 음성인식에 적용한 결과, 효과적인 잡음처리 방법임을 확인할 수 있었다.

  • PDF

NUMERICAL SIMULATION OF TWO-DIMENSIONAL FREE-SURFACE FLOW AND WAVE TRANSFORMATION OVER CONSTANT-SLOPE BOTTOM TOPOGRAPHY

  • DIMAKOPOULOS AGGELOS S;DIMAS ATHANASSIOS A
    • 한국수자원학회:학술대회논문집
    • /
    • 한국수자원학회 2005년도 학술발표회(2)
    • /
    • pp.842-845
    • /
    • 2005
  • A method for the numerical simulation of two-dimensional free-surface flow resulting from the propagation of regular gravity waves over topography with arbitrary bottom shape is presented. The method is based on the numerical solution of the Euler equations subject to the fully nonlinear free-surface boundary conditions and the appropriate bottom, inflow and outflow conditions using a hybrid finite-differences and spectral-method scheme. The formulation includes a boundary-fitted transformation, and is suitable for extension to incorporate large-eddy simulation (LES) and large-wave simulation (LWS) terms for turbulence and breaking wave modeling, respectively. Results are presented for the simulation of the free-surface flow over two different bottom topographies, with constant slope values of 1:10 and 1:20, two different inflow wave lengths and two different inflow wave heights. An absorption outflow zone is utilized and the results indicate minimum wave reflection from the outflow boundary. Over the bottom slope, lengths of waves in the linear regime are modified according to linear theory dispersion, while wave heights remain more or less unchanged. For waves in the nonlinear regime, wave lengths are becoming shorter, while the free surface elevation deviates from its initial sinusoidal shape.

  • PDF