Search | Korea Science

Speaker Adaptation in VQ and HMM Based Speech Recognition (VQ와 HMM을 이용한 음성인식에서 화자적응에 관한 연구)

이대룡
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1991.06a
- /
- pp.54-57
- /
- 1991
본 논무에서는 HMM과 VQ를 이용한 고립단어에 대한 화자종속 및 화자독립 음성인식시스템을 만들고 여기에 화자적응을 하는 방법에 대한 연구를 했다. 화자적응방법에는 크게 VQ코드북을 적응시키는 방법과 HMM패러미터블 적응시키는 방법이 있다. 코드북적응을 하는 방법으로서 기존코드북에 대해 새로운화자의 적응음성을 양자화한 뒤 각 코드벡터에 해당하는 적응음성의 평균을 구해서 새로운 화자의 코드북을 구해주는 방법과 기준코드북에 대해 새로운화자의 적응음성을 양자화할 때 HMM의 각 상태에서 각각의 코드벡터를 발생할 확률을 거리오차의 계산에서 고려해 비록 거리오차는 크지만 그 코드벡터를 발생할 확률이 매우 높으면 적응음성이 그 코드벡터에 index되게해서 각 코드벡터에 해당하는 모든 적응음성데이타의 평균을 새로운 코드북으로 하는 두가지 알고리즘을 제안한다. 이렇게 함으로써 기존의 기준코드북을 초기 코드북으로해서 LBG알고리즘을 사용해서 적응음성데이타에 대한 새로운 코드북을 만드는 방법에 비해 5-10배의 계산시간을 감소하게 된다. 이 새로운 코드북으로 적응음성데이타를 다시 index해서 이 index된 음성렬로 HMM패러미터를 적응했다. 제안된 알고리즘이 코드북적응을 하는 경우에 기존의 적응방법에 비해 5-10배의 계산 시간을 단축하면서 인식률에서는 더 나은결과를 얻었다. 또 같은 적응방법에 대해서 화자종속모델 보다는 화자독립모델에 대해서 화자적응하는 것이 더 나은 인식결과를 보여주었다.
PDF

A Method of Depth Image Quantization and Bezier Curves Generation for Stereoscopic Image Authoring Tools (입체 영상 저작도구를 위한 깊이영상 양자화 및 베지어 곡선 생성 방법)

Ko, Min Soo;Cho, Choong Sang;Shin, Hwa Seon;Yoo, Jisang
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2014.11a
- /
- pp.240-241
- /
- 2014
3D 입체영상 변환 기술은 콘텐츠 확보의 측면에서 그 중요성이 대두되고 있다. 하지만 입체변환 기술은 매 프레임마다 모두 수작업을 거치기 때문에 다수의 인력과 오랜 작업 시간이 필요하여 생산성 문제가 발생하고 있다. 그 중 깊이영상의 외곽선을 벡터 곡선으로 그리는 작업이 수작업을 통해 이루어지고 있으며 오랜 작업시간이 걸리게 한다. 본 논문에서는 기존의 입체영상 변환 과정의 자동화율을 높이기 위한 깊이영상 양자화 및 베지어 곡선 생성 방법을 제안한다. 연속적인 깊이값을 갖는 깊이영상을 입력으로 받아 선형 또는 비선형 기반의 양자화 방법을 이용하여 깊이영상을 양자화 한다. 이 때 경계부분에 발생하는 페더를 제거하여 양자화 깊이영상의 경계를 보정한다. 양자화 깊이영상에서 같은 깊이를 잇는 등심선을 생성하고 방향 변화가 큰 지점인 굴곡점들을 추출하여 등심선을 다수의 곡선으로 구분한다. 각 곡선의 양 끝의 굴곡점과 그 사이의 중간점을 이용하여 3차 베지어 곡선의 제어 포인트를 계산한다. 같은 수행 단계를 모든 등심선에 적용하여 사용자가 미세보정하기 쉬운 3차 베지어 곡선들을 생성한다. 실험 결과를 통해 제안하는 기법의 우수성을 확인하였다.
PDF

A new Classified VQ Algorithm in the DCT domain (DCT영역에서의 분류화 방법을 이용한 벡터 양자화기)

임창훈;고종석;김재균
- Proceedings of the Korean Institute of Communication Sciences Conference
- /
- 1987.10a
- /
- pp.27-34
- /
- 1987
PDF

동영상의 차분 이미지 부호화를 위한 고속 벡터 양자화 알고리즘

최지웅;나성웅
- Proceedings of the Korean Institute of Communication Sciences Conference
- /
- 1998.10a
- /
- pp.467-470
- /
- 1998

벡터 양자화를 이용한 비디오 데이터 압축

오승준;변세영
- Proceedings of the Korean Institute of Communication Sciences Conference
- /
- 1995.06a
- /
- pp.373-376
- /
- 1995

Wavelet Packet-Based Progressive Image Transmission (Wavelet Packet 기반 점진적 영상 전송)

Song, Joon-Ho;Lee, Gi-Hun;Park, Rae-Hong
- Journal of the Korean Institute of Telematics and Electronics S
- /
- v.35S no.8
- /
- pp.77-85
- /
- 1998
This paper proposes progressive image transmission(PIT) methods based on the wavelet packet transform, in which quantizers are optimized at each stage for the given bit rate. Scalar and vector quantizers are used and the performance of each quantizer is compared. After quantization, selected subbands are ordered by their priority for transmission. Subjective quality of the reconsetructed image is improved by human visual system (HVS) weighting.
PDF

Fast Codebook Search for Vector Quantization in Image Coding (영상 부호화를 위한 벡터 양자화기에서의 고속 탐색 기법)

고종석;김재균
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.13 no.4
- /
- pp.302-308
- /
- 1988
The paper describes a very simple algorithm for reducing the encoding complexity of vector quantization(VQ), exploiting the feature of a vector currently being encoded. A proposed VQ of 16(=4x4) vector dimension shows a slight performance degradation of about 0.1-1.9dB, however, with only 16-32 among 256 codeword searches, i.e., with just 1/16-1/8 search complexity compared to a full-search VQ. And the proposed VQ scheme is also compared to outperform tree-search VQ with regard to their SNR performance and memory requirement.
PDF

Vector Quantization Compression of the Still Image by Multilayer Perceptron (다층 신경회로망 학습에 의한 정지 영상의 벡터)

Lee, Sang-Chan;Choe, Tae-Wan;Kim, Ji-Hong
- The Transactions of the Korea Information Processing Society
- /
- v.3 no.2
- /
- pp.390-398
- /
- 1996
In this paper, a new image compression algorithm using the generality of the multilaryer perceptron is proposed. Proposed algorithm classifies image into some classes, and trains them through the multilayer perceptron. Multilayer perceptron which trained by the above method can do compression and reconstruction of the nontrained image by the generality. Also, it reduces memory size of the side of receiver and quantization error. For the experiment, we divide Lena image into 16 classes and train them through one multilayer perceptron. The experimental results show that we can get excellent reconstruction images by doing compression and reconstruction for Lena image, Dollar image and Statue image.
PDF

On-line Vector Quantizer Design Using Stochastic Relaxation (Stochastic Relaxation 방법을 이용한 온라인 벡터 양자화기 설계)

Song, Geun-Bae;Lee, Haing-Sei
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.38 no.5
- /
- pp.27-36
- /
- 2001
This paper proposes new design algorithms based on stochastic relaxation (SR) for an on-line vector quantizer (VQ) design. These proposed SR methods solve the local entrapment problems of the conventional Kohonen learning algorithm (KLA). These SR methods cover two different types depending upon the use of simulated annealing (SA) : the one that uses SA is called the OLVQ SA and the other the OLVQ SR. These methods arc combined with the KLA and therefore preserve the its convergence properties. Experimental results for Gauss Markov sources, real speech and image demonstrate that the proposed algorithms can consistently provide better codebooks than the KLA.
PDF

3D Image Coding Using DCT and Hierarchical Segmentation Vector Quantization (DCT와 계층 분할 벡터 양자화를 이용한 3차원 영상 부호화)

Cho Seong Hwan;Kim Eung Sung
- Journal of Internet Computing and Services
- /
- v.6 no.2
- /
- pp.59-68
- /
- 2005
In this paper, for compression and transmission of 3D image, we propose an algorithm which executes 3D discrete cosine transform(DCT) for 3D images, hierarchically segments 3D blocks of an image in comparison with the original image and executes finite-state vector quantization(FSVQ) for each 3D block. Using 3D DCT coefficient feature, a 3D image is segmented hierarchically into large smooth blocks and small edge blocks, then the block hierarchy informations are transmitted. The codebooks are constructed for each hierarchical blocks respectively, the encoder transmits codeword index using FSVQ for reducing encoded bit with hierarchical segmentation information. The new algorithm suggested in this paper shows that the quality of Small Lobster and Head image increased by 1,91 dB and 1.47 dB respectively compared with those of HFSVQ.
PDF

Search Result 318, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)