통합 검색 | Korea Science

A New Pruning Method for Synthesis Database Reduction Using Weighted Vector Quantization

Kim, Sanghun;Lee, Youngjik;Keikichi Hirose
- The Journal of the Acoustical Society of Korea
- /
- 제20권4E호
- /
- pp.31-38
- /
- 2001
A large-scale synthesis database for a unit selection based synthesis method usually retains redundant synthesis unit instances, which are useless to the synthetic speech quality. In this paper, to eliminate those instances from the synthesis database, we proposed a new pruning method called weighted vector quantization (WVQ). The WVQ reflects relative importance of each synthesis unit instance when clustering the similar instances using vector quantization (VQ) technique. The proposed method was compared with two conventional pruning methods through the objective and subjective evaluations of the synthetic speech quality: one to simply limit maximum number of instance, and the other based on normal VQ-based clustering. The proposed method showed the best performance under 50% reduction rates. Over 50% of reduction rates, the synthetic speech quality is not seriously but perceptibly degraded. Using the proposed method, the synthesis database can be efficiently reduced without serious degradation of the synthetic speech quality.
PDF

한국어에 의한 EVRC LSP 코드북 설계 (Design of EVRC LSP Codebooks with Korean)

이진걸
- 한국음향학회지
- /
- 제21권2호
- /
- pp.167-172
- /
- 2002
음성부호화 알고리즘인 EVRC (Enhanced Variable Rate Codec)는 현재 북미 및 한국 CDMA 디지털 셀룰러 시스템에 사용되고 있다. EVRC음성부호화기에서 음성의 주파수영역에서의 에너지 분포와 관련되어 있는 LSP (Line Spectral Pairs)값은 가중분할 벡터 양자화 (Weighted Split Vector Quantization)에 의해 코딩된다. 이러한 코딩 과정에 사용되는 LSP 코드북이 개발국 언어 혹은 영어로 설계되었음을 감안하면 한국어통화에 대해서는 한국어로 설계된 LS 코드북에 의해 향상된 성능을 기대할 수 있다. 본 논문에서는 한국어로 BVRC의 LSP 코드북을 LBG알고리즘을 기반으로 한 벡터 양자화기법으로 설계하였으며 이 코드북에 의한 벡터양자화 성능향상 및 그에 따른 음질향상을 각각 SD (Spectral Distortion) 및 신호대 잡음비 (SNR), SegSNR측정으로 입증하였다.
PDF KSCI

Nearest-Neighbors Based Weighted Method for the BOVW Applied to Image Classification

Xu, Mengxi;Sun, Quansen;Lu, Yingshu;Shen, Chenming
- Journal of Electrical Engineering and Technology
- /
- 제10권4호
- /
- pp.1877-1885
- /
- 2015
This paper presents a new Nearest-Neighbors based weighted representation for images and weighted K-Nearest-Neighbors (WKNN) classifier to improve the precision of image classification using the Bag of Visual Words (BOVW) based models. Scale-invariant feature transform (SIFT) features are firstly extracted from images. Then, the K-means++ algorithm is adopted in place of the conventional K-means algorithm to generate a more effective visual dictionary. Furthermore, the histogram of visual words becomes more expressive by utilizing the proposed weighted vector quantization (WVQ). Finally, WKNN classifier is applied to enhance the properties of the classification task between images in which similar levels of background noise are present. Average precision and absolute change degree are calculated to assess the classification performance and the stability of K-means++ algorithm, respectively. Experimental results on three diverse datasets: Caltech-101, Caltech-256 and PASCAL VOC 2011 show that the proposed WVQ method and WKNN method further improve the performance of classification.
https://doi.org/10.5370/JEET.2015.10.4.1877 인용 PDF KSCI KPUBS HTML

다중레벨 벡터양자화 기반의 유사도를 이용한 자동 음악요약 (Automatic Music Summarization Using Similarity Measure Based on Multi-Level Vector Quantization)

김성탁;김상호;김회린
- The Journal of the Acoustical Society of Korea
- /
- 제26권2E호
- /
- pp.39-43
- /
- 2007
Music summarization refers to a technique which automatically extracts the most important and representative segments in music content. In this paper, we propose and evaluate a technique which provides the repeated part in music content as music summary. For extracting a repeated segment in music content, the proposed algorithm uses the weighted sum of similarity measures based on multi-level vector quantization for fixed-length summary or optimal-length summary. For similarity measures, count-based similarity measure and distance-based similarity measure are proposed. The number of the same codeword and the Mahalanobis distance of features which have same codeword at the same position in segments are used for count-based and distance-based similarity measure, respectively. Fixed-length music summary is evaluated by measuring the overlapping ratio between hand-made repeated parts and automatically generated ones. Optimal-length music summary is evaluated by calculating how much automatically generated music summary includes repeated parts of the music content. From experiments we observed that optimal-length summary could capture the repeated parts in music content more effectively in terms of summary length than fixed-length summary.
PDF KSCI

2차원 불변 영상 인식을 위한 퍼지 분류기와 바이스펙트럼 (Fuzzy Classifier and Bispectrum for Invariant 2-D Shape Recognition)

한수환;우영운
- 한국멀티미디어학회논문지
- /
- 제3권3호
- /
- pp.241-252
- /
- 2000
이 논문에서는 2차원 영상의 외곽선 정보를 이용하여 추출한 바이스펙트럼과 가중치 퍼지 분류기를 이용하여 영상의 이동, 회전, 크기 변화에 무관한 패턴 인식 기법을 제안하고, 그 인식 결과를 LVQ(Learning Vector Quantization)를 이용한 신경망 분류기와 비교하였다. 3차 큐물런트를 근간으로하는 바이 스펙트럼은 각 영상의 외각선 정보에 적용되어 15개의 특징값들을 추출한다. 이 특징 벡터들은 영상의 이동, 회전, 크기 변화에 무관한 특징을 가지며 2차원 평면 영상의 대표값으로 사용되어 패턴 분류를 위해 가중치 퍼지 분류기의 입력으로 들어간다. 서로 다른 8가지 비행기들의 평면 영상을 이용하여 실험한 결과들은 제안된 인식 시스템의 성능이 상대적으로 우수함을 보였다.
PDF

Fuzzy Mean Method with Bispectral Features for Robust 2D Shape Classification

Woo, Young-Woon;Han, Soo-Whan
- 한국지능정보시스템학회:학술대회논문집
- /
- 한국지능정보시스템학회 1999년도 추계학술대회-지능형 정보기술과 미래조직 Information Technology and Future Organization
- /
- pp.313-320
- /
- 1999
In this paper, a translation, rotation and scale invariant system for the classification of closed 2D images using the bispectrum of a contour sequence and the weighted fuzzy mean method is derived and compared with the classification process using one of the competitive neural algorithm, called a LVQ(Learning Vector Quantization). The bispectrun based on third order cumulants is applied to the contour sequences of the images to extract fifteen feature vectors for each planar image. These bispectral feature vectors, which are invariant to shape translation, rotation and scale transformation, can be used to represent two-dimensional planar images and are fed into an classifier using weighted fuzzy mean method. The experimental processes with eight different shapes of aircraft images are presented to illustrate the high performance of the proposed classifier.
PDF

VQ를 이용한 영상의 객체 특징 추출과 이를 이용한 내용 기반 영상 검색 (Representative Feature Extraction of Objects using VQ and Its Application to Content-based Image Retrieval)

장동식;정세환;유헌우;손용준
- 한국정보과학회논문지:컴퓨팅의 실제 및 레터
- /
- 제7권6호
- /
- pp.724-732
- /
- 2001
내용 기반 영상 검색을 위해 본 연구에서는 VQ(Vector Quantization)을 이용하여 영상을 구성하는 주요 객체들의 특징 추출 방법을 제안한다. 내용 기반 영상 검색 시스템에서 사용되는 영상의 주요특징으로는 색상, 절감, 형태 및 영상을 구성하고 있는 객체들의 공간적 위치 등이 있다. 이 중 본 논문에서는 일반적인 색상 및 질감 특징 추출방법과 더불어 VQ 멕터 클러스터링 알고리즘을 이용하여 정지영상을 구성하고 있는 객체들의 대표 색상과 질감 특징을 빠르게 추출하고 이를 내용 기반 검색에 이용함으로써 정지영상의 내용에 근거한 검색을 하였고 객체 단위 검색을 함으로써 객체의 위치, 회전 및 크기 변화에 무관한 검색을 가능케 했다. 연구의 실험 결과 VQ를 이용함으로써 대표특징치 추출시간을 줄일수 있었고 검색시 색상과 질감 특징의 가중치를 각각 0.5, 0.5로 주는 것이 가장 높은 검출율을 보였으며, ‘사람’영상에 제한한 방법을 적용한 경우 90%의 검출율을 보였다.
PDF

영상 부호화를 위한 새로운 사이드 매치 왜곡 함수를 이용한 적응 유한상태 벡터 양자화 기법 (An Adaptive Finite State Vector Quantization Method Using a New Side Match Distortion Function for Image Coding)

이상운;이두수;임인칠
- 전자공학회논문지S
- /
- 제35S권10호
- /
- pp.118-125
- /
- 1998
본 논문에서는 새로운 사이드 매치 왜곡 함수를 이용한 적응 FSVQ(Finite State Vector Quantization)방법을 제안한다. 기존의 사이드 매치 왜곡 함수는 블럭 경계 사이의 휘도 천이를 부드럽게 해주고 블럭간의 상관 관계가 높은 평탄 영역에서는 적절한 상태 부호책을 작성할 수 있으나 블럭간의 상관 관계가 낮은 윤곽선 영역에서는 적절한 상태 부호책을 작성할 수 없다. 본 논문에서 제안한 왜곡 함수는 기존의 사이드 매치 왜곡 함수에 영상의 특징을 나타내줄 수 있는 분산 값을 가중치로 사용하여 기존의 사이드 매치 왜곡 함수보다 적절한 상태 부호책을 작성할 수 있도록 한다. 또한 상태를 잘못 예측하였을 경우 이를 정정한 후, 처리하게 함으로써 만족할만한 영상의 질을 얻을 수 있도록 한다.
PDF

Unit Generation Based on Phrase Break Strength and Pruning for Corpus-Based Text-to-Speech

Kim, Sang-Hun;Lee, Young-Jik;Hirose, Keikichi
- ETRI Journal
- /
- 제23권4호
- /
- pp.168-176
- /
- 2001
This paper discusses two important issues of corpus-based synthesis: synthesis unit generation based on phrase break strength information and pruning redundant synthesis unit instances. First, the new sentence set for recording was designed to make an efficient synthesis database, reflecting the characteristics of the Korean language. To obtain prosodic context sensitive units, we graded major prosodic phrases into 5 distinctive levels according to pause length and then discriminated intra-word triphones using the levels. Using the synthesis unit with phrase break strength information, synthetic speech was generated and evaluated subjectively. Second, a new pruning method based on weighted vector quantization (WVQ) was proposed to eliminate redundant synthesis unit instances from the synthesis database. WVQ takes the relative importance of each instance into account when clustering similar instances using vector quantization (VQ) technique. The proposed method was compared with two conventional pruning methods through objective and subjective evaluations of synthetic speech quality: one to simply limit the maximum number of instances, and the other based on normal VQ-based clustering. For the same reduction rate of instance number, the proposed method showed the best performance. The synthetic speech with reduction rate 45% had almost no perceptible degradation as compared to the synthetic speech without instance reduction.
PDF

검색결과 9건 처리시간 0.029초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)