Search | Korea Science

Non-linear Predictive Method using Simplified Morphological Polynomial Transform and Morphological Interpolation (간략화된 형상학적 다항식 변환과 형상학적 보간을 이용한 배설형 예측 방법)

김수현;한헌수;홍민철;차형태
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2002.11a
- /
- pp.81-84
- /
- 2002
본 논문에서는 간략화 된 형상학적 다항식 변환(Morphological Polynomial Transform)과 형상학적 보간법(Morphological Interpolation)을 이용하는 비선형 예측 방법을 제안한다. 형상학적 다항식 변환은 형상학적 연산을 통해 데이터를 구조함수들의 계수들로 표현하는 변환이며, 형상학적 보간법은 형상학적 다항식 변환에 의한 계수들을 이용하여 보간하는 방법이다. 형상학적 다항식 변환을 간략화 하여 정수 연산만으로 적용할 수 있도록 개선하였으며, 보다 영상에 적합한 형상학적 보간법에 기반 한 예측 방법을 사용한다. 제안하는 예측 방법과 허프만 부호화를 사용하여 적은 비트로 영상을 손실 없이 저장할 수 있음을 실험으로 검증한다.
PDF

Method of a Multi-mode Low Rate Speech Coder Using a Transient Coding at the Rate of 2.4 kbit/s (전이구간 부호화를 이용한 2.4 kbit/s 다중모드 음성 부호화 방법)

Ahn Yeong-uk;Kim Jong-hak;Lee Insung;Kwon Oh-ju;Bae Mun-Kwan
- Journal of the Institute of Electronics Engineers of Korea SP
- /
- v.42 no.2 s.302
- /
- pp.131-142
- /
- 2005
The low rate speech coders under 4 kbit/s are based on sinusoidal transform coding (STC) or multiband excitation (MBE). Since the harmonic coders are not efficient to reconstruct the transient segments of speech signals such as onsets, offsets, non-periodic signals, etc, the coders do not provide a natural speech quality. This paper proposes method of a efficient transient model :d a multi-mode low rate coder at 2.4 kbit/s that uses harmonic model for the voiced speech, stochastic model for the unvoiced speech and a model using aperiodic pulse location tracking (APPT) for the transient segments, respectively. The APPT utilizes the harmonic model. The proposed method uses different models depending on the characteristics of LPC residual signals. In addition, it can combine synthesized excitation in CELP coding at time domain with that in harmonic coding at frequency domain efficiently. The proposed coder shows a better speech quality than 2.4 kbit/s version of the mixed excitation linear prediction (MELP) coder that is a U.S. Federal Standard for speech coder.
PDF KSCI

Improvement of the Linear Predictive Coding with Windowed Autocorrelation (윈도우가 적용된 자기상관에 의한 선형예측부호의 개선)

Lee, Chang-Young;Lee, Chai-Bong
- The Journal of the Korea institute of electronic communication sciences
- /
- v.6 no.2
- /
- pp.186-192
- /
- 2011
In this paper, we propose a new procedure for improvement of the linear predictive coding. To reduce the error power incurred by the coding, we interchanged the order of the two procedures of windowing on the signal and linear prediction. This scheme corresponds to LPC extraction with windowed autocorrelation. The proposed method requires more calculational time because it necessitates matrix inversion on more parameters than the conventional technique where an efficient Levinson-Durbin recursive procedure is applicable with smaller parameters. Experimental test over various speech phonemes showed, however, that our procedure yields about 5 % less power distortion compared to the conventional technique. Consequently, the proposed method in this paper is thought to be preferable to the conventional technique as far as the fidelity is concerned. In a separate study of speaker-dependent speech recognition test for 50 isolated words pronounced by 40 people, our approach yielded better performance too.
https://doi.org/10.13067/JKIECS.2011.6.2.186 인용 PDF KSCI

Quality Improvement of Low Bitrate HE-AAC using Linear Prediction Pre-processor (저 전송률 환경에서 선형예측 전처리기를 사용한 HE-AAC의 성능 향상)

Lee, Jae-Seong;Lee, Gun-Woo;Park, Young-Chul;Youn, Dae-Hee
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.34 no.8C
- /
- pp.822-829
- /
- 2009
This paper proposes a new method of improving the quality of High Efficiency Advanced Audio Coding (HE-AAC). HE-AAC encodes input source by allocating bits for each scalefactor bands appropriately according to human ear's psychoacoustic property. As a result, insufficient bits are assigned to the bands which have relatively low energy. This imbalance between different energy bands can cause decreasing of sound quality like musical noise. In the proposed system, a Linear Prediction (LP) module is combined with HE-AAC as a pre-processor to improve sound quality by even bits distribution. To apply accurate human being's psychoacoustic property, the psychoacoustic model uses Fast Fourier Transform (FFT) spectrum of original input signal to make masking threshold. In its implementation, masking threshold of psychoacoustic model is normalized using the LP spectral envelope in prior to quantization of the LP residual. Experimental result shows that, the proposed algorithm allocates bits appropriately for insufficient bits condition and improves the performance of HE-AAC.
PDF KSCI

A Study on the Pulse-Train Code Excited Linear Prediction Coder: PT-CELP (Pulse-Train code 여기 선형 예측 (PT-CELP) 부호화기에 관한 연구)

김흥국
- Proceedings of the Acoustical Society of Korea Conference
- /
- 1995.06a
- /
- pp.246-249
- /
- 1995
4.16kbps의 전송률을 갖는 음성 부호화기 구조에 관하여 기술한다. 제안된 음성 부호화기는 개방 회로 피치 검출기와 이로부터 생성된 pulse train을 코드북으로 갖는 CELP 부호화기이다. Pulse-Train codebook은 분석 프레임별로 부호화 및 복호화 양단에서 생성되며 음성의 피치 및 포만트 정보를 내포하고 있다. 구현된 PT-CELP는 random codebook 방식의 CELP에 비해 적은 크기로 codebook을 만들 수 있으며 음성의 특징을 충분히 반영하므로 합성된 음성의 음질을 향상시킬 수 있다.
PDF

시간특성을 고려한 음성신호의 발성율 검출에 관한 연구

김익성;서지호;배명진
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.109-111
- /
- 2004
발성율은 일정한 시간동안 발성되는 음성신호 내에 몇 개의 음절이 포함되어 있는 지를 나타낸다. 발성율은 화자마다 다르고 각 음소들의 특징에 따라 변화할 수 있다. 발성율의 사전 측정이 이루어 진다면 음성부호화 측면에서도 중용한 정보로 사용될 수 있다. 기존의 음성부호화기는 발성율에 관계없이 고정적인 분석 구간을 정하여 전송률을 결정하고 있다. 따라서, 발성율을 미리 측정한다면, 발성율이 느린 부분과 빠른 부분에 각기 다른 부호화 방법을 적용하여 음질을 향상할 수도 있고 전송률을 가변적으로 적용할 수 도 있게 된다. 정확한 발성율을 측정하기 위해서는 음절의 변화를 추정하여야 한다. 음절의 변화를 추정하기 위한 방법으로 음성신호의 에너지 포락선 측정법과 LSP를 이용한 측정법이 각각 제안된 바 있으나, 본 논문에서는 위 두 가지 방법을 혼합한 방법을 사용하였다. 에너지 변동은 음성신호의 시간영역 처리방법으로 LSP 파라미터는 음성신호의 선형예측 분석에 의해 구해질 수 있다.
PDF

Estimation of Weight Coefficients of Residual DPCM based on L1 Regularization in HEVC Format Range Extension (HEVC 확장 표준 내 Residual DPCM 을 위한 L₁ 정규화 기반의 가중 계수 추정 기법)

Ryu, Su-Kyung;Kang, Je-Won
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2016.06a
- /
- pp.373-374
- /
- 2016
Residual Differnetial pulse-code Modulation (RDPCM) 기법은 비디오의 압축을 위한 시간 및 공간 예측 후 남은 잔여신호를 인접 화소를 이용하여 추가적인 중복정보를 제거하는 기법을 의미한다. 본 논문에서는 우선 잔차 신호의 예측을 위하여 인접 화소 사이 선형 가중 합으로 예측 모델을 세우고, 각 가중치를 $L_1$ 정규화를 포함하는 비용함수를 통해 추정함으로써 보다 효율적인 부호화 성능을 제공하는 알고리즘을 제안한다.
PDF

A Very Low-Bit-Rate Analysis-by-Synthesis Speech Coder Using Zinc Function Excitation (Zinc 함수 여기신호를 이용한 분석-합성 구조의 초 저속 음성 부호화기)

Seo Sang-Won;Kim Jong-Hak;Lee Chang-Hwan;Jeong Gyu-Hyeok;Lee In-Sung
- The Journal of the Acoustical Society of Korea
- /
- v.25 no.6
- /
- pp.282-290
- /
- 2006
This paper proposes a new Digital Reverberator that models Analog Helical Coil Spring Reverberator for guitar amplifiers. While the conventional digital reverberators are proposed to provide better sound field mainly based on room acoustics, no algorithm or analysis of digital reverberators those model Helical Coil Spring Reverberator was proposed. Considering the fact that approximately $70{\sim}80$ percent of guitar amplifiers are still with Helical Coil Spring Reverberator, research was performed based not on Room Acoustics but on Helical Coil Spring Reverberator itself as an effector. After performing simulations with proposed algorithm, it was confirmed that the Digital Reverberator by proposed algorithm provides perceptually equivalent response to the conventional Analog Helical Coil Spring Reverberators.
https://doi.org/10.7776/ASK.2006.25.6.282 인용 PDF KSCI

The First Quantization Parameter Decision Algorithm for the H.264/AVC Encoder (H.264/AVC를 위한 초기 Quantization Parameter 결정 알고리즘)

Kwon, Soon-Young;Lee, Sang-Heon;Lee, Dong-Ha
- Journal of KIISE:Information Networking
- /
- v.35 no.3
- /
- pp.235-242
- /
- 2008
To improve video quality and coding efficiency, H.264/AVC adopted an adaptive rate control. But this method has a problem as it cannot predict an accurate quantization parameter(QP) for the first frame. The first QP is decided among four constant values by using encoder input parameters. It does not consider encoding bits, results in significant fluctuation of the image quality and decreases the average quality of the whole coded sequence. In this paper, we propose a new algorithm for the first frame QP decision in the H.264/AVC encoder. The QP is decided by the existing algorithm and the first frame is encoded. According to the encoded bits, the new initial QP is decided. We can predict optimal value because there is a linear relationship between encoded bits and the new initial QP. Next, we re-encode the first frame using the new initial QP. Experimental results show that the proposed algorithm not only achieves better quality than the state of the art algorithm, but also adopts a rate control forthe sequence that was impossible with the existing algorithm. By reducing fluctuation, subjective quality also improved.
PDF KSCI

Least Squares Based Adaptive Motion Vector Prediction Algorithm for Video Coding (동영상 압축 방식을 위한 최소 자승 기반 적응 움직임 벡터 예측 알고리즘)

Kim, Ji-hee;Jeong, Jong-woo;Hong, Min-Cheol
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.29 no.9C
- /
- pp.1330-1336
- /
- 2004
This paper addresses an adaptive motion vector prediction algorithm to improve the performance of video encoder. The block-based motion vector is characterized by non-stationary local statistics so that the coefficients of LS (Least Squares) based linear motion can be optimized. However, it requires very expensive computational cost. The proposed algorithm using LS approach with spatially varying motion-directed property adaptively controls the coefficients of the motion predictor and reduces the computational cost as well as the motion prediction error. Experimental results show the capability of the proposed algorithm.
PDF KSCI

Search Result 52, Processing Time 0.021 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)