통합 검색 | Korea Science

효율적인 하모닉-CELP 구조를 갖는 저 전송률 음성 부호화기 (Efficient Harmonic-CELP Based Low Bit Rate Speech Coder)

최용수;김경민;윤대희
- 한국음향학회지
- /
- 제20권5호
- /
- pp.35-47
- /
- 2001
본 논문에서는 하모닉 부호화기와 CELP(Code Excited Linear Prediction) 부호화기의 장점을 고려한 효율적인 저 전송률 하모닉-CELP 음성 부호화기를 제안한다. 제안된 하모닉-CELP 부호화기에서는 프레임 단위 유/무성음 판별에 따라 무성음 구간에서는 고속 CELP방식으로 부호화하고 유성음 구간에서는 개선된 하모닉 부호화를 수행한다. 제안된 부호화기는 무성음 부호화를 위한 RP-VSELP(Regular Pulse Vector Sum Excited Linear Prediction), 유성음 부호화를 위한 간단한 정수 피치 검색, 정수 단위 피치에서의 고속 하모닉 추정, 가변 차원 하모닉 벡터 양자화, 주파수 해상도를 반영한 인지 가중치, 고속 하모닉 합성, 대역별 유성음 정도에 따른 자연성 제어, 다중 모드 등을 주요한 특징으로 하며, 이러한 특징들로 인해 기존의 HVXC(Harmonic Vector eXeited Coder) 부호화기에 비해서 매우 낮은 복잡도를 갖는다. 주관적인 음질 평가 결과, 제안된 2.4 kbps 하모닉-CELP 부호화기는 낮은 지연과 적은 계산량으로 양호한 음질을 얻을 수 있음을 확인하였다.
PDF

Complexity Reduction Algorithm of Speech Coder(EVRC) for CDMA Digital Cellular System

Min, So-Yeon
- 한국멀티미디어학회논문지
- /
- 제10권12호
- /
- pp.1551-1558
- /
- 2007
The standard of evaluating function of speech coder for mobile telecommunication can be shown in channel capacity, noise immunity, encryption, complexity and encoding delay largely. This study is an algorithm to reduce complexity applying to CDMA(Code Division Multiple Access) mobile telecommunication system, which has a benefit of keeping the existing advantage of telecommunication quality and low transmission rate. This paper has an objective to reduce the computing complexity by controlling the frequency band nonuniform during the changing process of LSP(Line Spectrum Pairs) parameters from LPC(Line Predictive Coding) coefficients used for EVRC(Enhanced Variable-Rate Coder, IS-127) speech coders. Its experimental result showed that when comparing the speech coder applied by the proposed algorithm with the existing EVRC speech coder, it's decreased by 45% at average. Also, the values of LSP parameters, Synthetic speech signal and Spectrogram test result were obtained same as the existing method.
PDF

음성 및 오디오 부호화기를 위한 저지연 윈도우 스위칭 modified discrete cosine transform (Low delay window switching modified discrete cosine transform for speech and audio coder)

김영준;이인성
- 한국음향학회지
- /
- 제37권2호
- /
- pp.110-117
- /
- 2018
본 논문에서는 음성/오디오 부호화기를 위한 저지연 윈도우 스위칭 MDCT(Modified Discrete Cosine Transform) 방법을 제안한다. 윈도우 스위칭 알고리즘을 사용하여 신호의 특성이 빨리 변하는 전이 구간에서 음질 저하를 개선하고, 저지연 TDAC(Time Domain Aliasing Cancellation)를 사용하여 알고리즘 지연을 1/2로 줄일 수 있는 MDCT 방법을 제안한다. 제안된 윈도우 스위칭 방법은 기존 윈도우 스위칭 알고리즘이 다른 길이의 중첩합(overlap-add)을 사용하는 것과 달리, 일정한 길이의 중첩합을 사용하여 알고리즘 지연을 1/2로 줄일 수 있었고, 신호의 특성에 따라 윈도우의 종류를 2개로 줄여 프레임 상태를 표현하는 정보 비트를 1 bit 감소시킬 수 있었다. 제안한 알고리즘을 MDCT 기반의 음성/오디오 부호화기인 ITU-T(International Telecommunication Union - Telecommunication) G.729.1 부호화기에 적용하여 성능을 확인하였으며, 알고리즘 지연을 절반으로 감소시키면서 동일한 음질을 유지할 수 있었다.
https://doi.org/10.7776/ASK.2018.37.2.110 인용 PDF KSCI

견실, 저지연 멀티트리 9.6Kbits/s 음성부호기에 관한 연구 (Robust, Low Delay Multi-tree Speech Coding at 9.6Kbits/sec)

우홍체;문병현;이채욱
- 한국통신학회논문지
- /
- 제18권3호
- /
- pp.348-354
- /
- 1993
본 논문에서는 음성의 short-term 계수 추출에 대한 새로운 방식을 제안하였으며, 데이타량 9.6Kbits/sec의 멀티 트리 부호기를 실현하였다. 이 트리 부호기는 총 지연시간 2.5msec을 (6.4KHz 샘플링 주파수에서 16샘플) 가지며, 좋은 출력 음질을 가지며, bit 오욜 (BER) $10^{-3}$에서도 견실한 상태를 유지한다. 이 견실성은 short-term 계수 추출을 위해 수신된 여기 신호를 smoothing 하여, 병렬 구성과 함께 사용하므로 가능 하였다. 이 부호기의 출력 음성은 SNR, SNRSEG, 그리고 듣기 시험으로 평가 되었다.
PDF

가변 지연 MDCT/IMDCT를 이용한 오디오/음성 코덱 (Audio /Speech Codec Using Variable Delay MDCT/IMDCT)

이상길;이인성
- 한국정보전자통신기술학회논문지
- /
- 제16권2호
- /
- pp.69-76
- /
- 2023
MDCT/IMDCT 과정을 사용하는 고품질 오디오/음성 코덱은 이전 프레임 과의 중첩-합(Overlap-add) 과정을 통해 현재 프레임을 완벽 복원 가능하다. 중첩-합 과정에서 프레임 길이 만큼의 알고리즘 지연이 발생하게 된다. 본 논문에서는 알고리즘 지연을 줄이기 위해 MDCT/IMDCT에 가변적인 위상변이를 사용하여 알고리즘 지연을 줄인 MDCT/IMDCT 과정을 제안한다. 가변 지연 MDCT/IMDCT알고리즘을 ITU-T 표준 코덱 G.729.1 코덱에 적용하여 저지연 오디오/음성 코덱을 제안하였다. MDCT/IMDCT 과정에서의 알고리즘 지연은 기존 20 ms에서 1.25ms 까지 감소시킬 수 있다. 저지연 MDCT/IMDCT를 적용한 오디오/음성 코덱의 복호화된 출력신호는 객관적 음질 시험 방법인 PESQ 시험을 통해 성능 평가하였다. 전송 지연이 감소 됨에도 불구하고 기존 방법과 음질 차이가 없음을 확인할 수 있었다.
https://doi.org/10.17661/jkiiect.2023.16.2.69 인용 PDF HTML

프레임 분류와 합성필터의 변형을 이용한 적은 지연을 갖는 음성 부호화기의 성능 (Improving LD-CELP using frame classification and modified synthesis filter)

임은희;이주호;김형명
- 한국통신학회논문지
- /
- 제21권6호
- /
- pp.1430-1437
- /
- 1996
중간 주파수 대역(8kbps) 이하에서 적은 지연을 갖는 벡터여기 선형예측 음성 부호화기(LD-CELP)에 대하여 고려한다. 합성필터를 입력 프레임의 종류에 따라 변화시켜 음성 부호화기의 성능을 향상시키고자 한다. 먼저 프레임을 유성음과 무성음 그리고 개시 프레임으로 분류한다. 유성음과 무성음 프레임에서는 합성필터의 스펙트럼 포락을 음운의 특성에 적합하도록 변화시킨다. 개시 프레임에서는 합성필터의 성격을 바꾸어주기 위하여 바이어스 필터를 이용한다. 제안된 부호화기는 다른 적은 지연을 갖는 벡터여기 선형예측 음성 부호화기들에 비하여 비슷한 지연시간을 갖으면서 더 나은 음질을 제공하였다.
PDF

저비트율 잉여오디오 정보를 이용한 손실 패킷 복구 방법의 구현 및 성능 평가 (Implementation and evaluation of lost packet recovery using low-bitrate redundant audio data)

박준석;고대식
- 전자공학회논문지S
- /
- 제35S권7호
- /
- pp.1-5
- /
- 1998
In this paper, recovery method with high-bitrate and low-bitrate coder was implemented in order to recover consecutive packet loss over the Internet. LPC was used as redundant audio data for recover of lost packets and RTP parcket format was modified for accommodation of redundant data. In measuring results using random packet loss rate with three redundant datra in every packet, it has shown that recovery rate was 80% in los rate of 50%. Since the processing delay for recovery of the lost packet was 200ms, this recovery method can be applied to real-time Internet sevice such as Internet phone.
PDF

AMR 기반 저 전력 인공 대역 확장 기술 개발 (Developing a Low Power BWE Technique Based on the AMR Coder)

구본강;박희완;주연재;강상원
- 한국음향학회지
- /
- 제30권4호
- /
- pp.190-196
- /
- 2011
대역폭 확장 (Bandwidth Extension)은 300-3400 Hz 대역의 협대역 음성 신호를 50-7000 Hz 대역의 광대역 음성신호로 확장하여 협대역 음성신호의 음질과 명료도를 높이는 기술이다. 본 논문에서는 협대역 음성 정보만을 이용해서 광대역 음성신호를 추정하는 인공 대역폭 확장 기술을 설계하여, ITU-T 협대역 표준 음성 코덱인 AMR (adaptive multi-rate) 복호화기에 내장시킴 (embedded)으로써, 대역폭 확장 모듈에서의 LPC 분석 및 LSP 해석과 관련된 계산량을 감소시켰고, 알고리즘 지연도 줄였다. 그리고 SDS (single distance search) 고속 탐색 방식을 대역폭 확장 시스템의 코드북 매핑에 적용하여, 최종적으로 저 전력 대역 확장 AMR 복호화기를 설계하였다. 제안된 대역폭 확장 방법은 AMR 복호화기 후단에 독립적으로 설치되는 기존 DTE (decode then extend)방식에 비해 28 % 정도의 계산량을 줄이고 알고리즘 지연도 20 msec 줄였다. 또한 제안방식은 피치정보를 이용한 classified 코드북 매핑 방식을 사용하여 스펙트럼 포락선을 확장하였고, 코드 벡터 탐색 시 가중치를 적용하여 광대역 합성 음성의 성능을 향상시켰다.
https://doi.org/10.7776/ASK.2011.30.4.190 인용 PDF KSCI

Inter-layer Texture and Syntax Prediction for Scalable Video Coding

Lim, Woong;Choi, Hyomin;Nam, Junghak;Sim, Donggyu
- IEIE Transactions on Smart Processing and Computing
- /
- 제4권6호
- /
- pp.422-433
- /
- 2015
In this paper, we demonstrate inter-layer prediction tools for scalable video coders. The proposed scalable coder is designed to support not only spatial, quality and temporal scalabilities, but also view scalability. In addition, we propose quad-tree inter-layer prediction tools to improve coding efficiency at enhancement layers. The proposed inter-layer prediction tools generate texture prediction signal with exploiting texture, syntaxes, and residual information from a reference layer. Furthermore, the tools can be used with inter and intra prediction blocks within a large coding unit. The proposed framework guarantees the rate distortion performance for a base layer because it does not have any compulsion such as constraint intra prediction. According to experiments, the framework supports the spatial scalable functionality with about 18.6%, 18.5% and 25.2% overhead bits against to the single layer coding. The proposed inter-layer prediction tool in multi-loop decoding design framework enables to achieve coding gains of 14.0%, 5.1%, and 12.1% in BD-Bitrate at the enhancement layer, compared to a single layer HEVC for all-intra, low-delay, and random access cases, respectively. For the single-loop decoding design, the proposed quad-tree inter-layer prediction can achieve 14.0%, 3.7%, and 9.8% bit saving.
https://doi.org/10.5573/IEIESPC.2015.4.6.422 인용 PDF KSCI

검색결과 9건 처리시간 0.023초

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)