Search | Korea Science

Audio Signal Coding Using Wavelet Transform (웨이블렛 변환을 이용한 오디오 코딩)

Bae, Seok-Mo;Kim, Do-Hyoung;Chung, Jae-Ho
- The Journal of the Acoustical Society of Korea / v.16 no.4 / pp.64-70 / 1997
This paper proposes a new wavelet-based audio coding scheme that reduces the complexity of the well-known MPEG (Moving Picture Experts Group) audio coder. The MPEG-Audio filter bank applies a subband technique to 16-bit PCM audio and produces a bitstream of subband samples using dynamic bit allocation. Using wavelet coefficients instead of subband samples, and 6 bands instead of the 32 bands of MPEG-Audio, reduces the complexity. The proposed compression algorithm is based on the wavelet transform and is compared with MPEG-Audio. At a bit rate of 256 kbps, the proposed algorithm maintains CD (Compact Disc) quality. Complexity was reduced by about 40% at the encoder and about 70% at the decoder.

Multimedia System for Streaming Time-Continuous Screen Images and Audio (시간 연속적인 스크린 이미지와 오디오의 스트리밍을 위한 멀티미디어 시스템)

Hwang, Ki-Tae
- The KIPS Transactions:PartB / v.9B no.2 / pp.181-190 / 2002
This paper proposes a motion-video multimedia system for computer applications such as remote lecturing, distance learning, and product demonstrations. These applications need a multimedia system that can author and play a motion video composed of computer screen images and audio that vary continuously over time, rather than real motion video. Because computer screen images differ from real-world video images in several respects, MPEG is not well suited as a compression algorithm for continuously varying computer screen images. In this paper, a new compression algorithm is proposed, and a multimedia system that authors and plays a motion-video file containing computer screen images and audio is implemented. The paper also presents a performance evaluation of both the compression algorithm and the implemented multimedia system.
https://doi.org/10.3745/KIPSTB.2002.9B.2.181

A Digital Audio Watermark Using Wavelet Transform and Masking Effect (웨이브릿과 마스킹 효과를 이용한 디지털 오디오 워터마킹)

Hwang, Won-Young;Kang, Hwan-Il;Han, Seung-Soo;Kim, Kab-Il;Kang, Hwan-Soo
- Proceedings of the IEEK Conference / 2003.11b / pp.243-246 / 2003
In this paper, we propose a new digital audio watermarking technique based on the wavelet transform. The watermark is embedded by eliminating unnecessary information from the audio signal based on the human auditory system (HAS). The method does not require any original audio information in the watermark extraction process. The masking effect used for watermarking is the post-temporal masking effect. We construct a window with the synchronization signal and extract the best frame in the window by using the zero-crossing rate (ZCR) and the energy of the audio signal. The watermark may be extracted by using the correlation between the watermark signal and the portion of the frame. Experimental results show good robustness against MPEG-1 Layer 3 compression and other common signal processing manipulations. All attacks are applied after D/A/D conversion.
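The two frame features named in this abstract, zero-crossing rate and energy, can be sketched as follows. This is only an illustration: the abstract does not say how the "best" frame is chosen, so the rule in `pick_frame` (highest energy, ties broken by lower ZCR) and all function names are assumptions, not the authors' method.

```python
def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    if len(frame) < 2:
        return 0.0
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def short_time_energy(frame):
    """Mean squared amplitude of the frame."""
    return sum(x * x for x in frame) / len(frame) if frame else 0.0


def split_frames(signal, frame_len):
    """Cut the signal into non-overlapping frames, dropping any remainder."""
    last_start = len(signal) - frame_len
    return [signal[i:i + frame_len] for i in range(0, last_start + 1, frame_len)]


def pick_frame(signal, frame_len):
    """Hypothetical selection rule (an assumption, not from the paper):
    prefer the highest-energy frame, breaking ties by lower ZCR."""
    frames = split_frames(signal, frame_len)
    scores = [(short_time_energy(f), -zero_crossing_rate(f)) for f in frames]
    return max(range(len(frames)), key=lambda i: scores[i])
```

A high-energy, low-ZCR frame is a common heuristic for locating a strong tonal or voiced segment; the paper may weight the two features differently.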

A Perceptual Audio Coder Based on Temporal-Spectral Structure (시간-주파수 구조에 근거한 지각적 오디오 부호화기)

김기수;서호선;이준용;윤대희
- Journal of Broadcast Engineering / v.1 no.1 / pp.67-73 / 1996
In general, high-quality audio coding (HQAC) combines conventional data compression techniques with models of human perception. The primary auditory characteristic applied in HQAC is the masking effect in the spectral domain, so spectral techniques such as subband coding or transform coding are widely used [1][2]. However, no effort has yet been made to apply the temporal masking effect and temporal redundancy removal in HQAC. The audio data compression method proposed in this paper eliminates statistical and perceptual redundancies in both the temporal and spectral domains. The transformed audio signal is divided into packets, each consisting of 6 frames. A packet contains 1536 samples ($256 \times 6$), and the redundancies in a packet reside in both the temporal and spectral domains. Both redundancies are eliminated at the same time in each packet. The psychoacoustic model has been improved to give finer results by taking into account temporal masking as well as fine spectral masking. For quantization, each packet is divided into subblocks designed to correspond to the nonlinear critical bands and to reflect the temporal auditory characteristics. Consequently, high quality of the reconstructed audio is preserved at low bit rates.

An Efficient Representation Method for ICLD with Robustness to Spectral Distortion

Beack, Seung-Kwon;Seo, Jeong-Il;Kang, Kyung-Ok;Hanh, Min-Soo
- ETRI Journal / v.27 no.3 / pp.330-333 / 2005
The Inter-Channel Level Difference (ICLD) is a cue parameter used to estimate spectral information in binaural cue coding, which has recently been in the spotlight as a multichannel audio signal compression technique. Although the ICLD is an essential parameter, it is generally distorted by quantization. In this paper, a new modified ICLD representation method that minimizes the quantization distortion is proposed by adopting a flexible determination of the reference channel and unidirectional quantization. Our experimental results confirm that the proposed method improves the multichannel audio output quality even at a reduced bit rate.
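As a rough illustration of the quantities involved, the sketch below computes ICLDs for one subband as power ratios in dB against a reference channel. Taking the strongest channel as the reference, so that every ICLD is non-positive and can be quantized in one direction, is one plausible reading of "flexible determination of the reference channel" and "unidirectional quantization"; the abstract does not confirm it, and the function name `icld_db` is hypothetical.

```python
import math


def icld_db(band_powers):
    """Return (reference_index, ICLDs in dB) for one subband.

    band_powers: per-channel signal power in the subband, all > 0.
    Assumption (not confirmed by the abstract): the reference is the
    channel with the largest power, so every ICLD is <= 0 dB.
    """
    ref = max(range(len(band_powers)), key=lambda c: band_powers[c])
    iclds = [10.0 * math.log10(p / band_powers[ref]) for p in band_powers]
    return ref, iclds
```

With a fixed reference channel the ICLDs can take either sign, which doubles the range a quantizer has to cover; a per-band reference of this kind halves it at the cost of signalling which channel was chosen.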

A public key audio watermarking using patchwork algorithm

Hong, Doo-Gun;Park, Se-Hyoung;Jaeho Shin
- Proceedings of the IEEK Conference / 2002.07a / pp.160-163 / 2002
This paper presents a statistical technique for audio watermarking. We describe the application of a promising public-key watermarking method to the patchwork algorithm. Its detection process needs neither the original content nor the secret key used in the embedding process. Special attention is given to statistical methods working in the frequency domain. The proposed scheme extends patchwork audio watermarking to enable public detection of the watermark, giving a robust watermarking solution for audio data. Experimental results show good robustness of the approach against MP3 compression and other common signal processing manipulations.
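For orientation, here is a minimal sketch of the classical patchwork idea that the paper builds on: a key selects two pseudo-random index sets, one set is raised and the other lowered by a small amount, and detection compares the two set means. This is the ordinary secret-key variant, not the public-key extension the abstract describes. The function names, and applying the change to a generic list of coefficients, are assumptions made for illustration.

```python
import random


def pick_patches(n_samples, n_pairs, key):
    """Derive two disjoint pseudo-random index sets from the key."""
    rng = random.Random(key)
    idx = rng.sample(range(n_samples), 2 * n_pairs)
    return idx[:n_pairs], idx[n_pairs:]


def embed(values, n_pairs, key, d):
    """Raise patch A by d and lower patch B by d; returns a new list."""
    a, b = pick_patches(len(values), n_pairs, key)
    marked = list(values)
    for i in a:
        marked[i] += d
    for i in b:
        marked[i] -= d
    return marked


def detect_statistic(values, n_pairs, key):
    """Mean of patch A minus mean of patch B.

    Expected to be near 2*d for marked data and near 0 otherwise.
    """
    a, b = pick_patches(len(values), n_pairs, key)
    mean_a = sum(values[i] for i in a) / n_pairs
    mean_b = sum(values[i] for i in b) / n_pairs
    return mean_a - mean_b
```

Detection here needs the same key that was used for embedding, which is exactly the limitation a public-key scheme is meant to remove.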

Implementation and evaluation of stereo audio codec using perceptual coding (지각 부호화를 이용한 스테레오 오디오 코덱의 구현 및 음질 평가)

차경환;장대영;홍진우;김천덕
- Journal of the Korean Institute of Telematics and Electronics B / v.33B no.4 / pp.156-163 / 1996
In this paper, we describe the implementation and sound quality assessment of a real-time stereo audio codec using the TMS320C40 DSP (digital signal processing) chip for low-bit-rate, high-quality audio. We implemented hardware and software to overcome the real-time processing problem caused by the heavily recursive computation and complexity of the audio compression algorithm. We studied five types of distortion that can be produced by perceptual coding, and the codec was evaluated with eight test music items selected from SQAM (sound quality assessment material) 422-2-4-2 produced by the EBU (European Broadcasting Union). Subjective listening tests of the codec quality and performance were carried out by the double-blind method in a listening room with eleven listeners. As a result, the codec scored under minus one on the 5-grade impairment scale, and its quality was rated as perceptible, but not annoying.

Reversible Watermarking for Audio Using Recompression Method (재압축 기술을 이용한 오디오 파일에서의 가역 정보은닉)

Whang, Ho Young;Kim, Hyoung Joong
- Journal of Digital Contents Society / v.14 no.2 / pp.199-206 / 2013
Various methods of data compression have been developed to handle data within limited storage capacity and limited transmission speed. Recompression, the most recent among them, is a technology that can embed data regardless of the information entropy of the data. It separates the original multimedia data into blocks and embeds a 0 or 1 according to whether each block is flipped. In this paper, the technology is applied to audio files, and reversible watermarking for audio files is implemented.
https://doi.org/10.9728/dcs.2013.14.2.199

A Complexity Reduction Method of MPEG-4 Audio Lossless Coding Encoder by Using the Joint Coding Based on Cross Correlation of Residual (여기신호의 상관관계 기반 joint coding을 이용한 MPEG-4 audio lossless coding 인코더 복잡도 감소 방법)

Cho, Choong-Sang;Kim, Je-Woo;Choi, Byeong-Ho
- Journal of the Institute of Electronics Engineers of Korea SP / v.47 no.3 / pp.87-95 / 2010
Portable multimedia products that deliver the highest audio quality by using lossless audio codecs have been released, and the international lossless codecs MPEG-4 audio lossless coding (ALS) and MPEG-4 scalable lossless coding (SLS) were standardized by MPEG in 2006. The simple profile of MPEG-4 ALS, which supports up to stereo, was defined by MPEG in 2009. A lossless audio codec must have low complexity in stereo to be widely used in portable multimedia products. However, previous research on MPEG-4 ALS has focused on improving the compression ratio, reducing complexity in multichannel coding, and selecting the order of the linear prediction coefficients (LPCs). In this paper, the complexity and compression ratio of the MPEG-4 ALS encoder are analyzed in the simple profile, and a method to reduce the encoder's complexity is proposed. Based on this analysis, the complexity of the short-term prediction filter of the MPEG-4 ALS encoder is reduced by using the low-complexity filter proposed in previous research to reduce the complexity of the MPEG-4 ALS decoder. We also propose a joint coding decision method that reduces the complexity while keeping the compression ratio of the MPEG-4 ALS encoder. In the proposed method, whether to perform joint coding is decided based on the relation between the cross-correlation of the residual and the compression ratio of joint coding. The performance of the MPEG-4 ALS encoder with the proposed method and the low-complexity filter is evaluated by using the MPEG-4 ALS conformance test files and ordinary music files. The complexity of the encoder is reduced by about 24% compared with the MPEG-4 ALS reference encoder, while the compression ratio of the proposed method is comparable to that of the reference encoder.

A study on the Perceptual Model for MPEG II AAC Encoder (MPEG-II AAC Encoder의 perceptual Model에 관한 연구)

구대성;김정태;이강현
- Proceedings of the IEEK Conference / 2000.06c / pp.93-96 / 2000
Compression methods are currently among the most important technologies in the multimedia society, and audio files are spreading rapidly through the Internet. MP3 offers CD-level sound quality at 128 kbps, but its quality drops abruptly below 64 kbps, so it needs a relatively high bit rate. MPEG-II AAC (Advanced Audio Coding), on the other hand, is not compatible with MPEG-I, but it achieves a compression ratio about 1.4 times better than MP3. In particular, AAC supports up to 7.1 channels and a 96 kHz sampling rate. This paper deals with the perceptual model at a 44.1 kHz sampling rate for the SMR (Signal-to-Masking Ratio).
