Search | Korea Science

Tonality Detection based on Spectrum Energy in Perceptual Audio Coder (지각 오디오 부호화기에서의 스펙트럼 에너지 기반 톤 성분 검출 알고리듬)

이근섭;연규철;박영철;윤대희
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.29 no.6C
- /
- pp.770-776
- /
- 2004
The goal of perceptual audio coder is to reduce redundancy and irrelevancy of audio signal based on the concept of masking. Several studies on masking effect reveal that the masking threshold varies as a function of the noise-like or tone-like nature of audio signals. Therefore, tonality of audio signal influences significantly the quality and efficiency of perceptual audio coder In this paper, we propose a new effective algorithm for tonality measure using spectrum energy. Since the proposed algorithm consists of a few transcendental functions and simple operations, it has lower complexity than MPEG psychoacoustic model-II. The proposed algorithm was tested with some audio signals, and DSP implementation showed that the proposed algorithm could be implemented with 3 MIPS. These results illustrate the efficiency of proposed algorithm in both performance and complexity.
PDF KSCI

Sinusoidal Modeling of Audio Signals Using Perceptually Weighted Matching Pursuit (지각적으로 가중된 매칭 퍼슈잇을 이용한 오디오 신호의 정현파 모델링)

김연지;이인성
- The Journal of the Acoustical Society of Korea
- /
- v.22 no.2
- /
- pp.96-103
- /
- 2003
This paper describes a method for sinusoidal modeling of audio signals using perceptually weighted matching pursuit. Matching pursuits extracts iteratively the greatest energy signals from the input signals until the residual between the original and the reconstructed signal is zero. In this paper, perceptual matching pursuits using psychoacoustic model to matching pursuit extracts greatest perceived energy iteratively. To evaluate the performance of the perceptual matching pursuits it is compared with the sinusoidal matching pursuits which is not included perceptual weighting. For various audio signals the result of simulation shows that the perceptual matching pursuit is superior to the sinusoidal matching pursuits, especially for a high change rate in time domain it can synthesized original signal.
PDF KSCI

A Reversible Audio Watermarking Scheme

Kim, Hyoung-Joong;Sachnev, Vasiliy;Kim, Ki-Seob
- Journal of The Institute of Information and Telecommunication Facilities Engineering
- /
- v.5 no.1
- /
- pp.37-42
- /
- 2006
A reversible audio watermarking algorithm is presented in this paper. This algorithm transforms the audio signal with the integer wavelet transform first in order to enhance the correlation between neighbor audio samples. Audio signal has low correlation between neighbor samples, which makes it difficult to apply difference expansion scheme. Second, a novel difference expansion scheme is used to embed more data by reducing the size of location map. Therefore, the difference expansion scheme used in this paper theoretically secures high embedding capacity under low perceptual distortion. Experiments show that this scheme can hide large number of information bits and keeps high perceptual quality.
PDF

Implementation and evaluation of stereo audio codec using perceptual coding (지각 부호화를 이용한 스테레요 오디오 코덱의 구현 및 음질 평가)

차경환;장대영;홍진우;김천덕
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.33B no.4
- /
- pp.156-163
- /
- 1996
In this paper, we described the implementation and the sound quality assessment of a real-time stereo audio codec using TMS320C40 DSP (digital signal processing) chip for low bitrte and high quality audio. We implemented hardware and software in order to overcome a real-time processing problem of audio compression algorithm that can be produced by largely recursive computing and complexity of the process. We have studied five types of distortion that can be produced by perceptual coding and the codec was evaluated by eight test musics that are selected in SQAM (sound quality assessment material) 422-2-4-2 produced by EBU (european broadcast union). The subjective listening tests were carried out on the codec quality and preformance by double blind method in a listening room with eleven listeners. As a result, 5 grade-impairment scale was scored under minus one and the codec quality was evaluated to be perceptible, but not annoying.
PDF

A Study on the Audio watermarking for High Quality Digital Audio (고음질 오디오를 위한 디지털 오디오 워터마킹에 관한 연구)

김정태;구대성;이강현
- Proceedings of the IEEK Conference
- /
- 2000.06c
- /
- pp.125-128
- /
- 2000
In this paper, we proposed the high quality digital audio watermarking algorithm in the frequency domain. The spread spectrum technique is used to encrypted a stream of information by spreading the data as much of the frequency spectrum as possible. It's technique adapt well to data hiding in audio signal. We have used the perceptual model and MDCT/IMDCT for the high qualify digital audio watermarking. The proposed watermark algorithm preserved high quality audio data from watermark signal.
PDF

Temporal-perceptual Judgement of Visuo-Auditory Stimulation (시청각 자극의 시간적 인지 판단)

Yu, Mi;Lee, Sang-Min;Piao, Yong-Jun;Kwon, Tae-Kyu;Kim, Nam-Gyun
- Journal of the Korean Society for Precision Engineering
- /
- v.24 no.1 s.190
- /
- pp.101-109
- /
- 2007
In situations of spatio-temporal perception about visuo-auditory stimulus, researches propose optimal integration hypothesis that perceptual process is optimized to the interaction of the senses for the precision of perception. So, when the visual information considered generally dominant over any other sense is ambiguous, the information of the other sense like auditory stimulus influences the perceptual process in interaction with visual information. Thus, we performed two different experiments to certain the conditions of the interacting senses and influence of the condition. We consider the interaction of the visuo-auditory stimulation in the free space, the color of visual stimulus and sex difference of testee with normal people. In first experiment, 12 participants were asked to judge the change in the frequency of audio-visual stimulation using a visual flicker and auditory flutter stimulation in the free space. When auditory temporal cues were presented, the change in the frequency of the visual stimulation was associated with a perceived change in the frequency of the auditory stimulation as the results of the previous studies using headphone. In second experiment, 30 male and 30 female were asked to judge the change in the frequency of audio-visual stimulation using a color of visual flicker and auditory flutter stimulation. In the color condition using red and green. Both male and female testees showed same perceptual tendency. male and female testees showed same perceptual tendency however, in case of female, the standard deviation is larger than that of male. This results implies that audio-visual asymmetry effects are influenced by the cues of visual and auditory information, such as the orientation between auditory and visual stimulus, the color of visual stimulus.
PDF KSCI

A study on the Perceptual Model for MPEG II AAC Encoder (MPEG-II AAC Encoder의 perceptual Model에 관한 연구)

구대성;김정태;이강현
- Proceedings of the IEEK Conference
- /
- 2000.06c
- /
- pp.93-96
- /
- 2000
Currently, the most important technology is the compression methods in the multimedia society. Audio files are rapidly propagated through internet. MP-3 is offered to CD tone quality in 128Kbps, but 64Kbps below tone quality is abruptly down and high bitrate. on the other hand, MPEG-II AAC (Advanced Audio Coding) is not compatible with MPEG-I, but AAC has a high compression ratio 1.4 better than MP-3. Especially, AAC has max. 7.1 channel and 96KHz sampling rate. In this paper, the perceptual model is dealt with 44.1KHz sampling rate for SMR(Signal to Masking Ratio)
PDF

Enhanced Adjustment Strategy of Masking Threshold for Speech Signals in Low Bit-Rate Audio Coding (저전송률 오디오 부호화에서 음성 신호의 성능 개선을 위한 마스킹 임계값 적응기법 향상)

Lee, Chang-Heon;Kang, Hong-Goo
- The Journal of the Acoustical Society of Korea
- /
- v.29 no.1
- /
- pp.62-68
- /
- 2010
This paper proposes a new masking threshold adjustment strategy to improve the performance for speech signals in low bit-rate audio coding. After determining formant regions, the masking threshold is adjusted by using the energy ratio of each sub-band to the average energy of each formant. More quantization noises are added to the bands that have relatively large energy, but less distortion is allowed in spectral valley regions by allocating more bits, which reflects the concept of perceptual weighting widely used in speech coding. From the results of objective speech quality measure, we verified that the proposed method improves quality for the speech input signals compared to the conventional one.
https://doi.org/10.7776/ASK.2010.29.1.062 인용 PDF KSCI

Signal Quality Enhancement using Perceptual Convolutional Noise Suppression (지각형 컨벌루션 잡음 제어를 통한 음질 개선 방법)

김헌중;한헌수;홍민철;차형태
- Journal of Broadcast Engineering
- /
- v.8 no.1
- /
- pp.11-18
- /
- 2003
In this paper, we introduce a novel signal quality enhancement algorithm with a perceptual interference analysis and perceptual convolutional noise suppression. A perceptual convolutional noise is reflected in the audible disturbance that can still be recognized after the additional noise suppression and tonality change which is caused by the noise energy excitation. The enhancement system is organized with a perceptual additional noise suppression part and a perceptual convolutional noise suppression part. Experimental results show that these two parts have an equivalent quality enhancement performance.
PDF KSCI

Digital Audio Watermarking Scheme Using Perceptual Modeling (지각 모델링을 이용한 디지털 오디오 워터마킹 방법)

석종원;홍진우
- Journal of Broadcast Engineering
- /
- v.6 no.2
- /
- pp.195-202
- /
- 2001
As a solution for copyright protection of digital multimedia contents, digital watermark technology is now drawing the attention. In this paper, we presented two novel audio watermarking algorithms as a solution for protecting unauthorized copy of digital audio. Proposed watermarking schemes include the psychoacoustic model of MPEG audio coding to achieve the perceptual transparency after watermark embedding and preprocessing procedure before correlation in watermark detection to extract copyright information without access to the original audio signal. Experimental results show that our watermarking scheme is robust to common signal Processing attacks and it Introduces no audible distortion after watermark insertion.
PDF

Search Result 74, Processing Time 0.019 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)