• 제목/요약/키워드: Frequency Masking

검색결과 102건 처리시간 0.034초

음성 명료도 향상을 위한 학습 기반의 신호 대 잡음 비 추정을 이용한 이산 마스크 추정 방법 (Binary Mask Estimation using Training-based SNR Estimation for Improving Speech Intelligibility)

  • 김기백
    • 방송공학회논문지
    • /
    • 제17권6호
    • /
    • pp.1061-1068
    • /
    • 2012
  • 본 논문에서는 시간-주파수 영역에서의 이산 마스킹을 이용하여 잡음환경 음성의 음성 명료도를 높이는 방법에 대해 다루고자 한다. 잡음이 섞여 있는 음성신호를 시간-주파수 영역으로 분해하여, 상대적으로 잡음이 많이 섞여 있는 시간-주파수 영역의 신호를 마스크 "0"을 할당하여 제거함으로써 음성명료도를 향상시킬 수 있다. 이러한 이산 마스크를 추정하기 위해서는 각 시간-주파수 영역에서 신호 대 잡음 비를 추정하여 문턱값과 비교해야 하는데, 본 논문에서는 학습 기반의 신호 대 잡음 비 추정방법을 사용하여 문턱값과 비교하여 이산 마스크를 추정한다. 신호 대 잡음 비와 비교하기 위한 문턱값은 모든 주파수 대역에 대해 동일한 값을 이용하는 고정 문턱값 외에도 주파수 대역에 따라 학습 데이터의 분포로부터 최적의 값을 사용하는 최적 문턱값을 제안한다. 제안된 이산 마스크 추정 방법은 잡음 환경 데이터에 적용한 후, 피험자에게 들려주어 음성 명료도를 측정한다.

선 스펙트럼 주파수의 청각 적응 부호화 (Perceptual and Adaptive Quantization of Line Spectral Frequency Parameters)

  • 한우진;김은경;오영환
    • 한국음향학회지
    • /
    • 제19권8호
    • /
    • pp.68-77
    • /
    • 2000
  • 선 스펙트럼 주파수를 양자화하기 위한 대부분의 방법들이 가중 유클리드 거리에 기반하고 있는 반면, 본 논문에서는 청각 마스킹 효과에 기반한 에러 척도를 사용하여 선 스펙트럼 주파수를 효과적으로 양자화하는 방법을 제안하였다. 제안한 방법에서는 noise-to-mask ratio (NMR)를 선 스펙트럼 주파수의 양자화에 적합하도록 변형한 새로운 에러 척도를 유도하고, 이를 사용하여 선 스펙트럼 주파수를 양자화한다. 한편, 본 논문에서는 양자화하고자 하는 음성 프레임이 갖는 청각적인 특성을 고려하여 동적으로 비트를 할당하는 적응 양자화 알고리즘을 제안하였다. 성능 평가를 위해서 11948 프레임의 테스트 자료를 기존의 방법과 제안한 방법으로 각자 양자화하고 perceptually transparent frame의 비운 및 이때의 평균 비트율을 비교한 결과, 기존의 방법이 1800 bps의 비트율에서 89.9%의 perceptually transparent frame을 얻은 데 비해, 제안한 방법은 770 bps의 평균 비트율에서 95.5%의 perceptually transparent frame을 얻음으로써 제안한 방법이 효과적임을 보였다.

  • PDF

해양목장 대상 어류의 음향순치에 관한 기초적 연구 -1. 돌돔의 청각 특성- (A Basic Study on Acoustic Conditioning of Fish Suitable for a Marine Ranch -1. The Sound Sensitivity of Japanese Parrot Fish Oplegnathus fasciatus-)

  • 김성호;이창헌;서두옥;김용주
    • 한국수산과학회지
    • /
    • 제35권6호
    • /
    • pp.563-567
    • /
    • 2002
  • Developing base data on luring fish schools into netting position by the use of underwater audible sound on japanese parrot fish Oplegnathus fasciatus found in the coastal waters of Jeju Island, S. Korea. Auditory threshold was determined by the heartbeat condition technique using pure tones coupled with a delayed electric shock. The audible range of japanese parrot fish extended from 80 Hz to 500 Hz with a peak sensitivity at 200 Hz. The mean auditory thresholds at the frequencies of 80 Hz, 100 Hz, 200 Hz,300 Hz and 500 Hz were 104 dB, 95 dB, 91 dB, 99 dB and 113 dB, respectively. As the frequency became higher than 200 Hz, the auditory threshold increased almost linearly with increasing frequency. Critical ratios of fishes measured in the presence of masking noise in the spectrum level range of 69$\~$78 dB (0 dB re 1$\mu$Pa/$\sqrt{Hz}$) ranged from 21 dB to 40 dB at test frequencies. The noise spectrum level at the start of masking was about 70 dB within the test frequency range. The sound pressure level of 100$\~$200 Hz recognized by japanese parrot fish under the ambient noise is above 91 dB and the critical ratio for them is above 21 dB.

제주 연안에서 어획된 붕장어의 청각 능력 (Hearing Ability of Conger eel Conger myriaster caught in the Coast of jeju Island)

  • 안장영;박용석;최찬문;김석종;이창헌
    • 수산해양기술연구
    • /
    • 제48권4호
    • /
    • pp.479-486
    • /
    • 2012
  • In order to obtain the fundamental data about the behavior of conger by underwater audible sound, this experiment was carried out to investigate the hearing ability of Conger eel Conger myriaster which was in the coast of Jeju Island by heartbeat conditioning method using pure tones coupled with a delayed electric shock. The audible range of conger eel extended from 50Hz to 300Hz with a peak sensitivity at 80Hz including less sensitivity over 200Hz. The mean auditory thresholds of conger eel at the frequencies of 50Hz, 80Hz, 100Hz, 200Hz and 300Hz were 105dB, 92dB, 96dB, 128dB and 140dB, respectively. The positive response of conger eel was not evident after the sound projection of over 200Hz. At the results, the sensitive frequency range of conger eel is narrow in spite of swim bladder. Auditory masking was determined for Conger eel by using masking stimuli with the spectrum level range of about 60~70dB (0dB re $1{\mu}Pa/\sqrt{Hz}$). According to white noise level, the auditory thresholds increased as compared with thresholds in a quiet background noise including critical ratio at 68dB of white noise from minimum 26dB to maximum 30dB at test frequencies of 80Hz and 100Hz. The noise spectrum level at the start of masking was distributed at the range of about 68dB within 80~100Hz.

제주 연안에서 양식된 참조기의 청각 능력 (Hearing Ability of Redlip croaker Pseudosciaena polyactis cultured in the Coastal Sea of Jeju)

  • 안장영;김석종;최찬문;박용석;이창헌
    • 수산해양교육연구
    • /
    • 제28권2호
    • /
    • pp.384-390
    • /
    • 2016
  • The purpose of this paper is to improve the availability of underwater sound by the fundamental data on the hearing ability of Redlip croaker Pseudosciaena polyactis, which is cultured according to the cultivation technology, recently. The auditory thresholds of Redlip croaker were determined at 6 frequencies from 80Hz to 800Hz by heartbeat conditioning method using pure tones coupled with a delayed electric shock. The audible range of the Redlip croaker extended from 80Hz to 800Hz with the best sensitive frequency range including little difference in hearing ability from 80Hz to 500Hz. In addition, the auditory thresholds over 800Hz increased rapidly. The mean auditory thresholds of the Redlip croaker at the test frequencies from 80Hz to 800Hz were 90.7dB, 93.4dB, 92.9dB, 94.4dB, 95.5dB and 108dB, respectively. Auditory masking for the redlip croaker was measured using masking stimuli with the spectrum level range of about 66, 71, 75dB (0dB re $1{\mu}Pa/{\sqrt{Hz}}$). According to white noise level, the auditory thresholds increased as compared with thresholds in a quiet background noise. The Auditory masking by the white noise spectrum level was stared over about 70dB within 80~500Hz. Critical ratio ranged from minimum 20.7dB to maximum 25.5dB at test frequencies of 80Hz~500Hz.

웨이브릿 영상 압축을 위한 인간 시각 가중 양자화기의 설계 (A design of visual weighted quantizer for wavelet image compression)

  • 엄일규;김재호
    • 한국통신학회논문지
    • /
    • 제22권3호
    • /
    • pp.493-505
    • /
    • 1997
  • In this paper, a wavelet image compression method using human visually estimated quantizer is proposed. The quantizer has three components. These are constructed by using effects of frequency band, background luminance, and spatial masking. The first quantization factor is a fixed constant value for each band. The second factor is calculated by averaging four wavelet coefficients in the lowest frequency band. The third factor is determined by the difference between wavelet coefficients in the lowest frequency band. Arithmetic coding is used for encoding quantized wavelet coefficients. Coefficients in the lowest band are transmitted without loss. Therefore the compressed image is decompressed by using three quantization factors which can be calculated in the receiver. Compared with previous image compression methods which adopted human visual system, the proposed method shows improved results with less computational cost.

  • PDF

경계선 특성을 고려한 관측자 선호 선예도 개선 방법 (Observer Preferable Sharpness Enhancement Considering Distributions of Edge Characteristics)

  • 홍상기;정재영;김대희;조맹섭
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2002년도 하계종합학술대회 논문집(4)
    • /
    • pp.275-278
    • /
    • 2002
  • Sharpness enhancement, which strengthen the edge(high frequency) of image, is widely studied for image processing research area. In this paper, psychophysical experiment is conducted by the 20 observers with simple linear unsharp masking for sharpness enhancement. The experimental results extracted using z-score analysis and linear regression suggests observer preferable sharpness enhancement method for digital television.

  • PDF

구조적 왜곡특성 측정을 이용한 블록기반 DCT 영상 부호화기의 객관적 화질평가 (Objective Image Quality Metric for Block-Based DCT Image Coder Using Structural Distortion Measurement)

  • 정태윤
    • 대한전기학회논문지:시스템및제어부문D
    • /
    • 제52권7호
    • /
    • pp.434-441
    • /
    • 2003
  • This paper proposes a new quantitative and objective image quality metric which is essential to verify the performance of block-based DCT image coding. The proposed metric considers not only global distortion of coded image such as spatial frequency sensitivity and channel masking using HVS based multi-channel model, but also structural distortions caused block-based coding. The experimental results show a strong correlation between proposed metric and subjective metric.

구조적 왜곡특성 측정을 이용한 블록기반 DCT 영상 부호화기의 객관적 화질평가 (Objective Image Quality Metric for Block-Based DCT Image Coder-using Structural Distortion Measurement)

  • 정태윤
    • 대한전기학회논문지:전기물성ㆍ응용부문C
    • /
    • 제52권7호
    • /
    • pp.434-434
    • /
    • 2003
  • This paper proposes a new quantitative and objective image quality metric which is essential to verify the performance of block-based DCT image coding The proposed metric considers not only global distortion of coded image such as spatial frequency sensitivity and channel masking using HVS based multi-channel model, but also structural distortions caused block-based coding. The experimental results show a strong correlation between propose(B metric and subjective metric.

Robust video watermarking algorithm for H.264/AVC based on JND model

  • Zhang, Weiwei;Li, Xin;Zhang, Yuzhao;Zhang, Ru;Zheng, Lixin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제11권5호
    • /
    • pp.2741-2761
    • /
    • 2017
  • With the purpose of copyright protection for digital video, a novel H.264/AVC watermarking algorithm based on JND model is proposed. Firstly, according to the characteristics of human visual system, a new and more accurate JND model is proposed to determine watermark embedding strength by considering the luminance masking, contrast masking and spatial frequency sensitivity function. Secondly, a new embedding strategy for H.264/AVC watermarking is proposed based on an analysis on the drift error of energy distribution. We argue that more robustness can be achieved if watermarks are embedded in middle and high components of $4{\times}4$ integer DCT since these components are more stable than dc and low components when drift error occurs. Finally, according to different characteristics of middle and high components, the watermarks are embedded using different algorithms, respectively. Experimental results demonstrate that the proposed watermarking algorithm not only meets the imperceptibility and robustness requirements, but also has a high embedding capacity.