• Title/Summary/Keyword: Frequency Masking

Search Result 102, Processing Time 0.042 seconds

Auditory Characteristics of Tiger shark Scyliorhinus torazame caught in the Coast of jeju Island (제주 연안에서 어획된 두툽상어의 청각 특성)

  • Ahn, Jang-Young;Choi, Chan-Moon;Lee, Chang-Heon
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.47 no.3
    • /
    • pp.234-240
    • /
    • 2011
  • In order to obtain the fundamental data about the behavior of sharks by underwater audible sound, this experiment was carried out to investigate the auditory characteristics of tiger shark Scyliorhinus torazame which was caught in the coast of Jeju Island by heart rate conditioning method using pure tones coupled with a delayed electric shock. The audible range of tiger shark extended from 80Hz to 300Hz with a peak sensitivity at 80Hz including less sensitivity at 300Hz. The mean auditory thresholds of tiger shark at the frequencies of 80Hz, 100Hz, 200Hz and 300Hz were 90dB, 103dB, 94dB and 115dB, respectively. The positive response of tiger shark was not evident after the sound projection of over 300Hz. At the results, the sensitive frequency range of tiger shark is narrower than that of fish that has swim bladder. In addition, it is assumed that the most sensitive frequency in auditory thresholds of Chondrichthyes is lower than that of Osteichthyes. Critical ratios of tiger shark measured in the presence of masking noise in the spectrum level range of about 60-70dB (0dB re $1{\mu}Pa/\sqrt{Hz}$) increased from minimum 27dB to maximum 39dB at test frequencies of 80-200Hz. The noise spectrum level at the start of masking was distributed at the range of about 65dB within 80-200Hz.

The Hearing Ability of Black Rockfish Sebastes inermis to Underwater Audible Sound 2. The Auditory Critical Ratio (수중 가청음에 의한 볼락의 청각 능력 2. 청각 임계비)

  • LEE Chang-Heon;SEO Du-Ok
    • Korean Journal of Fisheries and Aquatic Sciences
    • /
    • v.34 no.2
    • /
    • pp.151-155
    • /
    • 2001
  • In order to obtain the fundamental data on the auditory thresholds of fishes for marine ranching, the auditory thresholds of black rockfish Sebastes inermis were measured in the presence of masking noise in the spectrum level range of $73\~83$ dB (0 dB re $1{\mu}Pa/\sqrt{Hz}$) with a classical cardiac conditioning technique. Critical ratios were about $28\~34$ dB at $80\~300$ Hz and $47\~52$ dB at $500\~800$ Hz. The ratio increased almost linearly with increasing frequency to 500 Hz. The noise spectrum level at the start of masking was about 70 dB within the frequency range of $80\~800$ Hz excepting 65 dB at 300 Hz. It means that hearing of the black rockfish is masked in the natural environment with the noise spectrum level above 65 dB. The sound pressure level of $200\~300$ Hz recognized by black rockfish was above 96 dB under the ambient noise and the critical ratio of them was above 26 dB.

  • PDF

The Auditory Critical Ratio of the Black Rock Fish Sebastes Schlegeli (조피볼락의 청각 임계비)

  • Park, Yong-Seok;Lee, Chang-Heon;Kim, Ko-Hwan;Seo, Du-Ok
    • Journal of Fisheries and Marine Sciences Education
    • /
    • v.12 no.1
    • /
    • pp.1-10
    • /
    • 2000
  • In order to obtain the fundamental data on the auditory thresholds of fishes for marine ranching, the auditory thresholds of black rock fish Sebastes Schlegeli were measured in the presence of masking noise in the spectrum level range of 73 - 83dB (0dB re $1{\mu}Pa/{\sqrt{Hz}}$) with a classical cardiac conditioning technique. Critical ratios were about 19 - 30dB at 80 - 300Hz and 46 - 54dB at 500 - 800Hz. The ratio increased almost linearly with increasing frequency to 500Hz. The noise spectrum level at the start of masking was about 70dB within the frequency range of 80 - 800Hz excepting 65dB at 300Hz. This suggests that hearing of the black rock fish is masked in the natural environment with the noise spectrum level above 65dB. The sound pressure level of which the signal sound of 100 - 200Hz is recognized by black rock fish under the ambient noise is above 90dB and the critical ratio of them is above 20dB.

  • PDF

Complex nested U-Net-based speech enhancement model using a dual-branch decoder (이중 분기 디코더를 사용하는 복소 중첩 U-Net 기반 음성 향상 모델)

  • Seorim Hwang;Sung Wook Park;Youngcheol Park
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.2
    • /
    • pp.253-259
    • /
    • 2024
  • This paper proposes a new speech enhancement model based on a complex nested U-Net with a dual-branch decoder. The proposed model consists of a complex nested U-Net to simultaneously estimate the magnitude and phase components of the speech signal, and the decoder has a dual-branch decoder structure that performs spectral mapping and time-frequency masking in each branch. At this time, compared to the single-branch decoder structure, the dual-branch decoder structure allows noise to be effectively removed while minimizing the loss of speech information. The experiment was conducted on the VoiceBank + DEMAND database, commonly used for speech enhancement model training, and was evaluated through various objective evaluation metrics. As a result of the experiment, the complex nested U-Net-based speech enhancement model using a dual-branch decoder increased the Perceptual Evaluation of Speech Quality (PESQ) score by about 0.13 compared to the baseline, and showed a higher objective evaluation score than recently proposed speech enhancement models.

Feature Extraction and Classification of Target from Jet Engine Modulation Signal Using Frequency Masking (제트 엔진 변조신호에서 주파수 마스킹을 이용한 표적의 특징 추출 및 식별)

  • Kim, Si-Ho;Kim, Chan-Hong;Chae, Dae-Young
    • The Journal of Korean Institute of Electromagnetic Engineering and Science
    • /
    • v.25 no.4
    • /
    • pp.459-466
    • /
    • 2014
  • This paper deals with the method to classify the aircraft target by analyzing its JEM signal. We propose the method to classify the engine model by analyzing JEM spectrum using the harmonic frequency mask generated from the blade information of jet engine. The proposed method does not need the complicated logic algorithm to find the chopping frequency in each rotor stage and the pre-simulated engine spectrum DB used in the previous methods. In addition, we propose the method to estimate the precise spool rate and it reduces the error in estimating the number of blades or in calculating the harmonic frequency of frequency mask.

A Rotation Resistant Logo Embedding Watermark on Frequency Domain (회전 변환에 강인한 주파수 영역 로고 삽입 워터마크 방법)

  • Lee, In-Jung;Lee, Hyoung;Yoo, Hye-Rim;Min, Joon-Young
    • Journal of Information Technology Applications and Management
    • /
    • v.14 no.1
    • /
    • pp.137-144
    • /
    • 2007
  • In this paper, we propose a rotation resistant robust logo embedding watermarking technique. Geometric manipulations make the detection process very complex and difficult. Watermark embedding in the normalized image directly suffers from smoothing effect due to the interpolation during the image normalization. This can be avoided by estimating the transform parameters using image normalization angle and moments, instead of embedding in the normalized image. Conventional rotation resistant schemes that use full frame transform. In this paper we adopt DCT and calculate masking using a spatio-frequency localization of the $8{\times}8$ block DCT coefficients. Experimental results show that the proposed algorithm is robust against rotation process.

  • PDF

Modified SNR-Normalization Technique for Robust Speech Recognition

  • Jung, Hoi-In;Shim, Kab-Jong;Kim, Hyung-Soon
    • The Journal of the Acoustical Society of Korea
    • /
    • v.16 no.3E
    • /
    • pp.14-18
    • /
    • 1997
  • One fo the major problems in speech recognition is the mismatch between training and testing environments. Recently, SNR normalization technique, which normalizes the dynamic range of frequency channels in mel-scaled filterbank, was proposed[1]. While it showed improved robustness against additive noise, it requires a reliable speech detection mechanism and several adaptation parameters to be optimized. In this paper, we propose a modified SNR normalization technique. In this technique, we take simply the maximum of filterbank output and predetermined masking constant for each frequency band. According to the speaker-independent isolated word recognition in car noise environments, proposed modification yields better recognition performance that the original SNR normalization method, with rather reduced complexity.

  • PDF

The Hearing Ability of Coralfish Chromis notatus to Low Frequency Sound 2. The Auditory Critical Ratio and Hearing Index (저주파음에 의한 자리돔의 청각 능력 2. 청각 임계비 및 청각능력지수)

  • 이창헌;서두옥
    • Journal of the Korean Society of Fisheries and Ocean Technology
    • /
    • v.36 no.4
    • /
    • pp.314-321
    • /
    • 2000
  • In order to obtain the fundamental data on the auditory thresholds of fishes for catching method using low frequency sound, the auditory thresholds of coralfish Chromis notatus were measured in the presence of masking noise in the spectrum level range of 73~83dB re l$\mu$Pa/√Hz by heartbeat conditioning technique using pure tones coupled with a delayed electric shock. Critical ratios were about 23~41dB at measurement frequency, The critical ratio increased almost linearly with increasing frequency from 500Hz. The noise spectrum level at the start of masking was about 60~65dB. This suggests that hearing of coralfish is masked in the natural environment with the noise spectrum level above 60dB. The sound pressure level of which the signal sound of 300Hz is recognized by coralfish under the ambient noise is above 88dB and the critical ratio of them is above 23dB. The hearing index of coralfish with ambient noise was 81.

  • PDF

Study on the Sound Quality Evaluation Method for the Vehicle Diesel Engine Noise (승용차 디젤 엔진 소음에 대한 음질 평가 기법 연구)

  • Kwon, Jo-Seph;Kim, Chan-Mook;Kim, Ki-Chang;Kim, Jin-Taek
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.21 no.10
    • /
    • pp.883-889
    • /
    • 2011
  • The brand sound of vehicle diesel engine is recently one of the important advantage strategies in the automotive company. Because various noise components masked under high frequency level can be audible in quieter driving situation. Many researches have been carried out for subjective and objective assessments on vehicle sounds and noises. In particular, the interior sound quality has been one of research fields that can give high quality feature to vehicle products. Vehicle interior noise above 500 Hz is usually controlled by sound package parts. The materials and geometries of sound package parts directly affect on this high frequency noise. This paper describes the sound quality evaluation method for the vehicle diesel engine noise to establish objective criteria for sound quality assessment. Considering the sensitivity of human hearing to impulsive sounds such as diesel noise, the human auditory mechanism was simulated by introducing temporal masking in the time domain. Furthermore, each of the human auditory organs was simulated by computer codes, providing reasonable analytical explanations of typical human hearing responses to diesel noise. This method finally provides the sound quality index of vehicle diesel engine noise that includes high frequency intermittent offensive sounds caused by impacting excitations of combustion and piston slap.

A Post-processing for Binary Mask Estimation Toward Improving Speech Intelligibility in Noise (잡음환경 음성명료도 향상을 위한 이진 마스크 추정 후처리 알고리즘)

  • Kim, Gibak
    • Journal of Broadcast Engineering
    • /
    • v.18 no.2
    • /
    • pp.311-318
    • /
    • 2013
  • This paper deals with a noise reduction algorithm which uses the binary masking in the time-frequency domain. To improve speech intelligibility in noise, noise-masked speech is decomposed into time-frequency units and mask "0" is assigned to masker-dominant region removing time-frequency units where noise is dominant compared to speech. In the previous research, Gaussian mixture models were used to classify the speech-dominant region and noise-dominant region which correspond to mask "1" and mask "0", respectively. In each frequency band, data were collected and trained to build the Gaussian mixture models and detection procedure is performed to the test data where each time-frequency unit belongs to speech-dominant region or noise-dominant region. In this paper, we consider the correlation of masks in the frequency domain and propose a post-processing method which exploits the Viterbi algorithm.