• Title/Summary/Keyword: acoustical variable


Efficient Harmonic-CELP Based Low Bit Rate Speech Coder (효율적인 하모닉-CELP 구조를 갖는 저 전송률 음성 부호화기)

  • 최용수;김경민;윤대희
    • The Journal of the Acoustical Society of Korea / v.20 no.5 / pp.35-47 / 2001
  • This paper describes an efficient harmonic-CELP speech coder that combines the advantages of harmonic and CELP coders. Depending on the frame voicing decision, the proposed harmonic-CELP coder uses the RP-VSELP coder as a fast CELP for an unvoiced frame, or an improved harmonic coder for a voiced frame. The main features of the proposed coder are simple pitch detection, fast harmonic estimation, variable-dimension harmonic vector quantization, perceptual weighting that reflects frequency resolution, fast harmonic synthesis, naturalness control using band voicing, and multi-mode operation. These features give the proposed coder very low complexity compared with the HVXC coder. To demonstrate the performance of the proposed coder, a 2.4 kbps coder was implemented and compared with reference coders. In informal listening tests, the proposed coder showed good quality while requiring low delay and complexity.


Detection of Glottal Closure Instant using the property of G-peak (G-peak의 특성을 이용한 성문폐쇄시점 검출)

  • Keum, Hong;Kim, Dae-Sik;Bae, Myung-Jin;Kim, Young-Il
    • The Journal of the Acoustical Society of Korea / v.13 no.1E / pp.82-88 / 1994
  • Exact detection of the GCI (Glottal Closure Instant) is important in speech signal processing. A few methods for detecting the GCI of voiced speech have been proposed so far, but they have difficulty detecting the GCI across a wide range of speakers and various vowel signals. In this paper, we propose a new method for GCI detection using the G-peak. The speech waveforms are passed through an LPF of variable bandwidth. The GCIs of voiced speech are then detected from the G-peak of the filtered signals. We compared the detected GCIs with eye-checked GCIs for clean speech and at SNRs of 20 dB and 0 dB, counting a detection as correct when it fell within 1 ms of the eye-checked GCI. The detection rates were 97.9% for clean speech, 96.5% at 20 dB SNR, and 94.8% at 0 dB SNR.
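The 1 ms scoring rule in this abstract can be illustrated with a minimal sketch. The function name, the matching rule (any detection within tolerance counts as a hit), and the sample timestamps are all assumptions made for illustration; the abstract does not say how detections were paired with the eye-checked instants.

```python
def detection_rate(reference_ms, detected_ms, tolerance_ms=1.0):
    """Fraction of reference GCIs that have a detected GCI within tolerance_ms.

    Assumption: a reference instant counts as detected if ANY detection
    lies within the tolerance; the abstract does not specify the pairing.
    """
    if not reference_ms:
        raise ValueError("reference list is empty")
    hits = sum(
        1
        for ref in reference_ms
        if any(abs(ref - det) <= tolerance_ms for det in detected_ms)
    )
    return hits / len(reference_ms)


# Hypothetical timestamps in milliseconds, not data from the paper.
reference = [10.0, 18.1, 26.3, 34.2]  # eye-checked GCIs
detected = [10.4, 18.0, 27.9, 34.9]   # detector output
rate = detection_rate(reference, detected)
print(f"detection rate: {rate:.1%}")
```

Three of the four hypothetical reference instants have a detection within 1 ms, so the sketch reports 75%.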


Geoacoustic Characteristics of P-Wave Velocity in Donghae City - Ulleung Island Line, East Sea: Preliminary Results (동해시-울릉도 해저 측선에서의 P파 속도 지음향 특성: 예비 결과)

  • Ryang, Woo-Hun;Kwon, Yi-Kyun;Jin, Jae-Hwa;Kim, Hyun-Tae;Lee, Chi-Won;Jung, Ja-Hun;Kim, Dae-Choul;Choi, Jin-Hyuk;Kim, Young-Gyu;Kim, Sung-Il
    • The Journal of the Acoustical Society of Korea / v.26 no.2E / pp.44-49 / 2007
  • Donghae City - Ulleung Island Line (DC-UI Line) is a representative line for underwater and geoacoustic modeling in the middle western East Sea. In this line, an integrated model of P-wave velocity is proposed for a low-frequency range target (<200 Hz), based on high-resolution seismic profiles (2 - 7 kHz sonar and air-gun), shallow and deep cores (grab, piston, and Portable Remote Operated Drilling), and outcrop geology (Tertiary rocks and the basement on land). The basement comprises 3 geoacoustic layers of P-wave velocity ranging from 3750 to 5550 m/s. The overlying sediments consist of 7 layers of P-wave velocities ranging from 1500 to 1900 m/s. The bottom model shows that the structure is very irregular and the velocity is also variable with both vertical and lateral extension. In this area, seabed and underwater acousticians should consider that low-frequency acoustic modeling is very range-dependent and a detailed geoacoustic model is necessary for better modeling of acoustic propagation such as long-range surveillance of submarines and monitoring of currents.

Robust variable range focusing with a virtual source array using the waveguide invariant in underwater (수중에서의 도파관 불변성을 이용한 가상 음원 배열 기반의 다양한 거리 방향으로의 강인한 집속)

  • Byun, Gi Hoon;Kim, J.S.
    • The Journal of the Acoustical Society of Korea / v.36 no.1 / pp.23-29 / 2017
  • The VSA (Virtual Source Array) concept is a method for producing an acoustic spatio-temporal focus at a selected location in the outbound direction with respect to the VSA, without the need for a probe source, by combining TRP (Time-Reversal Processing) with time-delay beam steering. However, TRP using the VSA concept is limited by the critical angle and to short distances relative to the VSA. In this paper, waveguide invariant theory is applied to the VSA concept to refocus the received field at ranges beyond these limits by shifting the focused field. The suggested method is verified via numerical simulation, and the results show that robust acoustic focusing is achieved at the selected location regardless of the limitations of the conventional VSA concept.

Implementation of low-noise, wideband ultrasound receiver for high-frequency ultrasound imaging (고주파수 초음파 영상을 위한 저잡음·광대역 수신 시스템 구현)

  • Moon, Ju-Young;Lee, Junsu;Chang, Jin Ho
    • The Journal of the Acoustical Society of Korea / v.36 no.4 / pp.238-246 / 2017
  • High-frequency ultrasound imaging typically suffers from low sensitivity due to the small aperture of high-frequency transducers and from shallow imaging depth due to the frequency-dependent attenuation of ultrasound. These limitations must be overcome to obtain high-frequency, high-resolution ultrasound images. One practical solution is a high-performance signal receiver capable of detecting a very small signal and amplifying it with minimal added electronic noise. This paper reports a recently developed low-noise, wideband ultrasound receiver for high-frequency, high-resolution ultrasound imaging. The developed receiver has an amplification gain of up to 73 dB and a variable amplification gain range of 48 dB over an operating frequency of 80 MHz. It also has an amplification gain flatness of $\pm 1$ dB. Owing to this performance, the developed receiver has a signal-to-noise ratio of at least 8.4 dB and a contrast-to-noise ratio of at least 3.7 dB higher than commercial receivers.
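To give a feel for the decibel figures quoted above, here is a small conversion sketch. It assumes the gains are voltage gains, so that dB = 20·log10(ratio); the abstract does not state this, and the function name is hypothetical.

```python
def db_to_voltage_ratio(gain_db):
    """Convert a gain in dB to a linear voltage ratio.

    Assumption: voltage gain, i.e. gain_db = 20 * log10(ratio).
    """
    return 10 ** (gain_db / 20)


max_gain = db_to_voltage_ratio(73)      # maximum gain quoted in the abstract
variable_span = db_to_voltage_ratio(48)  # variable gain range
flatness = db_to_voltage_ratio(1)        # +1 dB edge of the flatness band

print(f"73 dB ~ x{max_gain:.0f}")
print(f"48 dB ~ x{variable_span:.0f}")
print(f"+1 dB ~ x{flatness:.3f}")
```

Under that assumption, 73 dB corresponds to a voltage amplification of several thousand, and a ±1 dB flatness to a variation of roughly 12% in amplitude.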

Finite Element Analysis for Acoustic Characteristics of Piezoelectric Underwater Acoustic Sensors (압전 수중음향센서 음향특성의 유한요소해석)

  • 김재환;손선봉;조철희;조치영
    • The Journal of the Acoustical Society of Korea / v.20 no.1 / pp.68-76 / 2001
  • Sonar is a system that detects objects in water and finds their location using the echo-ranging technique. For the system to perform well in variable environments, its acoustic characteristics must be analyzed accurately. In this paper, the acoustic characteristics of underwater acoustic sensors are modeled and analyzed using finite element analysis. The couplings between piezoelectric and elastic materials, and between fluid and structure systems, that arise in modeling piezoelectric underwater acoustic sensors are formulated. In the finite element modeling of the unbounded acoustic fluid, the IWEE (Infinite Wave Envelope Element) is adopted to account for the infinite domain. When an incident wave excites the surface of a Tonpilz underwater acoustic sensor, the scattered wave on the sensor is found by approximately satisfying the radiation condition at the artificial boundary. Based on this scattering analysis, the electrical response of the underwater acoustic sensor to the incident wave, the so-called RVS (Receiving Voltage Signal), is found accurately. This will contribute to the accurate design of sonar systems.


Local Distribution Based Density Clustering for Speaker Diarization (화자분할을 위한 지역적 특성 기반 밀도 클러스터링)

  • Rho, Jinsang;Shon, Suwon;Kim, Sung Soo;Lee, Jae-Won;Ko, Hanseok
    • The Journal of the Acoustical Society of Korea / v.34 no.4 / pp.303-309 / 2015
  • Speaker diarization is the task of determining the speakers for unlabeled data, and DBSCAN (Density-Based Spatial Clustering of Applications with Noise) has been widely used in speaker diarization for its simplicity and computational efficiency. One challenging issue, however, is that if different clusters in a non-spatial dataset are adjacent to each other, over-clustering may occur, which degrades the performance of DBSCAN. In this paper, we identify the drawbacks of DBSCAN and propose a new density clustering algorithm based on the local distribution around each object. Variable density criteria for the local density and spreadness of each object are used for effective data clustering. We compare the proposed algorithm with DBSCAN in terms of clustering accuracy. Experimental results confirm that the proposed algorithm achieves higher accuracy than DBSCAN without over-clustering, and that the new approach based on local density and object spreadness is efficient.
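For reference, the baseline that this abstract compares against can be sketched in a few lines. This is a minimal plain-Python version of standard DBSCAN with a single global `eps` and `min_pts`, not the paper's proposed algorithm; the abstract only says that the proposed method replaces such fixed settings with variable density criteria. The sample points and parameter values are hypothetical.

```python
from math import dist

NOISE = -1


def region_query(points, i, eps):
    """Indices of all points within eps of points[i] (including i itself)."""
    return [j for j, p in enumerate(points) if dist(points[i], p) <= eps]


def dbscan(points, eps, min_pts):
    """Standard DBSCAN; returns one cluster label per point, NOISE for outliers."""
    labels = [None] * len(points)
    cluster = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_pts:
            labels[i] = NOISE  # may later be relabelled as a border point
            continue
        cluster += 1
        labels[i] = cluster
        queue = [j for j in neighbors if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins but is not expanded
            if labels[j] is not None:
                continue
            labels[j] = cluster
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_pts:
                queue.extend(j_neighbors)  # core point: keep growing the cluster
    return labels


# Hypothetical 2-D feature vectors: two tight groups and one outlier.
points = [(0, 0), (0, 0.1), (0.1, 0), (5, 5), (5, 5.1), (5.1, 5), (10, 10)]
labels = dbscan(points, eps=0.3, min_pts=3)
print(labels)
```

Because `eps` and `min_pts` are global, clusters of different density that sit close together are hard to handle with one setting, which is the kind of limitation the abstract describes.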

Global Soft Decision Based on Improved Speech Presence Uncertainty Tracking Method Incorporating Spectral Gradient (스펙트럼 변이 기반의 향상된 음성 존재 불확실성 추적 기법을 이용한 Global Soft Decision)

  • Kim, Jong-Woong;Chang, Joon-Hyuk
    • The Journal of the Acoustical Society of Korea / v.32 no.3 / pp.279-285 / 2013
  • In this paper, we propose a novel speech enhancement method that improves the performance of conventional global soft decision by applying the spectral gradient method to the ratio of the a priori speech absence and presence probabilities (q). The conventional global soft decision scheme uses a fixed value of q in accordance with the assumed hypothesis, whereas the proposed algorithm improves the speech absence probability by adaptively varying q according to speech presence or absence in the previous two frames and the conditions on the spectral gradient value. Experimental results show that the proposed method yields better results than the conventional global soft decision technique in terms of ITU-T P.862 PESQ (Perceptual Evaluation of Speech Quality).

A study on the fault diagnosis of rotating machine by machine learning (기계학습을 적용한 회전체 고장진단에 관한 연구)

  • Jeon, Hang-Kyu;Kim, Ji-Sun;Kim, Bong-Ju;Kim, Won-Jin
    • The Journal of the Acoustical Society of Korea / v.39 no.4 / pp.263-269 / 2020
  • In this study, a rotating machine that can reproduce the normal condition and 8 fault conditions was built, and vibration data were acquired. Features are calculated from the acquired data, and accuracy is analyzed through fault diagnosis using artificial neural networks and genetic algorithms. To achieve optimal timing and higher accuracy, features from three domains were applied to the fault diagnosis. The learning number was selected as a setting variable. In the rotating machine fault diagnosis, higher precision was found in the frequency domain than in the others, and precise fault diagnoses were accomplished in all 10 operations at learning numbers of 5000 and 8000. Considering time efficiency, a learning number of 5000 was judged to be the most efficient.

Effect of expanding low-salinity water in the East China Sea on underwater sound propagation (동중국해 저염분수의 확장이 수중 음파 전달에 미치는 영향)

  • Bum-Jun Kil
    • The Journal of the Acoustical Society of Korea / v.42 no.1 / pp.16-24 / 2023
  • The salinity of sea water is known to be a less influential variable in calculating the sound speed of sea water. This study investigated how low salinity affects the vertical structure of the sound speed near the mouth of the Yangtze (Changjiang) River when the diluted fresh water extends toward the East China Sea in the summer. A comparison of sound speeds calculated with measured salinity and with fixed salinity showed that the two became distinguishable when the halocline was steeper than the thermocline because of Yangtze River Diluted Water (YRDW). In addition, unlike under fixed-salinity conditions, when measured salinity was considered, an underwater sound channel appeared in the middle of the thermocline, where the source depth is located. Accordingly, because of the strong halocline caused by YRDW in the summer, this study suggests using an Expendable Conductivity Temperature Depth (XCTD) probe and an Expendable Sound Velocimeter (XSV) rather than an Expendable Bathythermograph (XBT) when calculating sound speed.
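The sensitivity of sound speed to salinity can be illustrated with a short sketch. The abstract does not say which sound speed equation the study used; the code below assumes Medwin's simplified formula (commonly quoted for roughly 0 to 35 °C, 0 to 45 salinity units, and depths to 1000 m), and the temperature, depth, and salinity values are hypothetical.

```python
def sound_speed_medwin(temp_c, salinity, depth_m):
    """Sound speed in m/s from Medwin's simplified formula.

    Assumption: this equation is a stand-in chosen for illustration; the
    study summarised above may have used a different one.
    """
    t = temp_c
    return (
        1449.2
        + 4.6 * t
        - 0.055 * t**2
        + 0.00029 * t**3
        + (1.34 - 0.010 * t) * (salinity - 35.0)
        + 0.016 * depth_m
    )


# Hypothetical summer near-surface conditions.
c_fixed = sound_speed_medwin(temp_c=20.0, salinity=35.0, depth_m=10.0)
c_fresh = sound_speed_medwin(temp_c=20.0, salinity=25.0, depth_m=10.0)

print(f"fixed salinity 35: {c_fixed:.2f} m/s")
print(f"diluted water 25:  {c_fresh:.2f} m/s")
print(f"difference:        {c_fixed - c_fresh:.2f} m/s")
```

In this formula, a 10-unit drop in salinity at 20 °C lowers the sound speed by about 11 m/s, comparable to the effect of a temperature change of a few degrees. This is why a salinity profile measured by XCTD can matter where a strong halocline is present.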