• Title/Summary/Keyword: Spectrogram Analysis

Search Result 92, Processing Time 0.019 seconds

Impulse Response Filtration Technique for the Determination of Phase Velocities from SASW Measurements (SASW시험에 의한 위상속도 결정을 위한 임펄스 응답필터 기법)

  • ;Stokoe, K.H., Il
    • Geotechnical Engineering
    • /
    • v.13 no.1
    • /
    • pp.111-122
    • /
    • 1997
  • The calculation of phase velocities in Spectral-Analysis -of-Surface -Waves (SASW) meas urements requires unwrapping phase angles. In case of layered systems with strong stiffness contrast like a pavement system, conventional phase unwrapping algorithm to add in teger multiples of 2n to the principal value of a phase angle may lead to wrong phase volocities. This is because there is difficulty in counting the number of jumps in the phase spectrum especially at the receiver spacing where the measurements are in the transition Bone of defferent modes. A new phase interpretation scheme, called "Impulse Response Fil traction ( IRF) Technique," is proposed, which is based on the separation of wave groups by the filtration of the impulse response determinded between two receivers. The separation of a wave group is based on the impulse response filtered by using information from Gabor spectrogram, which visualizes the propagation of wave groups at the frequency -time space. The filtered impulse response leads to clear interpretation of phase spectrum, which eliminates difficulty in counting number of jumps in the phase spectrum. Verification of the IRF technique was performed by theoretical simulation of the SASW measurement on a pavement system which complicates wave propagation.opagation.

  • PDF

Time-Frequency Analysis of Broadband Acoustic Scattering from Chub Mackerel Scomber japonicus, Goldeye Rockfish Sebastes thompsoni, and Fat Greenling Hexagrammos otakii (고등어(Scomber japonicus), 불볼락(Sebastes thompsoni) 및 쥐노래미(Hexagrammos otakii)에 의한 광대역 음향산란신호의 시간-주파수 분석)

  • Lee, Dae-Jae
    • Korean Journal of Fisheries and Aquatic Sciences
    • /
    • v.48 no.2
    • /
    • pp.221-232
    • /
    • 2015
  • Broadband echoes measured in live chub mackerel Scomber japonicus, goldeye rockfish Sebastes thompsoni, and fat greenling Hexagrammos otakii with different morphologies and internal characteristics were analyzed in time and frequency domains to understand the species-specific echo feature characteristics for classifying fish species. The mean echo image for each time-frequency representation dataset obtained as a function of orientation angle was extracted to mitigate the effect of fish orientation on acoustic scattering. The joint time-frequency content of the broadband echo signals was obtained using the smoothed pseudo-Wigner-Ville distribution (SPWVD). The SPWVDs were analyzed for each echo signature of the three fish species. The results show that the time-frequency analysis provided species-specific echo structure patterns and metrics of the broadband acoustic signals to facilitate fish species classification.

Sample selection approach using moving window for acoustic analysis of pathological sustained vowels according to signal typing

  • Lee, Ji-Yeoun
    • Phonetics and Speech Sciences
    • /
    • v.3 no.3
    • /
    • pp.99-108
    • /
    • 2011
  • The perturbation parameters like jitter, shimmer, and signal-to-noise ratio (SNR) are largely estimated in the particular segment from the subjective or whole portion of the given pathological voice signal although there are many possible regions to be able to analyze the voice signals. In this paper, the pathological voice signals were classified as type 1, 2, 3, or 4 according to narrow band spectrogram and the value differences of the perturbation parameters extracted in the subjective and entire portion tended to be getting bigger as from type 1 to type 4 signals. Therefore, sample selection method based on moving window to analyze type 2 and 3 signals as well as type 1 signals is proposed. Although type 3 signals cannot be analyzed using the perturbation analysis, the type 3 signals by selecting out the samples in which error count is less than 10 through moving window were analyzed. At present, there is no method to be able to analyze the type 4 signals. Future research will endeavor to determine the best way to evaluate such voices.

  • PDF

A Study on the Digital Filter and Wavelet Transform of Monitoring for Laser Welding (레이저 용접 모니터링에 적합한 디지털 필터와 웨이블렛 변환 방법에 관한 연구)

  • Kim, Do Hyoung;Shin, Ho Jun;Yoo, Young Tae
    • Journal of the Korean Society for Precision Engineering
    • /
    • v.30 no.1
    • /
    • pp.67-76
    • /
    • 2013
  • We present an innovative real-time laser welding monitoring technique employing the correlation analysis of the plasma plume optical emission generated during the process. The plasma optical radiation emitted during Nd:YAG laser welding of S45C steel samples has detected with a Photodiode and analyzed under different process conditions. The discrete DC voltage difference, filter methods and wavelet transform has been used to decompose the optical signal into various discrete series of sequences over different frequency bands. Considering that wavelet analysis can decompose the optical signals, extract the characteristic information of the signals and define the defects location accurately, it can be used to implement process-control of laser welding.

A Study on the Foreign Accent of English Stressed Syllables (영어강세음절의 외국인어투에 관한 연구)

  • Park, Hee-Suk
    • Journal of Convergence Society for SMB
    • /
    • v.6 no.4
    • /
    • pp.51-57
    • /
    • 2016
  • This study aims at investigating and comparing the vowel lengths of the eight stressed syllable vowels among the Korean college students with the English native speakers. To do this English sentences were uttered and recorded by twenty Korean subjects. Acoustic features were measured from a sound spectrogram with the help of the Praat software program and analyzed through statistical analysis. From the results of the experiment, I was able to find out that the differences of the lengths of the first syllable stressed vowels were significant. Especially in the pronunciation of the English front low vowel /${\ae}$/, native subjects pronounced significantly longer than Korean subjects, and this result could be used as a teaching material in pronunciation class.

A Study on a Intelligent GIS Monitoring System using the Preventive Diagnostic Technology (예방진단기술을 이용한 지능형 GIS 감시시스템에 관한 연구)

  • Park, Kee-Young;Lee, Jong-Ha;Cho, Sook-Jin;Choi, Hyung-Ki;Jung, Eui-Bung
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.51 no.6
    • /
    • pp.244-251
    • /
    • 2014
  • In this study, we give a detailed account of normal and abnormal state of GIS(Gas Insulated Switch-gear) using the preventive diagnostic technology. And it is based on the analysis and diagnosis for storing data of GIS by intelligent GIS monitoring system. The wave shape of GIS sound is similar to noise and is systematically generated by discharge and its corona sound. Therefore, in this paper, to classify normal and abnormal GIS sound. We could discriminate between normal and abnormal case using level crossing rate(LCR) and spectrogram energy rate.

Automatic Phonetic Segmentation of Korean Speech Signal Using Phonetic-acoustic Transition Information (음소 음향학적 변화 정보를 이용한 한국어 음성신호의 자동 음소 분할)

  • 박창목;왕지남
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.8
    • /
    • pp.24-30
    • /
    • 2001
  • This article is concerned with automatic segmentation for Korean speech signals. All kinds of transition cases of phonetic units are classified into 3 types and different strategies for each type are applied. The type 1 is the discrimination of silence, voiced-speech and unvoiced-speech. The histogram analysis of each indicators which consists of wavelet coefficients and SVF (Spectral Variation Function) in wavelet coefficients are used for type 1 segmentation. The type 2 is the discrimination of adjacent vowels. The vowel transition cases can be characterized by spectrogram. Given phonetic transcription and transition pattern spectrogram, the speech signal, having consecutive vowels, are automatically segmented by the template matching. The type 3 is the discrimination of vowel and voiced-consonants. The smoothed short-time RMS energy of Wavelet low pass component and SVF in cepstral coefficients are adopted for type 3 segmentation. The experiment is performed for 342 words utterance set. The speech data are gathered from 6 speakers. The result shows the validity of the method.

  • PDF

Shooting sound analysis using convolutional neural networks and long short-term memory (합성곱 신경망과 장단기 메모리를 이용한 사격음 분석 기법)

  • Kang, Se Hyeok;Cho, Ji Woong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.41 no.3
    • /
    • pp.312-318
    • /
    • 2022
  • This paper proposes a model which classifies the type of guns and information about sound source location using deep neural network. The proposed classification model is composed of convolutional neural networks (CNN) and long short-term memory (LSTM). For training and test the model, we use the Gunshot Audio Forensic Dataset generated by the project supported by the National Institute of Justice (NIJ). The acoustic signals are transformed to Mel-Spectrogram and they are provided as learning and test data for the proposed model. The model is compared with the control model consisting of convolutional neural networks only. The proposed model shows high accuracy more than 90 %.

Limitations of Spectrogram Analysis for Smartphone Voice Recording File Forgery Detection (스마트폰 음성 녹음 파일 위변조 검출을 위한 스펙트로그램 분석의 한계점)

  • Sangmin Han;Yeongmin Son;Jae Wan Park
    • The Journal of the Convergence on Culture Technology
    • /
    • v.9 no.2
    • /
    • pp.545-551
    • /
    • 2023
  • As digital information is readily available to everyone today, the adoption of digital evidence is increasing. However, it is virtually impossible to determine the authenticity of forgery in the case of a voice recording file that has gone through a sophisticated editing process along with the spread of various voice file editing tools. This study aims to prove that forgery, which is difficult to distinguish from the original file, is possible by using insertion, deletion, linking, and synthetic editing technologies in voice recording files. This study presents the difficulty of detecting forgery by encoding a forged voice file with the same extension as the original. In addition, it was shown that forgery detection is impossible if additional transition band deletion and secondary encoding are performed only for experiments in which features occurred. Through this, this study is expected to contribute to the establishment of more stringent evidence admissibility criteria for adopting voice recording files as digital evidence.

Speech Denoising via Low-Rank and Sparse Matrix Decomposition

  • Huang, Jianjun;Zhang, Xiongwei;Zhang, Yafei;Zou, Xia;Zeng, Li
    • ETRI Journal
    • /
    • v.36 no.1
    • /
    • pp.167-170
    • /
    • 2014
  • In this letter, we propose an unsupervised framework for speech noise reduction based on the recent development of low-rank and sparse matrix decomposition. The proposed framework directly separates the speech signal from noisy speech by decomposing the noisy speech spectrogram into three submatrices: the noise structure matrix, the clean speech structure matrix, and the residual noise matrix. Evaluations on the Noisex-92 dataset show that the proposed method achieves a signal-to-distortion ratio approximately 2.48 dB and 3.23 dB higher than that of the robust principal component analysis method and the non-negative matrix factorization method, respectively, when the input SNR is -5 dB.