• Title/Summary/Keyword: Sound Model Generation

Search Result 55, Processing Time 0.024 seconds

Sound Model Generation using Most Frequent Model Search for Recognizing Animal Vocalization (최대 빈도모델 탐색을 이용한 동물소리 인식용 소리모델생성)

  • Ko, Youjung;Kim, Yoonjoong
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology
    • /
    • v.10 no.1
    • /
    • pp.85-94
    • /
    • 2017
  • In this paper, I proposed a sound model generation and a most frequent model search algorithm for recognizing animal vocalization. The sound model generation algorithm generates a optimal set of models through repeating processes such as the training process, the Viterbi Search process, and the most frequent model search process while adjusting HMM(Hidden Markov Model) structure to improve global recognition rate. The most frequent model search algorithm searches the list of models produced by Viterbi Search Algorithm for the most frequent model and makes it be the final decision of recognition process. It is implemented using MFCC(Mel Frequency Cepstral Coefficient) for the sound feature, HMM for the model, and C# programming language. To evaluate the algorithm, a set of animal sounds for 27 species were prepared and the experiment showed that the sound model generation algorithm generates 27 HMM models with 97.29 percent of recognition rate.

A study on the application of residual vector quantization for vector quantized-variational autoencoder-based foley sound generation model (벡터 양자화 변분 오토인코더 기반의 폴리 음향 생성 모델을 위한 잔여 벡터 양자화 적용 연구)

  • Seokjin Lee
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.2
    • /
    • pp.243-252
    • /
    • 2024
  • Among the Foley sound generation models that have recently begun to be studied, a sound generation technique using the Vector Quantized-Variational AutoEncoder (VQ-VAE) structure and generation model such as Pixelsnail are one of the important research subjects. On the other hand, in the field of deep learning-based acoustic signal compression, residual vector quantization technology is reported to be more suitable than the conventional VQ-VAE structure. Therefore, in this paper, we aim to study whether residual vector quantization technology can be effectively applied to the Foley sound generation. In order to tackle the problem, this paper applies the residual vector quantization technique to the conventional VQ-VAE-based Foley sound generation model, and in particular, derives a model that is compatible with the existing models such as Pixelsnail and does not increase computational resource consumption. In order to evaluate the model, an experiment was conducted using DCASE2023 Task7 data. The results show that the proposed model enhances about 0.3 of the Fréchet audio distance. Unfortunately, the performance enhancement was limited, which is believed to be due to the decrease in the resolution of time-frequency domains in order to do not increase consumption of the computational resources.

A Range Dependent Structural HRTF Model for 3-D Sound Generation in Virtual Environments (가상현실 환경에서의 3차원 사운드 생성을 위한 거리 변화에 따른 구조적 머리전달함수 모델)

  • Lee, Young-Han;Kim, Hong-Kook
    • MALSORI
    • /
    • no.59
    • /
    • pp.89-99
    • /
    • 2006
  • This paper proposes a new structural head-related transfer function(HRTF) model to produce sounds in a virtual environment. The proposed HRTF model generates 3-D sounds by using a head model, a pinna model and the proposed distance model for azimuth, elevation, and distance that are three aspects for 3-D sounds, respectively. In particular, the proposed distance model consists of level normalization block distal region model, and proximal region model. To evaluate the performance of the proposed model, we setup an experimental procedure that each listener identifies a distance of 3-D sound sources that are generated by the proposed method with a predefined distance. It is shown from the tests that the proposed model provides an average distance error of $0.13{\sim}0.31$ meter when the sound source is generated as if it is 0.5 meter $\sim$ 2 meters apart from the listeners. This result is comparable to the average distance error of the human listening for the actual sound source.

  • PDF

Study on the Development of Integrated Vibration and Sound Generator (휴대폰용 일체형 음향 및 진동 발생장치 개발을 위한 연구)

  • 신태명;안진철
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.13 no.11
    • /
    • pp.875-881
    • /
    • 2003
  • The received signal of a mobile phone is normally sensed through two independent means which are the sound generation of a speaker and vibration generation of a vibration motor. As an improvement scheme to meet the consumer's demand on weight reduction and miniaturization of a mobile phone, the design and development of an integrated vibration and sound generating device are performed in this research. To this purpose, the optimal shapes of the voice coil. the permanent magnet and the vibration plate are designed, and the excitation force applied to the vibration system of the new device is estimated and verified through theoretical analyses, computer simulation, and experiments using an expanded model. In addition, vibration performance comparison of the device with the existing vibration motor is performed, and from the overall process, therefore, the method and procedure for the vibration performance analysis of the integrated vibration and sound generating device are established.

Sound Radiation Property of Tribo-System

  • Stoimenov, B.L.;Kato, K.;Adachi, K.
    • Proceedings of the Korean Society of Tribologists and Lubrication Engineers Conference
    • /
    • 2002.10b
    • /
    • pp.383-384
    • /
    • 2002
  • Frictional sound is observed in great many practical systems, but its generation mechanism is still unknown Model systems are best suited for research on the fundamental mechanisms, but results cannot be easily applied to real systems, because each system has different sound radiation properties. At present, there is no easy method for evaluation of these properties. We propose to describe the sound radiation property of a tribo-system by the relationship between friction-induced sound power and the friction-induced vibration velocity of the contact element. It was found that the sound power of a tribo-system is linearly proportional to the mean-square velocity of the sliding element by a constant coefficient having the dimension of mass flow rate (kg/s).

  • PDF

3-D Sound-Field Creation Implementing the Virtual Reality Ship Handling Simulator(I): HRTF Modeling (가상 현실 선박 조종 시뮬레이터 구현을 위한 3차원 음장생성(I) : 머리전달함수 모델링)

  • 임정빈
    • Journal of the Korean Institute of Navigation
    • /
    • v.22 no.3
    • /
    • pp.17-25
    • /
    • 1998
  • This paper describes elemental technologies for the creation of three-dimensional(3-D) sound-field to implement the next-generation Ship Handling Simulator with human -computer interaction, known as Virtual Reality. In the virtual reality system, Head-Related Transfer Functions(HRTF's) are used to generate 3-D sound environmental context. Where, the HRTF's are impulse response characterizing the acoustical transformation in a space. This work is divided into two parts, the part Ⅰis mainly for the model constructions of the HRTF's, the part Ⅱis for the control of 3-D sound-field by using the HRTF's . In this paper, as first part, we search for the theory to formulate models of the HRTF's which reduce the dimensionalityof the formulation without loss of any directional information . Using model HRTF's we report results from psychophysical tests used to asses the validity of the proposed modleing method.

  • PDF

An Analysis of Vibration and Sound Radiation of Sandwich Panels Using the Rayleigh-Ritz Method (Rayleigh-Ritz법을 이용한 샌드위치 패널의 진동 및 소음방사 특성 분석)

  • Kim, Dong-Kyu;Kim, Jae-Hyun;Jeon, Jin-Yong;Park, Jun-Hong
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.21 no.5
    • /
    • pp.430-436
    • /
    • 2011
  • The purpose of this study is to analyze the vibration and sound generation characteristics of the sandwich panel. Two thick panels were assumed to be separated by a compliant viscoelastic core. The transverse vibration induced by an external impact was analyzed using the Rayleigh-Ritz method. For applying arbitrary boundary condition of the panels, the edges were assumed to be supported by the translational and rotational springs. The beam functions were used as the trial functions. The effect of the boundary condition and viscoelastic core on the resulting vibration characteristics was investigated. The radiated sound power was analyzed using the proposed numerical model and the Rayleigh integral. The dynamic properties of the core and the mass-stiffness-mass resonance frequency had significant influence on the impact sound.

Modeling of Ultrasonic Testing in Butt Joint by Ray Tracing

  • Nam, Young-Hyun
    • Journal of Mechanical Science and Technology
    • /
    • v.15 no.4
    • /
    • pp.441-447
    • /
    • 2001
  • Ultrasonic wave generation and propagation were modeled to simulate an ultrasonic test. A ray model was used for the modeling. Actual sound pressure distribution of the incident wave from an angle probe was analyzed using an ultrasonic visualization method to incorporate the actual sound pressure distribution in the model. In this method, the sound pressure was expressed by the density of rays and the reflection coefficient of ultrasonic beams. Reflection and mode conversion of rays were computed by the Snells law. Simulation programs for the problem of ultrasonic testing of a butt joint are built using this ray modeling. Simulation results for ultrasonic wave scattering from a defect and A-scan display in ultrasonic testing agreed with the actual experiment results.

  • PDF

Brake Squeal Noise Due to Disk Misalignment (디스크 정렬불량에 기인한 브레이크 스퀼소음)

  • Park, Ju-Pyo;Choi, Yeon-Sun
    • Proceedings of the KSME Conference
    • /
    • 2003.11a
    • /
    • pp.1690-1695
    • /
    • 2003
  • In order to investigate the mechanism of brake squeal noise, the sound and vibration of an actua1 brake system were measured using a brake dynamometer. The experimental results show that disc run-out varies with brake line pressure and the factor of squeal generation is the run-out due to the misalignment of brake disk. A three degrees of freedom friction model is developed for the disk brake system where the run-out effect and nonlinear friction characteristic are considered. The results of numerical analysis of the model agree well with the experimental results. Also, the stability analysis of the model was performed to predict the generation of brake squeal due to the design parameter modification of brake systems. The results show that the squeal generation depends on the nm-out rather than the friction characteristic between the pad and the disk of brake.

  • PDF

Investigation of the Dynamic Properties of Railway Tracks using a Model for Calculation of Generation of Wheel/Rail Noise

  • Koh, Hyo-In;Nordborg, Anders
    • International Journal of Railway
    • /
    • v.7 no.4
    • /
    • pp.109-116
    • /
    • 2014
  • For optimization of a low-noise track system, rail vibration and noise radiation needs to be investigated. The main influencing parameters for the noise radiation and the quantitative results of every track system can be obtained using a calculation model of generation and radiation of railway noise. This kind of model includes contact modeling and the calculation model of the dynamic properties of the wheel and the rail. This study used a nonlinear wheel/rail interaction model in the time domain to investigate the excitation of the rolling noise. Wheel/rail response is determined by time integrating Green's function of the rail together with force impulses from the wheel/rail contact. This model and the results of the study can be used for supporting calculation with the conventional model by an addition of the contributions due to nonlinearities to the roughness spectrum.