• Title/Abstract/Keywords: Sound Model Generation

55 search results, processing time 0.02 s

Sound Model Generation using Most Frequent Model Search for Recognizing Animal Vocalization

  • 고유정;김윤중
    • 한국정보전자통신기술학회논문지
    • /
    • Vol. 10, No. 1
    • /
    • pp.85-94
    • /
    • 2017
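  • This paper devises a most-frequent-model search algorithm for an animal sound recognition system and proposes a method for generating sound models with it. The generation method repeats a training step, a Viterbi search step, and a most-frequent-model search step on the sound data of each animal species, searching over the structure of the HMM (Hidden Markov Model), that is, the number of states and the number of GMMs, to produce the model set with the best recognition rate. The most-frequent-model search algorithm searches the input sound data with the Viterbi algorithm to build a model list, then finds the model that occurs most often in that list and takes it as the final recognition result. The algorithm uses MFCC (Mel Frequency Cepstral Coefficient) as the sound feature and HMM as the model type, and was implemented in the C# programming language. To evaluate the algorithm, the sounds of 27 animal species were selected and tested, confirming that a set of 27 HMM models was generated with a recognition rate of 97.29 percent.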
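The final decision step described in this abstract, choosing the most frequent entry in the model list produced by Viterbi search, can be sketched as follows. This is a minimal illustration in Python rather than the authors' C# implementation. The function name `most_frequent_model`, the species labels, and the tie-breaking rule (the label seen first wins) are assumptions that the abstract does not state.

```python
from collections import Counter


def most_frequent_model(model_list):
    """Return the model label that occurs most often in model_list.

    model_list is assumed to hold one recognized model label per
    Viterbi search result. Ties go to the label seen first (assumption).
    """
    if not model_list:
        raise ValueError("model_list must not be empty")
    label, _count = Counter(model_list).most_common(1)[0]
    return label


# Hypothetical Viterbi output for one input recording.
decoded = ["frog", "owl", "frog", "frog", "cricket"]
result = most_frequent_model(decoded)
```

A majority vote of this kind makes the final result robust to a few misrecognized segments, which may be the motivation for the approach.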

A study on the application of residual vector quantization for vector quantized-variational autoencoder-based foley sound generation model

  • 이석진
    • 한국음향학회지
    • /
    • Vol. 43, No. 2
    • /
    • pp.243-252
    • /
    • 2024
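  • Among the Foley sound generation models that have recently begun to be studied, generation techniques that combine a Vector Quantized-Variational AutoEncoder (VQ-VAE) structure with generative models such as Pixelsnail are an important research subject. Meanwhile, in deep-learning-based compression and reconstruction of audio signals, residual vector quantization has been reported to be more suitable than the conventional VQ-VAE structure. This paper therefore investigates whether residual vector quantization can also be applied effectively to Foley sound generation. To this end, residual vector quantization is applied to an existing VQ-VAE-based Foley sound generation model, with the design constrained to stay compatible with existing models such as Pixelsnail and not to increase the consumption of computational resources. To verify the effect, experiments were conducted on the DCASE2023 Task7 data, and the results showed an average improvement of about 0.3 in Fréchet audio distance. The improvement was limited, however, which is attributed to the reduced time-frequency resolution needed to keep the computational cost unchanged.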
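For readers unfamiliar with residual vector quantization, the general idea is that each stage quantizes the error left by the previous stage. The sketch below is a generic illustration, not the model proposed in the paper: the function names and the tiny hand-written codebooks are assumptions, and a real system would learn the codebooks during training.

```python
def nearest(codebook, vec):
    """Index of the codeword closest to vec (squared Euclidean distance)."""
    return min(
        range(len(codebook)),
        key=lambda i: sum((c - v) ** 2 for c, v in zip(codebook[i], vec)),
    )


def rvq_encode(vec, codebooks):
    """Quantize vec stage by stage; each stage encodes the remaining residual."""
    residual = list(vec)
    indices = []
    for codebook in codebooks:
        idx = nearest(codebook, residual)
        indices.append(idx)
        residual = [r - c for r, c in zip(residual, codebook[idx])]
    return indices, residual


def rvq_decode(indices, codebooks):
    """Reconstruct the vector as the sum of the selected codewords."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


# Toy two-stage codebooks (hypothetical values, not trained).
codebooks = [
    [[0.0, 0.0], [1.0, 1.0]],      # coarse stage
    [[0.0, 0.0], [0.25, -0.25]],   # refinement stage
]
indices, residual = rvq_encode([1.2, 0.8], codebooks)
reconstruction = rvq_decode(indices, codebooks)
```

Each added stage refines the reconstruction without enlarging any single codebook, which is one reason the technique is attractive for audio compression.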

A Range Dependent Structural HRTF Model for 3-D Sound Generation in Virtual Environments

  • 이영한;김홍국
    • 대한음성학회지:말소리
    • /
    • No. 59
    • /
    • pp.89-99
    • /
    • 2006
  • This paper proposes a new structural head-related transfer function (HRTF) model for producing sounds in a virtual environment. The model generates 3-D sound using a head model, a pinna model, and the proposed distance model, which account for azimuth, elevation, and distance, respectively, the three aspects of 3-D sound. The proposed distance model consists of a level normalization block, a distal region model, and a proximal region model. To evaluate the model, we set up an experiment in which each listener judges the distance of 3-D sound sources generated by the proposed method at predefined distances. The tests show that the proposed model yields an average distance error of 0.13 to 0.31 m when the sound source is rendered 0.5 to 2 m from the listener. This result is comparable to the average distance error of human listeners for actual sound sources.

  • PDF

Study on the Development of Integrated Vibration and Sound Generator

  • 신태명;안진철
    • 한국소음진동공학회논문집
    • /
    • Vol. 13, No. 11
    • /
    • pp.875-881
    • /
    • 2003
  • A mobile phone normally signals an incoming call through two independent means: sound from a speaker and vibration from a vibration motor. To meet consumer demand for lighter and smaller phones, this research designs and develops an integrated vibration and sound generating device. To this end, the optimal shapes of the voice coil, the permanent magnet, and the vibration plate are designed, and the excitation force applied to the vibration system of the new device is estimated and verified through theoretical analysis, computer simulation, and experiments using an expanded model. In addition, the vibration performance of the device is compared with that of an existing vibration motor, and the overall process establishes a method and procedure for analyzing the vibration performance of the integrated vibration and sound generating device.

Sound Radiation Property of Tribo-System

  • Stoimenov, B.L.;Kato, K.;Adachi, K.
    • 한국윤활학회:학술대회논문집
    • /
    • 한국윤활학회 2002, Proceedings of the Second Asia International Conference on Tribology
    • /
    • pp.383-384
    • /
    • 2002
  • Frictional sound is observed in a great many practical systems, but its generation mechanism is still unknown. Model systems are best suited for research on the fundamental mechanisms, but the results cannot easily be applied to real systems because each system has different sound radiation properties. At present, there is no easy method for evaluating these properties. We propose to describe the sound radiation property of a tribo-system by the relationship between the friction-induced sound power and the friction-induced vibration velocity of the contact element. The sound power of a tribo-system was found to be linearly proportional to the mean-square velocity of the sliding element, with a constant coefficient that has the dimension of mass flow rate (kg/s).

  • PDF
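The linear relationship reported in the abstract above can be written compactly. The symbols $W$ (friction-induced sound power), $\langle v^{2} \rangle$ (mean-square vibration velocity of the sliding element), and $C$ (the constant coefficient) are notation assumed here, not taken from the paper. The second expression only checks that a coefficient in kg/s gives a power in watts.

```latex
W = C \,\langle v^{2} \rangle ,
\qquad
[C]\,[\langle v^{2} \rangle]
  = \frac{\mathrm{kg}}{\mathrm{s}} \cdot \frac{\mathrm{m}^{2}}{\mathrm{s}^{2}}
  = \frac{\mathrm{kg}\,\mathrm{m}^{2}}{\mathrm{s}^{3}}
  = \mathrm{W}
```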

3-D Sound-Field Creation Implementing the Virtual Reality Ship Handling Simulator (I): HRTF Modeling

  • 임정빈
    • 한국항해학회지
    • /
    • Vol. 22, No. 3
    • /
    • pp.17-25
    • /
    • 1998
  • This paper describes elemental technologies for creating a three-dimensional (3-D) sound field to implement a next-generation ship handling simulator with human-computer interaction, known as virtual reality. In a virtual reality system, head-related transfer functions (HRTFs) are used to generate the 3-D sound environment. The HRTFs are impulse responses characterizing the acoustical transformation in a space. The work is divided into two parts: Part I covers the construction of HRTF models, and Part II covers the control of the 3-D sound field using the HRTFs. In this paper, the first part, we examine the theory for formulating HRTF models that reduce the dimensionality of the formulation without loss of any directional information. Using the model HRTFs, we report results from psychophysical tests used to assess the validity of the proposed modeling method.

  • PDF

An Analysis of Vibration and Sound Radiation of Sandwich Panels Using the Rayleigh-Ritz Method

  • 김동규;김재현;전진용;박준홍
    • 한국소음진동공학회논문집
    • /
    • Vol. 21, No. 5
    • /
    • pp.430-436
    • /
    • 2011
  • The purpose of this study is to analyze the vibration and sound generation characteristics of a sandwich panel. Two thick panels were assumed to be separated by a compliant viscoelastic core. The transverse vibration induced by an external impact was analyzed using the Rayleigh-Ritz method. To apply arbitrary boundary conditions to the panels, the edges were assumed to be supported by translational and rotational springs. Beam functions were used as the trial functions. The effect of the boundary conditions and the viscoelastic core on the resulting vibration characteristics was investigated. The radiated sound power was analyzed using the proposed numerical model and the Rayleigh integral. The dynamic properties of the core and the mass-stiffness-mass resonance frequency had a significant influence on the impact sound.

Modeling of Ultrasonic Testing in Butt Joint by Ray Tracing

  • Nam, Young-Hyun
    • Journal of Mechanical Science and Technology
    • /
    • Vol. 15, No. 4
    • /
    • pp.441-447
    • /
    • 2001
  • Ultrasonic wave generation and propagation were modeled with a ray model to simulate an ultrasonic test. The actual sound pressure distribution of the incident wave from an angle probe was analyzed using an ultrasonic visualization method and incorporated into the model. In this method, the sound pressure was expressed by the density of rays and the reflection coefficient of ultrasonic beams. Reflection and mode conversion of rays were computed using Snell's law. Simulation programs for ultrasonic testing of a butt joint were built using this ray model. The simulation results for ultrasonic wave scattering from a defect and for the A-scan display agreed with the experimental results.

  • PDF
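The ray-tracing abstract above computes reflection and mode conversion with Snell's law. A minimal sketch of that single step follows. The function name `snell_angle` and the wave speeds are assumptions for illustration (approximate textbook values for steel), and the paper's actual simulation also tracks ray density and reflection coefficients, which are not modeled here.

```python
import math


def snell_angle(theta_in_deg, c_in, c_out):
    """Outgoing ray angle in degrees from Snell's law.

    sin(theta_in) / c_in = sin(theta_out) / c_out
    Returns None when no real angle exists (beyond the critical angle).
    """
    s = math.sin(math.radians(theta_in_deg)) * c_out / c_in
    if abs(s) > 1.0:
        return None
    return math.degrees(math.asin(s))


# Illustrative wave speeds in m/s (approximate values for steel; assumption).
C_LONGITUDINAL = 5900.0
C_SHEAR = 3230.0

# A shear ray meets a boundary at 20 degrees; the mode-converted
# longitudinal ray leaves at a larger angle because it travels faster.
theta_longitudinal = snell_angle(20.0, C_SHEAR, C_LONGITUDINAL)
```

The same function covers both refraction across an interface and mode conversion on reflection, since only the two wave speeds change.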

Brake Squeal Noise Due to Disk Misalignment

  • 박주표;최연선
    • 대한기계학회:학술대회논문집
    • /
    • 대한기계학회 2003 Autumn Conference
    • /
    • pp.1690-1695
    • /
    • 2003
  • To investigate the mechanism of brake squeal noise, the sound and vibration of an actual brake system were measured using a brake dynamometer. The experimental results show that disk run-out varies with brake line pressure and that squeal is caused by the run-out due to misalignment of the brake disk. A three-degree-of-freedom friction model is developed for the disk brake system, taking into account the run-out effect and the nonlinear friction characteristic. The numerical results of the model agree well with the experimental results. A stability analysis of the model was also performed to predict the generation of brake squeal when design parameters of the brake system are modified. The results show that squeal generation depends on the run-out rather than on the friction characteristic between the pad and the disk.

  • PDF

Investigation of the Dynamic Properties of Railway Tracks using a Model for Calculation of Generation of Wheel/Rail Noise

  • Koh, Hyo-In;Nordborg, Anders
    • International Journal of Railway
    • /
    • Vol. 7, No. 4
    • /
    • pp.109-116
    • /
    • 2014
  • To optimize a low-noise track system, rail vibration and noise radiation need to be investigated. The main parameters influencing noise radiation, and quantitative results for each track system, can be obtained using a calculation model of the generation and radiation of railway noise. Such a model includes contact modeling and a calculation model of the dynamic properties of the wheel and the rail. This study used a nonlinear wheel/rail interaction model in the time domain to investigate the excitation of rolling noise. The wheel/rail response is determined by time-integrating the Green's function of the rail together with force impulses from the wheel/rail contact. The model and the results of the study can be used to support calculations with the conventional model by adding the contributions due to nonlinearities to the roughness spectrum.
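The time-integration step mentioned in this abstract, combining a Green's function with a sequence of force impulses, amounts to a discrete convolution once the contact force history is known. The sketch below shows only that linear superposition. The function name, the sample values, and the time step are hypothetical, and the nonlinear contact model that produces the force in the paper is not represented.

```python
def response_from_impulses(green, force, dt):
    """Discrete convolution y[n] = dt * sum_k green[n - k] * force[k].

    green: sampled Green's function (response to a unit impulse)
    force: sampled contact force history
    dt:    time step shared by both sequences
    """
    y = []
    for n in range(len(force)):
        acc = 0.0
        for k in range(n + 1):
            if n - k < len(green):
                acc += green[n - k] * force[k]
        y.append(acc * dt)
    return y


# Hypothetical decaying Green's function and a short force history.
green = [1.0, 0.5, 0.25]
force = [2.0, 1.0, 0.0]
response = response_from_impulses(green, force, dt=1.0)
```

In a nonlinear time-domain model the force at each step would depend on the response computed so far, so this loop would be interleaved with a contact-force update rather than run once over a fixed force history.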