Search | Korea Science

Animal Sounds Classification Scheme Based on Multi-Feature Network with Mixed Datasets

Kim, Chung-Il;Cho, Yongjang;Jung, Seungwon;Rew, Jehyeok;Hwang, Eenjun
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.14 no.8
- /
- pp.3384-3398
- /
- 2020
In recent years, as the environment has become an important issue in dealing with food, energy, and urban development, diverse environment-related applications such as environmental monitoring and ecosystem management have emerged. In such applications, automatic classification of animals using video or sound is very useful in terms of cost and convenience. So far, many works have been done for animal sounds classification using artificial intelligence techniques such as a convolutional neural network. However, most of them have dealt only with the sound of a specific class of animals such as bird sounds or insect sounds. Due to this, they are not suitable for classifying various types of animal sounds. In this paper, we propose a sound classification scheme based on a multi-feature network for classifying sounds of multiple species of animals. To do that, we first collected multiple animal sound datasets and grouped them into classes. Then, we extracted their audio features by generating mixed records and used those features for training. To evaluate the effectiveness of our scheme, we constructed an animal sound classification model and performed various experiments. We report some of the results.
https://doi.org/10.3837/tiis.2020.08.013 인용 PDF KSCI HTML

Fuzzy Logic Based Sound Source Localization System Using Sound Strength in the Underground Parking Lot (지하주차장에서 음의 세기를 이용한 퍼지로직 기반 음원 위치추정 시스템)

Choi, Chang Yong;Lee, Dong Myung
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.38C no.5
- /
- pp.434-439
- /
- 2013
It is very difficult to monitor the blind spots that are not recognized by traditional surveillance camera (CCTV) systems, and the surveillance efficiencies are very low though many accidents/events can be solved by the systems. In this paper, the fuzzy logic based sound source localization system using sound strength in the underground parking lot is suggested and the performance of the system is analyzed in order to enhance the stabilization and the accuracy of the localization algorithm in the suggested system. It is confirmed that the localization stabilization of the localization algorithm (SLA_fuzzy) using the fuzzy logic in the suggested system is 4 times higher than that of the conventional localization algorithm (SLA). In addition to this, the localization accuracy of the SLA_fuzzy in the suggested system is 29% higher than that of the SLA.
https://doi.org/10.7840/kics.2013.38C.5.434 인용 PDF KSCI

Estimation of sound radiation for a flat plate by using BEM and vibration experiment (경계요소 해석과 진동 실험을 이용한 단순 평판의 방사 음향 예측)

김관주;김정태;최승권
- Journal of KSNVE
- /
- v.10 no.5
- /
- pp.843-848
- /
- 2000
BEA(Boundary Element Analysis) based on Kirchhoff-Helmholtz integral equation is widely used in the prediction of sound radiation problems of vibrating structures. Accurate estimation of sound pressure distribution by BEA can be [possible if and only if dynamic behavior of the relating structure was described correctly. Another plausible method of sound radiation phenomena could be the NAH(Nearfield Acoustic Holography) method. NAH also based on the identical governing equation with BEA could be one of the best acoustic imaging schemes but it has disadvantages of the complexity of measurement and of the need of large amount of measuring points. In this paper, modal expansion method is presented for taking accurate dynamic data of the structures efficiently. This method makes use of vibration principle an arbitrary dynamic behavior of the structure is described by the summation of that structures mode shapes which can be calculated by FEA easily and accurately. Sound pressure field from a vibration flat plate is calculated using the combination of vibration signal on that flat plate from experiment, and of the natural mode shapes form FEA. When sound pressure field from vibration signal is calculated the importance of the phase information was emphasized.
PDF

An advertisement method using inaudible sound of speaker

Chung, Myoungbeom
- Journal of the Korea Society of Computer and Information
- /
- v.20 no.8
- /
- pp.7-13
- /
- 2015
Recently, there are serviced user customized advertisement of various type using smart device. Representative services are advertisement service using light of smart TV screen or audible sound of smart TV to transmit advertisement information. However, those services have to do a specific action of smart device user for advertisement information or need audible audio information of TV contents. To overcome those weakness, therefore, we propose an advertisement method using inaudible sound of speaker based on smart device. This method supports the transfer of advertising content to the smart device user with no additional action or TV audio signal required to access that content. The proposed method used two high frequencies among 18kHz ~ 22kHz of audible frequency range which smart TV can send out. And it generates those frequencies synthesized with audio of TV contents as trigger signal which can send advertisements to smart device. Next, smart device analysis the trigger signal and request advertisement contents related to the signal to server. After then, smart device can show the downloaded contents to user. Because the proposed method uses the high frequencies of sound signals via the inner speaker of the smart device, its main advantage is that it does not affect the audio signal of TV content. To evaluate the efficacy of the proposed method, we developed an application to implement it and subsequently carried out an advertisement transmission experiment. The success rate of the transmission experiment was approximately 97%. Based on this result, we believe the proposed method will be a useful technique in introducing a customized user advertising service.
https://doi.org/10.9708/jksci.2015.20.8.007 인용 PDF KSCI

Sound Diffusion Control for the Localized Sound Image Using Time Delay (방향 정위된 음원에 시간지연을 이용한 확산감 제어에 관한 연구)

김익형;정의필
- Proceedings of the IEEK Conference
- /
- 2001.06d
- /
- pp.135-138
- /
- 2001
Many researchers have developed the techniques of an efficient 3-D sound system based on the psycho-acoustics of spatial hearing with multimedia or virtual reality In this paper, we propose an idea for the improved 3-D sound system using conventional stereo headphones to obtain a better sound diffusion from the mono-sound recorded at an anechoic chamber. We use the HRTF (Head Related Transfer Function) for the sound localization and the wavelet filter bank with time delay for the sound diffusion. We investigate the effects of the 3-B sound depending on the length of time delay at lowest frequency band. Also the correlation coefficient of the signals between the left channel and the right channel is measured to identify the sound diffusion.
PDF

Development of three-dimensional sound effects system for virtual reality (가상환경용 3차원 입체음향 시스템 개발)

Yang, Si-Young;Kim, Dong-Hyung;Jeong, Je-Chang
- Journal of Broadcast Engineering
- /
- v.13 no.5
- /
- pp.574-585
- /
- 2008
3D sound is of central importance for the virtual reality system, and is becoming increasingly important for the auditory displays and for the human-computer interaction. In this paper, we propose a novel real-time 3D sound representation system for virtual reality. At first, we propose a calculation method of the impulse response for virtual space. To transmit the information of the virtual space, we propose an enhanced DXF file type that contains the material information. And then, we implement the multi-channel sound panning system. we perform the experiment based on computer simulation and prove the utility of the proposed method.
https://doi.org/10.5909/JBE.2008.13.5.574 인용 PDF KSCI

A system for recommending audio devices based on frequency band analysis of vocal component in sound source (음원 내 보컬 주파수 대역 분석에 기반한 음향기기 추천시스템)

Jeong-Hyun, Kim;Cheol-Min, Seok;Min-Ju, Kim;Su-Yeon, Kim
- Journal of Korea Society of Industrial Information Systems
- /
- v.27 no.6
- /
- pp.1-12
- /
- 2022
As the music streaming service and the Hi-Fi market grow, various audio devices are being released. As a result, consumers have a wider range of product choices, but it has become more difficult to find products that match their musical tastes. In this study, we proposed a system that extracts the vocal component from the user's preferred sound source and recommends the most suitable audio device to the user based on this information. To achieve this, first, the original sound source was separated using Python's Spleeter Library, the vocal sound source was extracted, and the result of collecting frequency band data of manufacturers' audio devices was shown in a grid graph. The Matching Gap Index (MGI) was proposed as an indicator for comparing the frequency band of the extracted vocal sound source and the measurement data of the frequency band of the audio devices. Based on the calculated MGI value, the audio device with the highest similarity with the user's preference is recommended. The recommendation results were verified using equalizer data for each genre provided by sound professional companies.
https://doi.org/10.9723/jksiis.2022.27.6.001 인용 PDF KSCI

A Study on Elemental Technology Identification of Sound Data for Audio Forensics (오디오 포렌식을 위한 소리 데이터의 요소 기술 식별 연구)

Hyejin Ryu;Ah-hyun Park;Sungkyun Jung;Doowon Jeong
- Journal of the Korea Institute of Information Security & Cryptology
- /
- v.34 no.1
- /
- pp.115-127
- /
- 2024
The recent increase in digital audio media has greatly expanded the size and diversity of sound data, which has increased the importance of sound data analysis in the digital forensics process. However, the lack of standardized procedures and guidelines for sound data analysis has caused problems with the consistency and reliability of analysis results. The digital environment includes a wide variety of audio formats and recording conditions, but current audio forensic methodologies do not adequately reflect this diversity. Therefore, this study identifies Life-Cycle-based sound data elemental technologies and provides overall guidelines for sound data analysis so that effective analysis can be performed in all situations. Furthermore, the identified elemental technologies were analyzed for use in the development of digital forensic techniques for sound data. To demonstrate the effectiveness of the life-cycle-based sound data elemental technology identification system presented in this study, a case study on the process of developing an emergency retrieval technology based on sound data is presented. Through this case study, we confirmed that the elemental technologies identified based on the Life-Cycle in the process of developing digital forensic technology for sound data ensure the quality and consistency of data analysis and enable efficient sound data analysis.
https://doi.org/10.13089/JKIISC.2024.34.1.115 인용 PDF HTML

Signal Enhancement of a Variable Rate Vocoder with a Hybrid domain SNR Estimator

Park, Hyung Woo
- KSII Transactions on Internet and Information Systems (TIIS)
- /
- v.13 no.2
- /
- pp.962-977
- /
- 2019
The human voice is a convenient method of information transfer between different objects such as between men, men and machine, between machines. The development of information and communication technology, the voice has been able to transfer farther than before. The way to communicate, it is to convert the voice to another form, transmit it, and then reconvert it back to sound. In such a communication process, a vocoder is a method of converting and re-converting a voice and sound. The CELP (Code-Excited Linear Prediction) type vocoder, one of the voice codecs, is adapted as a standard codec since it provides high quality sound even though its transmission speed is relatively low. The EVRC (Enhanced Variable Rate CODEC) and QCELP (Qualcomm Code-Excited Linear Prediction), variable bit rate vocoders, are used for mobile phones in 3G environment. For the real-time implementation of a vocoder, the reduction of sound quality is a typical problem. To improve the sound quality, that is important to know the size and shape of noise. In the existing sound quality improvement method, the voice activated is detected or used, or statistical methods are used by the large mount of data. However, there is a disadvantage in that no noise can be detected, when there is a continuous signal or when a change in noise is large.This paper focused on finding a better way to decrease the reduction of sound quality in lower bit transmission environments. Based on simulation results, this study proposed a preprocessor application that estimates the SNR (Signal to Noise Ratio) using the spectral SNR estimation method. The SNR estimation method adopted the IMBE (Improved Multi-Band Excitation) instead of using the SNR, which is a continuous speech signal. Finally, this application improves the quality of the vocoder by enhancing sound quality adaptively.
https://doi.org/10.3837/tiis.2019.02.026 인용 PDF KSCI HTML

Sound Power Evaluation of Various Domestic Railroad Vehicles (국내 철도 차량의 음향발생 특성에 대한 비교 연구)

Kim, Jeung-Tae;Cho, Sung-Ho
- Journal of the Korean Society for Railway
- /
- v.2 no.1
- /
- pp.28-37
- /
- 1999
Many residential areas are situated near to railroad tracks so that a railroad noise has been one of the major environmental issues. In this paper two important aspects have been investigated in order to properly evaluate the railroad vehicle noise : sound power levels for different types and sound propagation characteristics of the railroad vehicles. For noise source characteristics of railroad vehicles, sound power values for various types of trains that are in active service have been measured. In this paper, domestic railroad vehicles are measured and compared with high speed train(TGV). Based on sound power information of railway vehicles, prediction on the sound pressure level and equivalent noise level near to railway areas have been evaluated.
PDF

Search Result 637, Processing Time 0.024 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)