A Deep Learning System for Emotional Cat Sound Classification and Generation

Joo Yong Shim;SungKi Lim;Jong-Kook Kim;

doi:10.3745/TKIPS.2024.13.10.492

정보처리학회 논문지 (The Transactions of the Korea Information Processing Society)

제13권10호
/
Pages.492-496
/
2024
/
3022-7011(eISSN)

한국정보처리학회 (Korea Information Processing Society)

DOI QR Code

감정별 고양이 소리 분류 및 생성 딥러닝 시스템

A Deep Learning System for Emotional Cat Sound Classification and Generation

심주용 (고려대학교 정보통신기술연구소) ;
임성기 ((주)애니멀보이스) ;
김종국 (고려대학교 전기전자공학과)

투고 : 2024.08.29
심사 : 2024.09.06
발행 : 2024.10.31

https://doi.org/10.3745/TKIPS.2024.13.10.492 인용 PDF

PDF 다운로드

⟨ 이전 논문 다음 논문 ⟩

초록

반려동물, 특히 고양이는 인간과의 상호작용에서 다양한 소리를 통해 감정을 표현하는 것으로 알려져 있다. 고양이의 소리는 그들이 느끼는 감정 상태를 반영하며, 이를 이해하고 해석하는 것은 반려동물과의 소통을 더욱 원활하게 하는 데 중요한 요소이다. 최근 인공지능 기술의 발전으로 감정 인식과 관련된 연구가 활발히 진행되고 있으며, 특히 딥러닝 모델을 활용한 음성 데이터 분석이 주목받고 있다. 본 연구는 이러한 배경에서 출발하여, 고양이의 소리를 감정별로 분류하고 생성하는 딥러닝 시스템을 개발하는 것을 목표로 한다. 분류 모델은 고양이 소리를 감정별로 정확하게 분류하기 위해 학습되며, 소리 생성 모델은 SampleRNN과 같은 딥러닝 기법을 활용하여 특정 감정을 표현하는 고양이 소리를 생성할 수 있도록 설계된다. 마지막으로, 학습된 두 모델을 통합하여 고양이 소리를 녹음하고 이를 감정별로 분류한 결과 및 사용자의 요구에 따른 고양이 소리를 생성하여 제공할 수 있는 시스템을 제안한다.

Cats are known to express their emotions through a variety of vocalizations during interactions. These sounds reflect their emotional states, making the understanding and interpretation of these sounds crucial for more effective communication. Recent advancements in artificial intelligence has introduced research related to emotion recognition, particularly focusing on the analysis of voice data using deep learning models. Building on this background, the study aims to develop a deep learning system that classifies and generates cat sounds based on their emotional content. The classification model is trained to accurately categorize cat vocalizations by emotion. The sound generation model, which uses deep learning based models such as SampleRNN, is designed to produce cat sounds that reflect specific emotional states. The study finally proposes an integrated system that takes recorded cat vocalizations, classify them by emotion, and generate cat sounds based on user requirements.

키워드

참고문헌

G. R. Farley, S. M. Barlow, R. Netsell, and J. V. Chmelka, "Vocalizations in the cat: behavioral methodology and spectrographic analysis," Experimental Brain Research, Vol.89, pp.333-340, 1992.
Y. Aytar, C. Vondrick, and A. Torralba, "Soundnet: Learning sound representations from unlabeled video," in Proceedings of the International Conference on Neural Information Processing Systems, Spain, pp.892-900, 2016.
A. M. Badshah, J. Ahmad, N. Rahim, and S. W. Baik, "Speech Emotion Recognition from Spectrograms with Deep Convolutional Neural Network," in Proceedings of the International Conference on Platform Technology and Service, Korea(South), pp.1-5, 2017.
Y. Zhou, Z. Wang, C. Fang, T. Bui, and T. L. Berg, "Visual to Sound: Generating Natural Sound for Videos in the Wild," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, USA, pp. 3550-3558, 2018.
J . Engel, K. K. Agrawal, S. Chen, I. Gulrajani, C. Donahue, and A. Roberts, "Gansynth: Adversarial neural audio synthesis," arXiv preprint arXiv:1902.08710, 2019.
E. Kucukkulahli and A. T. Kabakus, "Towards Understanding Cat Vocalizations: A Novel Cat Sound Classification Model Based on Vision Transformers," Applied Acoustics, Vol.226, 110218, 2024.
Y. R. Pandeya and J. Lee, "Domestic cat sound classification using transfer learning," The International Journal of Fuzzy Logic and Intelligent Systems, Vol.18, pp.154-160, 2018.
S. Mehri, K. Kumar, I. Gulrajani, R. Kumar, S. Jain, J. Sotelo, A. Courville, Aaron, and Y. Bengio, "SampleRNN: An Unconditional End-to-End Neural Audio Generation Model," in Proceedings of the International Conference on Learning Representations, 2017.

정보처리학회 논문지 (The Transactions of the Korea Information Processing Society)

감정별 고양이 소리 분류 및 생성 딥러닝 시스템

A Deep Learning System for Emotional Cat Sound Classification and Generation

초록

키워드

참고문헌

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

자세히 찾기

이미지 검색 (β)