Search | Korea Science

Human Laughter Generation using Hybrid Generative Models

Mansouri, Nadia;Lachiri, Zied
- KSII Transactions on Internet and Information Systems (TIIS) / v.15 no.5 / pp.1590-1609 / 2021
Laughter is one of the most important nonverbal sounds that humans generate. It is a means of expressing emotions. The acoustic and contextual features of this specific sound differ from those of speech, and many difficulties arise in modeling them. In this work, we propose an audio laughter generation system based on unsupervised generative models: the autoencoder (AE) and its variants. The procedure combines three main sub-processes: (1) analysis, which consists of extracting the log magnitude spectrogram from the laughter database; (2) training of the generative models; and (3) synthesis, which involves an intermediate mechanism, the vocoder. To improve the synthesis quality, we suggest hybrid models (LSTM-VAE, GRU-VAE and CNN-VAE) that combine the representation learning capacity of the variational autoencoder (VAE) with the temporal modelling ability of a long short-term memory RNN (LSTM) and the ability of the CNN to learn invariant features. To assess the performance of the proposed audio laughter generation process, an objective evaluation (RMSE) and a perceptual audio quality test (listening test) were conducted. According to these evaluation metrics, the GRU-VAE outperforms the other VAE models.
https://doi.org/10.3837/tiis.2021.05.001
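The analysis stage and the objective metric named in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the frame length, hop size, Hann window, naive DFT, toy input tone, and the function names `log_magnitude_spectrogram` and `rmse` are all hypothetical choices made here.

```python
import cmath
import math


def log_magnitude_spectrogram(signal, frame_len=64, hop=32, eps=1e-10):
    """Return a list of frames; each frame holds log-magnitudes for bins 0..frame_len//2."""
    # Hann window (assumed; the abstract does not name a window)
    window = [0.5 - 0.5 * math.cos(2 * math.pi * n / frame_len)
              for n in range(frame_len)]
    frames = []
    for start in range(0, len(signal) - frame_len + 1, hop):
        frame = [signal[start + n] * window[n] for n in range(frame_len)]
        spectrum = []
        for k in range(frame_len // 2 + 1):
            # Naive DFT for clarity; a practical system would use an FFT
            acc = sum(frame[n] * cmath.exp(-2j * math.pi * k * n / frame_len)
                      for n in range(frame_len))
            # eps avoids log(0) on empty bins
            spectrum.append(math.log(abs(acc) + eps))
        frames.append(spectrum)
    return frames


def rmse(reference, generated):
    """Root-mean-square error between two equally shaped spectrograms."""
    diffs = [(r - g) ** 2
             for ref_frame, gen_frame in zip(reference, generated)
             for r, g in zip(ref_frame, gen_frame)]
    return math.sqrt(sum(diffs) / len(diffs))


# Toy input: a 1 kHz tone sampled at 8 kHz, standing in for a laughter recording
sample_rate = 8000
tone = [math.sin(2 * math.pi * 1000 * n / sample_rate) for n in range(256)]
spec = log_magnitude_spectrogram(tone)

# Strongest bin of the first frame: bin 8, i.e. 8 * 8000 / 64 = 1000 Hz
peak_bin = max(range(len(spec[0])), key=lambda k: spec[0][k])
```

In a setup like the one described, `rmse` would compare the spectrogram of a reference recording with the one produced by a generative model; lower values indicate a closer reconstruction.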

Recognition of Overlapped Sound and Influence Analysis Based on Wideband Spectrogram and Deep Neural Networks (광역 스펙트로그램과 심층신경망에 기반한 중첩된 소리의 인식과 영향 분석)

Kim, Young Eon;Park, Gooman
- Journal of Broadcast Engineering / v.23 no.3 / pp.421-430 / 2018
Many voice recognition systems use methods such as MFCC and HMM to recognize the human voice. This recognition method is designed to analyze only a targeted sound, which normally appears between a human and a device. However, recognition capability is limited for a mixture of diverse sounds spanning a wider frequency range, such as dog barking and indoor sounds. The frequency of overlapped sound spans a wide range, up to 20 kHz, which is higher than that of a voice. This paper proposes a new recognition method that covers a wider frequency range by combining the Wideband Sound Spectrogram (WSS) and the Keras Sequential Model (KSM) based on DNN. The WSS is adopted to analyze and verify diverse sounds across a wide frequency range, as it is designed both to extract features and to support classification. The KSM is employed for pattern recognition using features extracted from the WSS to improve sound recognition quality. The experiment verified that the proposed WSS and KSM classified the targeted sound well in noisy environments containing overlapped sounds such as dog barking and indoor sounds. Furthermore, the paper presents a stage-by-stage analysis and comparison of the factors' influence on recognition and its characteristics at various noise levels.
https://doi.org/10.5909/JBE.2018.23.3.421

A COMPUTER ANALYSIS ON THE KOREAN CONSONANT SOUND DISTORTION IN RELATION TO THE PALATAL PLATE THICKNESS -Dentoalveolar and hard palatal consonant- (구개상의 두께에 따른 한국어 자음의 발음 변화에 관한 컴퓨터 분석 - 치조음, 경구개음-)

Woo, Yi-Hyung;Choi, Dae-Kyun;Choi, Boo-Byung;Park, Nam-Soo
- The Journal of Korean Academy of Prosthodontics / v.25 no.1 / pp.71-94 / 1987
This study was carried out to investigate the sound distortion that follows alteration of the palatal plate thickness. For this study, 2 healthy male subjects (24 years old) were selected. Born in Seoul, they both spoke the Seoul dialect. First, their sounds of /na(나)/, /da(다)/, /la(라)/, /ja(자)/, /cha(차)/, /ta(타)/ were recorded without plates inserted, and then the sounds with palatal plates of different thickness were recorded in succession. The plates were fabricated in 3 types: a palatal thickness of 1.0 mm; a palatal thickness of 2.5 mm; and a dentoalveolar portion of 2.5 mm with the remaining portion 1.0 mm. These were named B-, C-, and D-type, respectively. A series of analyses was performed on a 16-bit computer to analyze the sound distortions. The recordings were analyzed by LPC (without weighting, pre-weighting, post-weighting) of the consonant and vowel portions, formant frequency of the vowels, and word duration of the consonants. The findings led to the following conclusions: 1. The distortion rates of the 2 informants were not correlated. 2. Generally, vowels were not affected by the palatal plate thickness in the formant analysis; however, more distortion was detected in the LPC analysis, especially with the C- and D-type plates. 3. Consonant distortion was more evident with the C- and D-type plates. 4. The second formant was the most disturbed and reduced in all consonants with insertion of the palatal plate, especially the C- and D-type plates. 5. Word duration was shortened with the plate inserted (except /ja/, /cha/), especially with the C- and D-type plates. 6. Dentoalveolar and hard palatal sounds were severely distorted with the plate inserted, and they were mainly affected by the thickness of the dentoalveolar portion. 7. There was a correlation between palatal thickness and consonant quality.

Efficient Design of Plate Spring for Improving Performance of Sound Wave Vibration Massage Chair (음파진동 안마의자제품의 성능향상을 위한 판스프링의 효율적 설계)

Kim, Chang-Gyum;Park, Soo-Yong;Jo, Eun-Hyeon;Lee, Dong-Hyung
- Journal of Korean Society of Industrial and Systems Engineering / v.42 no.4 / pp.1-7 / 2019
The customer base for massage chairs is expanding day by day from the middle-aged to all ages. In 2018, the market size was 700 billion KRW, an increase of 30 times over 10 years. However, most related SMEs suffer from excessive competition caused by the market monopoly of a few major companies. In this situation, a related company must steadily research and develop new products in order to survive. Founded in 2009, company L produces massage chairs for the health and relaxation of customers. L's products use a sound wave vibration module that is favorable for the human body, unlike other products that use vibration motors. However, frequent breakdowns of the massage chair due to the vulnerability of the plate (leaf) springs, which play an important role in sound wave vibration modules, have sapped its competitiveness. In this paper, we propose a method for designing a desirable plate spring structure by sequentially experimenting with five different plate springs. The results of this study are expected to contribute to improving the quality of the plate spring and the reliability of the sound wave vibration module. In the future, it will be necessary to find ways to apply it to the development of foot massage or scalp care devices, and to continue research on the optimal plate spring structure through various analyses.
https://doi.org/10.11627/jkise.2019.42.4.001

Relationship Between the Resonance Frequency and Q_TS for Microspeaker (마이크로스피커에서 공명진동수와 Q_TS 사이의 연관성)

Oh, Sei-Jin
- Korean Journal of Materials Research / v.21 no.7 / pp.403-409 / 2011
Micro speakers are used to reproduce sound in small electronic and information and communications devices, such as cellular phones, PMPs, and MP3 players. The acoustical properties and sound quality, which change as the speaker size decreases, are often adjusted by varying the type and thickness of the diaphragm. The most widely used diaphragm material is thin polymer. The author previously reported that the resonance frequency of a micro speaker changes with the type and thickness of the polymer diaphragm. In this paper, the frequency response near the resonance frequency of a micro speaker was studied as a function of the type and thickness of the polymer diaphragm. While $R_{max}$ and $R_{DC}$ were affected by the type and thickness, an analysis of the electrical impedance curve revealed that $R_o(= R_{max}/R_{DC})$ and $\Delta f$ were unchanged. Thus $Q_{TS}$, which is a function of $R_o$, $\Delta f$, and the resonance frequency, depends only on the resonance frequency. An increase in the resonance frequency led to a proportional rise in $Q_{TS}$. The change in the frequency response near the resonance frequency did not depend on the type or thickness of the polymer diaphragm, but was affected by the resonance frequency.
https://doi.org/10.3740/MRSK.2011.21.7.403
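The proportionality stated in this abstract is consistent with the standard Thiele-Small relations for an impedance curve. The abstract does not give its formulas, so the following is an assumption: $f_s$ denotes the resonance frequency, and $\Delta f = f_2 - f_1$ is the width between the two frequencies at which the impedance magnitude equals $\sqrt{R_{max} R_{DC}}$.

$$
Q_{MS} = \frac{f_s \sqrt{R_o}}{\Delta f}, \qquad
Q_{ES} = \frac{Q_{MS}}{R_o - 1}, \qquad
Q_{TS} = \frac{Q_{MS}\, Q_{ES}}{Q_{MS} + Q_{ES}} = \frac{Q_{MS}}{R_o} = \frac{f_s}{\Delta f \sqrt{R_o}}
$$

Under these relations, if $R_o$ and $\Delta f$ stay constant, $Q_{TS}$ scales linearly with $f_s$.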

Consumption Attribute Value Estimation of Digital Music Contents Service by Conjoint Analysis (컨조인트 분석을 통한 디지털 음악콘텐츠 서비스의 소비 속성별 가치 추정)

Shin, Dong-Myoung;Kim, Bo-Young
- The Journal of the Korea Contents Association / v.14 no.12 / pp.924-934 / 2014
Over the last 10 years, the digital music content market has grown rapidly. However, digital music content products and services are not managed with product planning and pricing policies that consider customer attitudes and the value of digital music content. This study defines the value attributes of streaming- and download-based digital music content services as genre, price, sound quality, and usage appliance, and suggests a strategic market price and service composition based on customer attitudes toward these attributes. The research used conjoint analysis based on the hedonic price model and analyzed 405 questionnaires collected from users of Korean digital music content services. The results show that 'sound quality' on the download platform and 'appliance' on the streaming platform were the elements that determined customer attitude. They indicate that music content producers and companies have to provide differentiated services and prices according to the value attributes that users prefer in the market.
https://doi.org/10.5392/JKCA.2014.14.12.924

Measurement and Control of Abnormal Sound for Refrigerator (냉장고의 이상소음에 관한 사례연구)

주재만;김중래;이동현
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference / 2001.05a / pp.380-384 / 2001
The sound quality of household refrigerator noise, which is closely related to the residential environment, can hardly be evaluated using the Korean Standards. The radiated compressor noise consists of tonal noise in the low-frequency range and/or narrow-band noise in the high-frequency range. In this study, a measuring method for detecting abnormal, low-level noise in the high-frequency band is presented, and a control method for its reduction is proposed. After installing a wall to simulate living conditions, we determined the main frequency band of concern. The directivity of the high-frequency noise radiated from the compressor was identified through experiment and analysis. By isolating the noise transfer path, a remarkable noise reduction was achieved.

An Efficient Time-Frequency Representation for Parametric-Based Audio Object Coding

Beack, Seung-Kwon;Lee, Tae-Jin;Kim, Min-Je;Kang, Kyeong-Ok
- ETRI Journal / v.33 no.6 / pp.945-948 / 2011
Object-based audio coding can provide new music applications with interactivity. To efficiently compress a large number of target audio objects, a subband-based parametric coding scheme has been adopted for MPEG spatial audio object coding. In this letter, the time-frequency (T/F) subband analysis structure is investigated. A reconfigured T/F structure is also proposed to enhance the generation performance for sound scenes such as 'karaoke' and 'solo' play in interactive music scenarios. The experimental results confirmed that the proposed scheme remarkably improves the SNR and sound quality.
https://doi.org/10.4218/etrij.11.0211.0007

The prediction of train interior noise with Statistical Energy Analysis (통계적 에너지 해석법을 이용한 전동차의 실내소음 분석)

Lee, Seoung-Woo;Kim, Jae-Chul;Lee, Dong-Hoon;Choo, Don-Ho
- Proceedings of the KSR Conference / 2008.11b / pp.1413-1419 / 2008
As improving service quality becomes an important issue, the interior noise level of a train is an important factor in ride comfort. To reduce the interior noise level, the noise sources of the train need to be removed. However, for a large-scale structure with multiple noise sources, the influence of the major noise sources must be estimated and their transmission paths identified. At present, to improve interior noise reduction, the sound transmission loss of the train body is usually considered prior to manufacturing. In this study, the sound transmission loss of the body of a new-model train for Seoul Metro Line 2, currently under test operation, is measured, and the train body is modeled. The train interior noise is then predicted using the measured values.

Sound Signal Analysis Using the Time-Frequency Representations (시주파수 표현법을 이용한 소리신호의 분석)

Iem, Byeong-Gwan
- Journal of IKEEE / v.23 no.3 / pp.893-898 / 2019
Time-frequency representations are methods for displaying the magnitude or energy density of a signal on the two-dimensional plane of time and frequency. They are useful for analyzing the characteristics of time-varying signals. Music is a typical time-varying signal, and it can be analyzed by time-frequency representations. Recently, it has become popular to change the sound quality by attaching a safety sounder to an instrument. This is done to improve subjective perception at little cost by modifying the sound quality. In the time domain, it is difficult to notice the difference between music signals with and without the sounder, but the difference is easy to find in the frequency domain or the time-frequency domain. In this paper, the music signal from a flute with a sounder is analyzed in both the frequency domain and the time-frequency domain. It is confirmed that the frequency components in the mid-frequency range of 500~2500 are reinforced.
https://doi.org/10.7471/ikeee.2019.23.3.893
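The frequency-domain comparison described in this abstract can be illustrated with a band-energy ratio. This is a hedged sketch, not the paper's method: the abstract gives the range "500~2500" without units, so Hz is assumed here, and the test signals, the naive DFT, and the function name `band_energy_ratio` are hypothetical.

```python
import cmath
import math


def band_energy_ratio(signal, sample_rate, f_lo=500.0, f_hi=2500.0):
    """Fraction of spectral energy (DC excluded) that falls within [f_lo, f_hi]."""
    n = len(signal)
    total = 0.0
    in_band = 0.0
    for k in range(1, n // 2 + 1):
        # Naive DFT coefficient for bin k; an FFT would be used in practice
        coeff = sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n)
                    for t in range(n))
        energy = abs(coeff) ** 2
        freq = k * sample_rate / n
        total += energy
        if f_lo <= freq <= f_hi:
            in_band += energy
    return in_band / total


sample_rate = 8000
n = 400  # 20 Hz bin spacing, so 300 Hz and 1500 Hz fall exactly on bins


def synth(partial_gain):
    """Hypothetical signal: 300 Hz fundamental plus a 1500 Hz partial."""
    return [math.sin(2 * math.pi * 300 * t / sample_rate)
            + partial_gain * math.sin(2 * math.pi * 1500 * t / sample_rate)
            for t in range(n)]


ratio_plain = band_energy_ratio(synth(0.2), sample_rate)
ratio_modified = band_energy_ratio(synth(0.6), sample_rate)
```

A larger ratio for the modified signal indicates more energy in the mid band, which is the kind of reinforcement the abstract reports. Applying the same measure to successive short frames would give a simple time-frequency view.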

