Search | Korea Science

An Analysis of Acoustic Features Caused by Articulatory Changes for Korean Distant-Talking Speech

Kim Sunhee;Park Soyoung;Yoo Chang D.
- The Journal of the Acoustical Society of Korea
- /
- v.24 no.2E
- /
- pp.71-76
- /
- 2005
Compared to normal speech, distant-talking speech is characterized by the acoustic effect due to interfering sound and echoes as well as articulatory changes resulting from the speaker's effort to be more intelligible. In this paper, the acoustic features for distant-talking speech due to the articulatory changes will be analyzed and compared with those of the Lombard effect. In order to examine the effect of different distances and articulatory changes, speech recognition experiments were conducted for normal speech as well as distant-talking speech at different distances using HTK. The speech data used in this study consist of 4500 distant-talking utterances and 4500 normal utterances of 90 speakers (56 males and 34 females). Acoustic features selected for the analysis were duration, formants (F1 and F2), fundamental frequency, total energy and energy distribution. The results show that the acoustic-phonetic features for distant-talking speech correspond mostly to those of Lombard speech, in that the main resulting acoustic changes between normal and distant-talking speech are the increase in vowel duration, the shift in first and second formant, the increase in fundamental frequency, the increase in total energy and the shift in energy from low frequency band to middle or high bands.
PDF KSCI

Sound Synthesis of Gayageum by Impulse Responses of Body and Anjok (안족과 몸통의 임펄스 응답을 이용한 가야금 사운드 합성)

Cho Sang-Jin;Choi Gin-Kyu;Chong Ui-Pil
- Journal of the Institute of Convergence Signal Processing
- /
- v.7 no.3
- /
- pp.102-107
- /
- 2006
In this paper, we propose a method of a sound synthesis of Korean plucked string instrument, gayageum, by physical modeling which use impulse responses of body and Anjok. Gayageum consists of three kinds of systems: string, body, and Anjok. These are a serial combination of linear time invariant systems. String can be modeled by digital delay line. Body and Anjok can be estimated by their impulse responses. We found three resonance frequencies in the body impulse response, and implemented resonator as body. Anjok was implemented as high pass filter in fundamental frequency band of gayageum. RMSEs of synthesized sounds are distributed from 0.01 to 0.03. It was difficult to distinguish the resulting synthesized sounds from the originals sound by ear.
PDF

합성음성 경보의 주관적 위급도에 관한 연구

박경수;장필식
- Proceedings of the ESK Conference
- /
- 1996.10a
- /
- pp.191-196
- /
- 1996
This paper presents an experimental study of the relationship between sound parameters of synthesized voice warning and perceived(psychoacoustic) urgency. Eighteen subjects participated in two experimental sessions to evaluate and quqntify the effects of the voice parameters. Experiments showed that speech rate, fundamental frequency and voice types have clear and consistent effect on perceived urgency. The results of these experiments can be applied to the improvement of existing auditory warning systems and the design of new systems.
PDF

A Basic Study on the System of Converting Color Image into Sound (컬러이미지-소리 변환 시스템에 관한 기초연구)

Kim, Sung-Ill;Jung, Jin-Seung
- Journal of the Korean Institute of Intelligent Systems
- /
- v.20 no.2
- /
- pp.251-256
- /
- 2010
This paper aims for developing the intelligent robot emulating human synesthetic skills which associate a color image with sound, so that we are able to build an application system based on the principle of mutual conversion between color image and sound. As the first step, in this study, we have tried to realize a basic system using the color image to sound conversion. This study describes a new conversion method to convert color image into sound, based on the likelihood in the physical frequency information between light and sound. In addition, we present the method of converting color image into sound using color model conversion as well as histograms in the converted color model. In the basis of the method proposed in this study, we built a basic system using Microsoft Visual C++(ver. 6.0). The simulation results revealed that the hue, saturation and intensity elements of a input color image were converted into F0, harmonic and octave elements of a sound, respectively. The converted sound elements were synthesized to generate a sound source with WAV file format using Csound toolkit.
https://doi.org/10.5391/JKIIS.2010.20.2.251 인용 PDF KSCI

Acoustic Characteristics of Watermelon for Internal Quality Evaluation (내부품질 판정을 위한 수박의 음파특성)

최동수;최규홍;이강진;이영희;김만수
- Journal of Biosystems Engineering
- /
- v.27 no.1
- /
- pp.59-66
- /
- 2002
The objectives of the study were to analyze the acoustic characteristics related to the internal quality factors of watermelon(Citrulus Vulgaris Schrad). Among the various internal quality factors, only four factors such as ripeness, inside cavity, yellow belt and blood flesh were considered in this study. Relationships between the internal quality factors, the day after fruit set and the day after harvest were also investigated. Test apparatus was the same as the apparatus described in the previous study(Choi et at., 2000). The selected sample was divided into four groups; 69 samples used for ripeness tests 56 samples for ripeness test along the day after fruit set and for yellow belt detection, 60 samples for ripeness along the day after harvest 44 samples fur blood flesh detection. It was shown that the first peak frequencies shifted to the lower range and the energy ratios of the bandwidths between 0∼550 Hz to the bandwidths between 850∼2500 Hz increased as the day after fruit set elapsed. Since the acoustic responses of the watermelon such as frequency and magnitude began to change from 10 days after harvest, the storage period of watermelon in a normal temperature condition seemed to be approximately 10 days after harvest. The ratios of the first peak amplitude to the maximum peak amplitude fur the sound watermelon showed the higher value than that fur watermelon with cavity inside, and the separation between the sound and cavity inside could be accomplished by the ratio value of 0.25. The energy ratios (0∼550 Hz/850∼2,500 Hz) for the watermelon with cavity inside showed the higher value than 2.3. The frequency characteristics of the yellow belt watermelon appeared mostly in the range of 600∼900 Hz frequencies. The yellow belt watermelon showing the energy spectral density function at this frequency range to be over 70 seemed to be not a marketable commodity, The energy ratios(0∼550 Hz/850∼2,500 Hz) for the blood flesh watermelon showed the higher value than 3.5.
https://doi.org/10.5307/JBE.2002.27.1.059 인용 PDF KSCI

A Study on Acoustic Radiation Optimization of Vibrating Panel Using Genetic Algorithm (유전자 알고리즘을 이용한 판넬구조물의 구조음향 최적화에 관한 연구)

Jeon, Jin-Young
- Journal of Advanced Marine Engineering and Technology
- /
- v.33 no.1
- /
- pp.19-27
- /
- 2009
Globally, customer appreciation and demand for quieter products has driven noise control engineers to develop efficient and quieter products in a relatively short time. In the vehicles and ship industry, noise has become an important attribute because of the competitive market and increasing customer awareness. Noise reduction is often achieved through structural modifications by typical approaches. In the present paper, author describes a fundamental study on optimum design of curvature. Bezier curve. and rib attachment to reduce noise from simple panel using a genetic algorithm(GA). The acoustic optimization procedure employed p-FEM for structural analysis, the Rayleigh integral method for acoustic analysis and the GA for searching optimum design. In the optimization procedure. the objective function to be minimized is the average sound power radiated from an objective structure over a given frequency range $0{\sim}300$ Hz.
https://doi.org/10.5916/jkosme.2009.33.1.19 인용 PDF KSCI

The Study of the improvement of the sound quality using the target profile of combustion pressure (목적 연소압 형상을 이용한 음질 개선에 관한 연구)

Hwang, C.K.;Min, B.D.;Kim, I.S.
- Proceedings of the Korean Society for Noise and Vibration Engineering Conference
- /
- 2006.11a
- /
- pp.649-653
- /
- 2006
Engine Noise is composed of the mechanical and combustion noise. The contribution of combustion noise is generally bigger than the contribution of the mechanical noise at idle condition in DI diesel engine. That noise usually makes a roughness problem at the fundamental engine order. It is difficult to remove the modulation frequency so we have to directly reduce the combustion noise. The key effect of combustion noise reducing solution is the modification of the combustion pressure profile. It is accomplished by the multiple injection method and we solved the 400Hz combustion noise and improved the sound quality at idle condition in DI diesel engine.
PDF

The acoustic cue-weighting and the L2 production-perception link: A case of English-speaking adults' learning of Korean stops

Kong, Eun Jong;Kang, Soyoung;Seo, Misun
- Phonetics and Speech Sciences
- /
- v.14 no.3
- /
- pp.1-9
- /
- 2022
The current study examined English-speaking adult learners' production and perception of L2 Korean stops (/t/ or /t'/ or /t^h/) to investigate whether the two modalities are linked in utilizing voice onset time (VOT) and fundamental frequency (F0) for the L2 sound distinction and how the learners' L2 proficiency mediates the relationship. Twenty-two English-speaking learners of Korean living in Seoul participated in the word-reading task of producing stop-initial words and the identification task of labelling CV stimuli synthesized to vary VOT and F0. Using logistic mixed-effects regression models, we quantified group- and individual-level weights of the VOT and F0 cues in differentiating the tense-lax, lax-aspirated, and tense-aspirated stops in Korean. The results showed that the learners as a group relied on VOT more than F0 both in production and perception (except the tense-lax pair), reflecting the dominant role of VOT in their L1 stop distinction. Individual-level analyses further revealed that the learners' L2 proficiency was related to their use of F0 in L2 production and their use of VOT in L2 perception. With this effect of L2 proficiency controlled in the partial correlation tests, we found a significant correlation between production and perception in using VOT and F0 for the lax-aspirated stop contrast. However, the same correlation was absent for the other stop pairs. We discuss a contrast-specific role of acoustic cues to address the non-uniform patterns of the production-perception link in the L2 sound learning context.
https://doi.org/10.13064/KSSS.2022.14.3.001 인용 PDF KSCI

A Study on the Small Size Loudspeaker for Hi-Fi Low Frequency Sound Reproduction (저음재생용 소형 스피커의 개발에 관한 연구)

남경준;이채봉;김천덕
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.8
- /
- pp.31-37
- /
- 2001
Following the recent trends of reducing the size of multimedia devices, we tried for the development of a compact-sized speaker to produce low-frequency sounds efficiently. For this work, equivalent-circuit analysis was used to get fundamental resonant frequency and then the structure of speaker components has been changed appropriately. As a result, an 80mm small-sized speaker was developed. The performance test showed that the resonant frequency of our system is 79 Hz while that of numerical analysis was 81Hz. At a distance of 1m from our speaker, the frequency ranges 80 Hz to 15kHz and the average sound pressure was found to be 84±2 dB. The second (at 400 Hz) and the third (at 100 Hz) high-frequency distortions of our system were 0.5% and 1.8% respectively, which is to be compared with the distortions of 0.9% and 6% in conventional speakers.
PDF

Aerodynamic Characteristics, Vocal Efficiency, and Closed Quotient Differences according to Fundamental Frequency Fixation (음도 고정 유무에 따른 공기역학, 음성효율성 및 성대접촉률 차이)

Kim, Jaeock
- Phonetics and Speech Sciences
- /
- v.5 no.1
- /
- pp.19-26
- /
- 2013
The aerodynamic characteristics (subglottal pressure (Ps) and mean airflow rate (MFR)), fundamental frequency (Fo), intensity (I), vocal efficiency (VE), and closed quotient (CQ) were compared during a sustained vowel /o/ sound under three conditions: in a comfortable loudness and pitch level (condition 1), in a maximum loudness level with a fixed pitch (condition 2), and in a maximum loudness level without a fixed pitch (condition 3). Also, multiple regression analyses were done to measure the aerodynamic characteristics affect on the VE and the CQ in each condition. The results showed the Fo, Ps, MFR, VE, and CQ increased as I increased with and without fixed pitch. Most notably, VE in condition 3 was the highest of all the conditions, but CQ was not very high. By the results of multiple regression analysis, VE was significantly affected by I and Ps in all conditions; Fo was the other main key for affecting VE in high pitch. However, none of the aerodynamic characteristics significantly affected CQ. As I increases, Fo should be increased by increasing Ps and VE. Therefore, researchers should consider and specify an a priori to Fo, Ps, and I when measuring VE to examine the complex and delicate vocal mechanism.
https://doi.org/10.13064/KSSS.2013.5.1.019 인용 PDF

Search Result 101, Processing Time 0.023 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)