• Title/Summary/Keyword: 주관적 음향평가

Search Result 166, Processing Time 0.026 seconds

Diagnosis and Evaluation of Humanities Therapy: The Phonetic Analysis of Speech Rates and Fundamental Frequency According to Preferred Sensation Type (인문치료의 진단 및 평가: 감각유형에 따른 말속도와 기본주파수의 실험음성학적 분석)

  • Lee, Chan-Jong;Heo, Yun-Ju
    • The Journal of the Acoustical Society of Korea
    • /
    • v.30 no.4
    • /
    • pp.231-237
    • /
    • 2011
  • The purpose of this study is to examine the correlation between the preferred sensation type and speech sounds, especially on $F_0$ and the speech rates. Data for the sensation types and speech sounds were collected from 36 undergraduate and graduate students (17 male, 19 female). Subjects were asked to read a given text (400 syllables), describe a drawing, and give answers to some questions. We measured speakers' $F_0$ and speech rates. The results show that type V (Visual) has the correlation with the speech rates when type D (Digital) was ruled out, and type A (Auditory) has the correlation with the speech rates when type D was included. Furthermore, the analysis of the mean values of V, A, K (Visual, Auditory, Kinethetic) indicates that type V is characterized with faster speech rates and higher $F_0$ in all parts except for interview and the same is true for that of V, A, K, D (Visual, Auditory, Kinethetic, Digital) in all parts. In conclusion, this study proved that the preferred sensation type has the correlation with $F_0$ and speech rates. Based on the results of this study, $F_0$ and speech rates can be used to analyze the sensation types for individualized education as well as consultation. In addition, this study has great significance in that it lays a foundation for the study on the correlation between a preferred sensation type and speech sounds.

A Real Time 6 DoF Spatial Audio Rendering System based on MPEG-I AEP (MPEG-I AEP 기반 실시간 6 자유도 공간음향 렌더링 시스템)

  • Kyeongok Kang;Jae-hyoun Yoo;Daeyoung Jang;Yong Ju Lee;Taejin Lee
    • Journal of Broadcast Engineering
    • /
    • v.28 no.2
    • /
    • pp.213-229
    • /
    • 2023
  • In this paper, we introduce a spatial sound rendering system that provides 6DoF spatial sound in real time in response to the movement of a listener located in a virtual environment. This system was implemented using MPEG-I AEP as a development environment for the CfP response of MPEG-I Immersive Audio and consists of an encoder and a renderer including a decoder. The encoder serves to offline encode metadata such as the spatial audio parameters of the virtual space scene included in EIF and the directivity information of the sound source provided in the SOFA file and deliver them to the bitstream. The renderer receives the transmitted bitstream and performs 6DoF spatial sound rendering in real time according to the position of the listener. The main spatial sound processing technologies applied to the rendering system include sound source effect and obstacle effect, and other ones for the system processing include Doppler effect, sound field effect and etc. The results of self-subjective evaluation of the developed system are introduced.

Experimental Study on Subjective Sound Quality Evaluation of Vehicle Noises (승용차소음의 주관적 음질평가 실험연구)

  • Choe, Byongho
    • Transactions of the Korean Society for Noise and Vibration Engineering
    • /
    • v.14 no.12
    • /
    • pp.1223-1232
    • /
    • 2004
  • This study is directed toward determining the number and characteristics of psychologically meaningful perceptual dimensions required for assessing the sound quality with respect to vehicle noises, and toward identifying the acoustical and/or psychoacoustical bases underlying the preference and similarity judgments. For the purpose of analyzing the paired comparison data produced by subjective ratings we used nonmetric multidimensional scaling(MDS). The perceptual dimensions based upon preference ratings could explain 76.3 % of the variance by maximum dB(A) and sharpness acum. The correlation between objective and subjective positions of the stimuli is $R^2$=0.97(F(1,13)=195.45, p < .01), corrected $R^2$=0.93. The less the intensity of the stimulus the more becomes the subjective Position would be over-estimated relative to the objective one. The same is valid for the opposite case. The perceptual dimensions based upon similarity judgments could be accounted for 47.8 % and 23.5% of the variance, each of which might be a match for the maximum dB(A) and the sharpness acum, respectively. The correlation between objective and subjective positions of the stimuli is $R^2$=0.94(F(1,13)=92.38, p < .01), corrected $R^2$=0.87. The more the intensity of the stimulus the more becomes the subjective position would be over-estimated relative to the objective one. The same is valid for the opposite case. In other words, it is likely that the larger the amount of two stimuli which to compare would be judged similar. So far it should be further clarified that whether the relationship between preference ratings and psychological distances nay be optimized through which psycho-physical models.

Variation of heart rate during listening to music (음악 청취 시 정서적 특성에 따른 심박수 변화)

  • Jiyun Han;Soojin Kang;Junghwan Moon;Kyung Myun Lee;Jihwan Woo
    • The Journal of the Acoustical Society of Korea
    • /
    • v.43 no.5
    • /
    • pp.536-546
    • /
    • 2024
  • Music has a close connection with human emotions, and this relationship has been explored in various fields. Recently, research has been attempted to quantify these subjective emotions based on biosignals such as brain signals. However, emotional changes when listening to music, as measured by heart rate, which can be easily measured in daily life, are not sufficiently known. In this study, we investigated how changing emotions are expressed through variations in heart rate during music listening. The electrocardiogram (ECG) was measured while participants listened to music, and the emotional characteristics of preference, familiarity, arousal, and valence after listening were evaluated using Likert scale scores to analyze the correlation between changes in heart rate and emotional characteristics. The study confirmed that smaller changes in heart rate were associated with lower preference, higher arousal, and more negative emotional valence, while larger heart rate differences were associated with higher preference, lower arousal, and more positive emotional valence. This study demonstrates that heart rate can be used to objectively predict emotional changes due to music listening, and it is expected to have applications in various music-related industries in the future.

Performance analysis of subjective Loudness meter with ITU-R BS. 1387-1 algorithm for digital audio (디지털 오디오 주관적 음향레벨 계측기 구현을 위한 ITU-R BS. 1387-1의 알고리즘 특성 분석)

  • Ngan, Nguyen Vo Bao;Park, Seonggyoon;Ro, Soonghwan;Han, Chankyu
    • Journal of IKEEE
    • /
    • v.16 no.4
    • /
    • pp.395-404
    • /
    • 2012
  • In this paper, the perceived loudness metering algorithm based on ITU-R BS.1387-1 was investigated and implemented, and its performance was evaluated by applying to 23 pure tones and 9 digital audio samples. Error of the tone test results compared with ISO226:2003 was below 5%, and sample test results, in comparison with Moore's algorithm, showed deviation of less than 4.7% and correlation of 0.96. On the other hand, it was investigated how the implemented algorithm's performance was subject to auditory pitch scale. Its result showed that the algorithm with 37 auditory filters, through correcting a bias effect, has a good performance of less than 2% in comparison with the one with 109 auditory filters.

Corpus-based Korean Text-to-speech Conversion System (콜퍼스에 기반한 한국어 문장/음성변환 시스템)

  • Kim, Sang-hun; Park, Jun;Lee, Young-jik
    • The Journal of the Acoustical Society of Korea
    • /
    • v.20 no.3
    • /
    • pp.24-33
    • /
    • 2001
  • this paper describes a baseline for an implementation of a corpus-based Korean TTS system. The conventional TTS systems using small-sized speech still generate machine-like synthetic speech. To overcome this problem we introduce the corpus-based TTS system which enables to generate natural synthetic speech without prosodic modifications. The corpus should be composed of a natural prosody of source speech and multiple instances of synthesis units. To make a phone level synthesis unit, we train a speech recognizer with the target speech, and then perform an automatic phoneme segmentation. We also detect the fine pitch period using Laryngo graph signals, which is used for prosodic feature extraction. For break strength allocation, 4 levels of break indices are decided as pause length and also attached to phones to reflect prosodic variations in phrase boundaries. To predict the break strength on texts, we utilize the statistical information of POS (Part-of-Speech) sequences. The best triphone sequences are selected by Viterbi search considering the minimization of accumulative Euclidean distance of concatenating distortion. To get high quality synthesis speech applicable to commercial purpose, we introduce a domain specific database. By adding domain specific database to general domain database, we can greatly improve the quality of synthetic speech on specific domain. From the subjective evaluation, the new Korean corpus-based TTS system shows better naturalness than the conventional demisyllable-based one.

  • PDF

A Study on the Performance Analysis and Improvement of the Book Sharing Regular Delivery Project for Pre-schoolers in Gyeonggi Province (경기도 유아 책꾸러미 정기배송 사업에 대한 성과 분석 및 개선방안 연구)

  • Gum-Sook Hoang;Soo-Kyoung Kim;Sung-Une Yoon
    • Journal of Korean Library and Information Science Society
    • /
    • v.53 no.4
    • /
    • pp.71-100
    • /
    • 2022
  • The purpose of this study is to analyze the performance of the Gyeonggi province book sharing regular delivery project for pre-schoolers in 2021, analyze the problems of the project, and suggest improvement measures to help the preparation of policies for the promotion of reading culture in Gyeonggi province. As a result of the survey, the necessity and satisfaction of the project were found to be very high for both the caregivers and the reading coaches. In particular, in the case of caregivers, the higher the amount of reading after the project, the higher the need for and satisfaction with the project. It was found that they wanted to participate in programs such as regular book delivery. Even in the convergence of expert opinions, it was evaluated that there was a positive change in the perception of parenting by reading by caregivers, and that the necessity and satisfaction of this project were very high due to the development of infant-parent interaction. However, in the self-evaluation of the project manager and the implementing organization, it was evaluated that better results could be achieved only when the project was carried out through thorough planning and preparation in advance. Through this, for the continuity of Gyeonggi province regular delivery of children's book packages, improvement plans were presented and Gyeonggi-do's reading culture promotion policy was proposed.

Audio Contents Adaptation Technology According to User′s Preference on Sound Fields (사용자의 음장선호도에 따른 오디오 콘텐츠 적응 기술)

  • 강경옥;홍재근;서정일
    • The Journal of the Acoustical Society of Korea
    • /
    • v.23 no.6
    • /
    • pp.437-445
    • /
    • 2004
  • In this paper. we describe a novel method for transforming audio contents according to user's preference on sound field. Sound field effect technologies. which transform or simulate acoustic environments as user's preference, are very important for enlarging the reality of acoustic scene. However huge amount of computational power is required to process sound field effect in real time. so it is hard to implement this functionality at the portable audio devices such as MP3 player. In this paper, we propose an efficient method for providing sound field effect to audio contents independent of terminal's computational power through processing this functionality at the server using user's sound field preference, which is transfered from terminal side. To describe sound field preference, user can use perceptual acoustic parameters as well as the URI address of room impulse response signal. In addition, a novel fast convolution method is presented to implement a sound field effect engine as a result of convoluting with a room impulse response signal at the realtime application. and verified to be applicable to real-time applications through experiments. To verify the evidence of benefit of proposed method we performed two subjective listening tests about sound field descrimitive ability and preference on sound field processed sounds. The results showed that the proposed sound field preference can be applicable to the public.

Preferred masking levels of water sounds according to various noise background levels in small scale open plan offices (소규모 개방형 사무실 배경 소음 레벨에 따른 최적 물소리 마스킹 레벨)

  • Tae-Hui Kim;Sang-Hyeon Lee;Chae-Hyun Yoon;Hyo-Won Sim;Joo-Young Hong
    • The Journal of the Acoustical Society of Korea
    • /
    • v.42 no.6
    • /
    • pp.617-626
    • /
    • 2023
  • This study aims to investigate the preferred sound level of water sound for various levels of open-plan-office noise regarding soundscape quality and speech privacy. And assessment of the work efficiency of the water sound. For the laboratory experiment, office noise was recorded using a binaural microphone in a real open-plan office. For the assessment of the soundscape quality and speech privacy, Overall Soundscape Quality (OSQ) and Listening Difficulty (LD) were evaluated under three different sound levels (55 dBA, 60 dBA, and 65 dBA) and five different signal-to-noise ratios (SNR -10 dB, -5 dB, 0 dB, +5 dB, and +10 dB). After the evaluation, the preferred SNR was proposed according to OSQ and LD. For the assessment of to work efficiency of water sound, this study evaluated the cognitive performance of both of the condition noise only and combine the water sound with office noise. The results showed that LD increased as the water sound level increased, but OSQ decreased. When the water sound level was more than the office noise level, the OSQ decreased from noise only. Therefore, considering OSQ and LD, the preferred SNR of water sound was -5 dB for all noise levels. At the preferred level of water sound, the cognitive performance results were shown to decrease at 55 dBA compared to noise only, but at 60 dBA and 65 dBA combine the water sound results were increased than the noise only.

Audio Quality Enhancement at a Low-bit Rate Perceptual Audio Coding (저비트율로 압축된 오디오의 음질 개선 방법)

  • 서정일;서진수;홍진우;강경옥
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.6
    • /
    • pp.566-575
    • /
    • 2002
  • Low-titrate audio coding enables a number of Internet and mobile multimedia streaming service more efficiently. For the help of next-generation mobile telephone technologies and digital audio/video compression algorithm, we can enjoy the real-time multimedia contents on our mobile devices (cellular phone, PDA notebook, etc). But the limited available bandwidth of mobile communication network prohibits transmitting high-qualify AV contents. In addition, most bandwidth is assigned to transmit video contents. In this paper, we design a novel and simple method for reproducing high frequency components. The spectrum of high frequency components, which are lost by down-sampling, are modeled by the energy rate with low frequency band in Bark scale, and these values are multiplexed with conventional coded bitstream. At the decoder side, the high frequency components are reconstructed by duplicating with low frequency band spectrum at a rate of decoded energy rates. As a result of segmental SNR and MOS test, we convinced that our proposed method enhances the subjective sound quality only 10%∼20% additional bits. In addition, this proposed method can apply all kinds of frequency domain audio compression algorithms, such as MPEG-1/2, AAC, AC-3, and etc.