• Title/Summary/Keyword: Sound perception

Search Result 269, Processing Time 0.024 seconds

Aurally Relevant Analysis by Synthesis - VIPER a New Approach to Sound Design -

  • Daniel, Peter;Pischedda, Patrice
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2003.05a
    • /
    • pp.1009-1009
    • /
    • 2003
  • VIPER a new tool for the VIsual PERception of sound quality and for sound design will be presented. Requirement for the visualization of sound quality is a signal analysis modeling the information processing of the ear. The first step of the signal processing implemented in VIPER, calculates an auditory spectrogram by a filter bank adapted to the time- and frequency resolution of the human ear. The second step removes redundant information by extracting time- and frequency contours from the auditory spectrogram in analogy to contours of the visual system. In a third step contours and/or auditory spectrogram can be resynthesised confirming that only aurally relevant information were extracted. The visualization of the contours in VIPER allows intuitively to grasp the important components of a signal. Contributions of parts of a signal to the overall quality can be easily auralized by editing and resynthesising the contours or the underlying auditory spectrogram. Resynthesis of time contours alone allows e.g. to auralize impulsive components separately from the tonal components. Further processing of the contours determines tonal parts in form of tracks. Audible differences between two versions of a sound can be visually inspected in VIPER through the help of auditory distance spectrograms. Applications are shown for the sound design of several interior noises of cars.

  • PDF

Evaluation on the Field Application of Spontaneous Acoustic Field Reproduction System (능동형 음장조정시스템의 현장적용 평가)

  • Jeon, Ji-Hyeon;Shin, Yong-Gyu;Kang, Sang-Woo;Min, Byeong-Cheol;Kook, Chan
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2006.11a
    • /
    • pp.616-621
    • /
    • 2006
  • A began of this study is to verify Spontaneous Acoustic Field Reproduction System(SAFRS), developed as an embodiment of creating agreeable sound environment, with evaluation on the field application. SAFRS is a system to sense changes of surroundings and produce sounds, which can go well with environment elements sensed by the system in to the space. The sound which can go well with environment elements is sound which judged by individual evaluation to be so, the classification of the preferred sounds according to the mood of the space was suggested in the former study. So, SAFRS was applied into the Square of D University to evaluate effectiveness of the system. The executed evaluations were 1) evaluation on sounds perception, frequency, volume and matchability with the space, 2) image evaluation on the square and sound environment and 3) evaluation on sound environment with existing sounds, fountains sound, sound produced by SAFRS, and both fountains sound and sound produced by SAFRS. Verifying SAPRS of field application was deduced from those evaluations. Theresultsofthestudyarefollowing: Though the system was applied into the space, the volume of the sounds shouldn't be too high. And with visual surroundings, the effectiveness of the system would be increased. At the results of four evaluations, the result of day evaluation is; both fountains sound and sound produced by SAFRS>fountains sound>sound produced by SAFRS>existing sounds, the result of night evaluation is; sound produced by SAFRS>both fountains sound and sound produced by SAFRS>fountains sound>existing sounds and these results pointed out that sounds environment produced by the system was highly evaluated due to less background sounds.

  • PDF

Sensibility Evaluation of Internet Shoppers with the Sportswear Rustling Sounds (스포츠의류 마찰음 정보 제공에 따른 인터넷 구매자의 감성평가)

  • Baek, Gyeong-Rang;Jo, Gil-Su
    • Proceedings of the Korean Society for Emotion and Sensibility Conference
    • /
    • 2009.05a
    • /
    • pp.177-180
    • /
    • 2009
  • This study investigates the perception of different fabrics by consumers when provided with a video clip with rustling sounds of the fabric. We utilized sportswear products that are currently on the market and evaluated the emotional response of internet shoppers by measuring the physiological and psychological responses. Three kinds of vapor-permeable water-repellent fabric were selected to generate video clips each containing the fabric rustling sound and images of exercise activities wearing the sportswear made of the respective fabric. The new experimental website contained the video clips and was compared with the original website which served as a control. 30 subjects, who had experience to buy clothing online, took part in the physiological and psychological response to the video clip. Electroen-cephalography (EEG) was used to measure the physiological response while the psychological response consisted of evaluating accurate perception of the fabric, satisfaction, and consumer interest. When we offered video clips with fabric's rustling sound on the website, subjects answered they could get more accurate and rapid information to decide to purchase the products than otherwise they do the shopping without such information. However, such rustling sounds somewhat annoy customers, as proved psychological and physiological response. Our study is a critical step in evaluating the consumer's emotional response to sportswear fabric which will promote selling frequency, reduce the return rate and aid development of new sportswear fabric further evolution of the industry.

  • PDF

A Perceptually Motivated Active Noise Control Design and Its Psychoacoustic Analysis

  • Bao, Hua;Panahi, Issa M.S.
    • ETRI Journal
    • /
    • v.35 no.5
    • /
    • pp.859-868
    • /
    • 2013
  • The active noise control (ANC) technique attenuates acoustic noise in a flexible and effective way. Traditional ANC design aims to minimize the residual noise energy, which is indiscriminative in the frequency domain. However, human hearing perception exhibits selective sensitivity for different frequency ranges. In this paper, we aim to improve the noise attenuation performance in perceptual perspective by incorporating noise weighting into ANC design. We also introduce psychoacoustic analysis to evaluate the sound quality of the residual noise by using a predictive pleasantness model, which combines four psychoacoustic parameters: loudness, sharpness, roughness, and tonality. Simulations on synthetic random noise and realistic noise show that our method improves the sound quality and that ITU-R 468 noise weighting even performs better than A-weighting.

A Study on the Perception of Syllable Structure on Korean and Chinese in Korean-Chinese and Chinese Students (조선족과 한족 대학생의 중국어, 한국어의 음절구조 지각에 대한 연구)

  • Yoon, H.K.;Park, H.C.
    • Speech Sciences
    • /
    • v.12 no.2
    • /
    • pp.43-56
    • /
    • 2005
  • This study was analyzed the syllable structure of Korean and Chinese using subjective sound similarity judgement tasks with both Korean-chinese and Chinese students in Shenyang, China. 86 college students were administered with the tasks which lasted about 20 min. in a small group setting. Both Korean-chinese and Chinese students showed the sensitivity for the CV sub-syllabic unit and the CV+C was the building block for phonetic representation for both languages. This syllabic similarity of Korean and Chinese may be a help for Korean-chinese to become horizontal bilinguals. Further studies are needed to specify the mechanism that will explain the syllabic perception of CV+C in both Korean and Chinese which was different from that of C+VC structure dominance in English.

  • PDF

Clinical utility of auditory perceptual assessments in the discrimination of a diplophonic voice (이중음성 판별에 있어 청지각적 평가의 임상적 유용성)

  • Bae, Inho;Kwon, Soonbok
    • Phonetics and Speech Sciences
    • /
    • v.10 no.1
    • /
    • pp.75-81
    • /
    • 2018
  • Diplophonia is generally defined as the perception of more than one fundamental frequency component in a voice. Its perceptual aspect has traditionally been used to evaluate diplophonia because the perceptions can be easily evaluated, but there are limitations in the validity of the reliability of the intra- and inter-raters, examination situation, and variation of voice sample. Therefore, the purpose of this study is to confirm the reliability and accuracy of auditory perceptual evaluation by comparing non-invasive indirect assessment methods (sound waveform and EGG analysis), and to identify their usefulness with diplophonia. A total of 28 diplophonic voices and 39 non-periodic voices were assessed. Three raters assessed the diplophonia by performing an auditory perception evaluation and identifying the quasi-periodic perturbations of the acoustic waveform and EGG. Among the three discrimination methods, intra- and inter-rater reliability, sensitivity, specificity, accuracy, positive likelihood ratio, and negative likelihood ratio were examined, and the McNemar test was performed to compare the discriminant agreement. The accuracy of the auditory perceptual evaluation (86.57%) was not significantly different from that of sound waveform acoustic (88.06%), but it was significantly different from that of EGG (83.33%). The reading time (6.02 s) for the auditory perceptual evaluation was significantly different from that for sound waveform analysis (30.15 s) and EGG analysis (16.41 s). In the discrimination of diplophonia, auditory perceptual evaluation has sufficient reliability and accuracy as compared to sound waveform and EGG. Since immediate feedback is possible, auditory perceptual evaluation is more convenient. Therefore, it can continue to be used as a tool to discriminate diplophonia in clinical practice.

Phonological Status of Korean /w/: Based on the Perception Test

  • Kang, Hyun-Sook
    • Phonetics and Speech Sciences
    • /
    • v.4 no.3
    • /
    • pp.13-23
    • /
    • 2012
  • The sound /w/ has been traditionally regarded as an independent segment in Korean regardless of the phonological contexts in which it occurs. There have been, however, some questions regarding whether it is an independent phoneme in /CwV/ context (cf. Kang 2006). The present pilot study examined how Korean /w/ is realized in $/S^*wV/$ context by performing some perception tests. Our assumption was that if Korean /w/ is a part of the preceding complex consonant like $/C^w/$, it should be more or less uniformly articulated and perceived as such. If /w/ is an independent segment, it will be realized with speaker variability. Experiments I and II examined the identification rates as "labialized" of the spliced original stimuli of $/S^*-V/$ and $/S^{w*}-^wV/$, and the cross-spliced stimuli $/S^{w*}-V/$ and $/S^*-^wV/$. The results showed that round qualities of /w/ are perceived at significantly different temporal point with speaker and context variability. We therefore conclude that /w/ in $/S^*wV/$ context is an independent segment, not a part of the preceding segment. Full-scale examination of the production test in the future should be performed to verify the conclusion we suggested in this paper.

SPATIAL EXPLANATIONS OF SPEECH PERCEPTION: A STUDY OF FRICATIVES

  • Choo, Won;Mark Huckvale
    • Proceedings of the KSPS conference
    • /
    • 1996.10a
    • /
    • pp.399-403
    • /
    • 1996
  • This paper addresses issues of perceptual constancy in speech perception through the use of a spatial metaphor for speech sound identity as opposed to a more conventional characterisation with multiple interacting acoustic cues. This spatial representation leads to a correlation between phonetic, acoustic and auditory analyses of speech sounds which can serve as the basis for a model of speech perception based on the general auditory characteristics of sounds. The correlations between the phonetic, perceptual and auditory spaces of the set of English voiceless fricatives /f $\theta$ s $\int$ h / are investigated. The results show that the perception of fricative segments may be explained in terms of 2-dimensional auditory space in which each segment occupies a region. The dimensions of the space were found to be the frequency of the main spectral peak and the 'peakiness' of spectra. These results support the view that perception of a segment is based on its occupancy of a multi-dimensional parameter space. In this way, final perceptual decisions on segments can be postponed until higher level constraints can also be met.

  • PDF

The Dependence of Frequency Change Perception on the Acoustics of a Listening Environment and Its Implication for the Evaluation of Room Acoustics

  • Jeong, Dae-Up
    • The Journal of the Acoustical Society of Korea
    • /
    • v.18 no.4E
    • /
    • pp.10-19
    • /
    • 1999
  • Psychophysical approaches to the room acoustics have been made in number of researches. The present study is looking at whether the listener's perception of frequency change in rooms are dependent on the acoustics of a listening environment. P(C)s for frequency change in short tones were measured in different listening conditions. Two experiments were carried out to investigate the influence of room acoustics on the listener's perception of frequency change, and its implication for evaluating the acoustics of listening environments were examined and discussed. It was found that the temporal and spectral contents of reflections from room surfaces might be an important factor which influenced the listener's perception of frequency change in a reverberant sound field. This implicates that psychophysical approach by measuring listener's frequency change perception might be an useful tool for evaluating room acoustics. However, cares should be taken, since some individual differences were found to exist with respect to the direction of frequency change.

  • PDF

Real-time 3D Audio Downmixing System based on Sound Rendering for the Immersive Sound of Mobile Virtual Reality Applications

  • Hong, Dukki;Kwon, Hyuck-Joo;Kim, Cheong Ghil;Park, Woo-Chan
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.12
    • /
    • pp.5936-5954
    • /
    • 2018
  • Eight out of the top ten the largest technology companies in the world are involved in some way with the coming mobile VR revolution since Facebook acquired Oculus. This trend has allowed the technology related with mobile VR to achieve remarkable growth in both academic and industry. Therefore, the importance of reproducing the acoustic expression for users to experience more realistic is increasing because auditory cues can enhance the perception of the complicated surrounding environment without the visual system in VR. This paper presents a audio downmixing system for auralization based on hardware, a stage of sound rendering pipelines that can reproduce realiy-like sound but requires high computation costs. The proposed system is verified through an FPGA platform with the special focus on hardware architectural designs for low power and real-time. The results show that the proposed system on an FPGA can downmix maximum 5 sources in real-time rate (52 FPS), with 382 mW low power consumptions. Furthermore, the generated 3D sound with the proposed system was verified with satisfactory results of sound quality via the user evaluation.