• Title/Summary/Keyword: auditory model

CNN-based Visual/Auditory Feature Fusion Method with Frame Selection for Classifying Video Events

  • Choe, Giseok;Lee, Seungbin;Nang, Jongho
    • KSII Transactions on Internet and Information Systems (TIIS) / v.13 no.3 / pp.1689-1701 / 2019
  • In recent years, personal videos have been shared online owing to the popularity of portable devices such as smartphones and action cameras. A recent report predicted that 80% of Internet traffic will be video content by the year 2021. Several studies have been conducted on detecting main video events in order to manage large collections of videos. These studies show fairly good performance in certain genres. However, the methods used in previous studies have difficulty detecting events in personal videos, because the characteristics and genres of personal videos vary widely. In our research, we found that adding a dataset with a suitable perspective improved performance. Performance has also been shown to depend on how keyframes are extracted from the video. We selected frame segments that can represent a video, considering the characteristics of personal videos. From each frame segment, object, location, food, and audio features were extracted, and representative vectors were generated through a CNN-based recurrent model and a fusion module. The proposed method achieved a mAP of 78.4% in experiments using LSVC data.

Listening environment design of houses for the seniors aging at home (고령자의 청력손실을 배려한 재가보호 주거시설의 음향설계)

  • Yu-Kyeong Jang;Yang-ki Oh
    • The Journal of the Acoustical Society of Korea / v.43 no.2 / pp.152-161 / 2024
  • Hearing loss is one of the common symptoms of aging, yet as the aging population continues to grow, policies such as home care and welfare housing for the elderly are implemented with a focus on mobility and often overlook it. In this study, our aim is to enhance the quality of life of the elderly by improving the auditory environment within residential spaces, which plays a pivotal role in their overall well-being. We propose a technique that focuses on reducing reverberation, minimizing noise levels, and enhancing sound quality to improve the listening environment for the elderly, and we have verified its effectiveness. Building on this, we have developed an acoustic design model for residential facilities catering to elderly home care.

Correlations of Exogenous and Endogenous Components of Auditory ERPs to Psychometric Measures of Personality (청각 EPR의 내외생적 요소들과 성격의 상관에 관한 연구)

  • Park, Chang-Bum;Lee, Ji-Young;Chi, Sang-Eun;Park, Eun-Hye;Lee, Young-Hyuk;Kim, Hyun-Teak
    • Science of Emotion and Sensibility / v.5 no.4 / pp.59-66 / 2002
  • This exploratory study aimed to understand the biological bases and structures of three personality models: Eysenck's PEN model, Gray's BIS/BAS model, and Costa & McCrae's Five Factor Model, which was chosen as the major descriptive model regardless of its biological bases. In addition, Eysenck's impulsivity scale, IVE, was added to demonstrate the relationship between P and impulsivity. Most previous reports on personality have explored the relationship between P300 and the introversion-extraversion dimension of Eysenck's theory because of its putative biological bases. In the present study, forty-eight undergraduates took four personality batteries (EPQ-R, NEO-PI-R, BIS/BAS, and IVE). Two types of oddball tasks with different stimulus durations were used to induce ERPs (50 ms for task 1, 300 ms for task 2). Topographic distributions of the correlation coefficients between personality traits and ERP components were drawn and considered for a consistent interpretation of the personality model structures. Although all extraversion-equivalent scales of the personality batteries were similar in their intercorrelations, their correlations with P3 amplitudes were dissociated. Eysenck's E might not be the proper psychometric measure for elucidating its biological bases. The present study supported the negative relationship between P3 amplitude and extraversion, which is the consensus of previous studies. Neuroticism and Psychoticism showed correlations with earlier sensory processing components such as N1 and P2. This result might explain why most studies have failed to find biological connections related to them. The interaction between gender and personality traits should be considered when interpreting the correlations. The two auditory stimulus durations differed in their sensitivity to personality traits.

Multisensory based AR System for Education of Cultural Heritage

  • Jeong, Eunsol;Oh, Jeong-eun;Won, Haeyeon;Yu, Jeongmin
    • Journal of the Korea Society of Computer and Information / v.24 no.11 / pp.61-69 / 2019
  • In this paper, we propose a multisensory (i.e., visual-auditory-tactile) AR system for the education of cultural heritage. The proposed system provides multisensory interaction by letting a user interact with a 3D-printed artifact onto which virtual 3D digital heritage content is mapped. Compared with existing cultural heritage education systems based on augmented reality (AR) technology, this system focuses on providing a learning experience not only through sight and hearing but also through touch. Furthermore, since this system mainly provides direct interaction using a 3D-printed model, it gives a higher degree of realism than existing systems that use touch or click motions on the 2D displays of mobile phones and tablets. Based on the results of user testing, we concluded that the proposed system delivered excellent presence and learning flow to users. In particular, in the usability evaluation, a 3D-printed target artifact similar in shape to the original heritage artifact achieved the highest scores among the various tested targets.

Blood-Brain Barrier Disruption in Mild Traumatic Brain Injury Patients with Post-Concussion Syndrome: Evaluation with Region-Based Quantification of Dynamic Contrast-Enhanced MR Imaging Parameters Using Automatic Whole-Brain Segmentation

  • Heera Yoen;Roh-Eul Yoo;Seung Hong Choi;Eunkyung Kim;Byung-Mo Oh;Dongjin Yang;Inpyeong Hwang;Koung Mi Kang;Tae Jin Yun;Ji-hoon Kim;Chul-Ho Sohn
    • Korean Journal of Radiology / v.22 no.1 / pp.118-130 / 2021
  • Objective: This study aimed to investigate the blood-brain barrier (BBB) disruption in mild traumatic brain injury (mTBI) patients with post-concussion syndrome (PCS) using dynamic contrast-enhanced (DCE) magnetic resonance (MR) imaging and automatic whole brain segmentation. Materials and Methods: Forty-two consecutive mTBI patients with PCS who had undergone post-traumatic MR imaging, including DCE MR imaging, between October 2016 and April 2018, and 29 controls with DCE MR imaging were included in this retrospective study. After performing three-dimensional T1-based brain segmentation with FreeSurfer software (Laboratory for Computational Neuroimaging), the mean Ktrans and vp from DCE MR imaging (derived using the Patlak model and extended Tofts and Kermode model) were analyzed in the bilateral cerebral/cerebellar cortex, bilateral cerebral/cerebellar white matter (WM), and brainstem. Ktrans values of the mTBI patients and controls were calculated using both models to identify the model that better reflected the increased permeability owing to mTBI (tendency toward higher Ktrans values in mTBI patients than in controls). The Mann-Whitney U test and Spearman rank correlation test were performed to compare the mean Ktrans and vp between the two groups and correlate Ktrans and vp with neuropsychological tests for mTBI patients. Results: Increased permeability owing to mTBI was observed in the Patlak model but not in the extended Tofts and Kermode model. In the Patlak model, the mean Ktrans in the bilateral cerebral cortex was significantly higher in mTBI patients than in controls (p = 0.042). The mean vp values in the bilateral cerebellar WM and brainstem were significantly lower in mTBI patients than in controls (p = 0.009 and p = 0.011, respectively). The mean Ktrans of the bilateral cerebral cortex was significantly higher in patients with atypical performance in the auditory continuous performance test (commission errors) than in average or good performers (p = 0.041). Conclusion: BBB disruption, as reflected by the increased Ktrans and decreased vp values from the Patlak model, was observed throughout the bilateral cerebral cortex, bilateral cerebellar WM, and brainstem in mTBI patients with PCS.

Evaluation of In-vehicle Warning Information Modalities by Kansei Engineering (감성공학을 이용한 차내 경고정보 제공방식 평가)

  • Park, Jun-Yeong;O, Cheol;Kim, Myeong-Ju;Jang, Myeong-Sun
    • Journal of Korean Society of Transportation / v.28 no.3 / pp.39-49 / 2010
  • The provision of in-vehicle warning information is of keen interest because it can be used effectively to prevent traffic accidents on the road. This study evaluates the effectiveness of information provision modalities based on Kansei engineering. Various warning information scenarios using different modalities are devised for the evaluation. Statistical data analysis techniques, including factor analysis, correlation analysis, and the general linear model, are used to assess users' affect toward the information modalities. The evaluation results show that visual information consisting of 'text and pictogram' leads to higher understandability. The combination of 'beep sound and voice message' was identified as a more effective modality for auditory warning. In addition, users preferred the color red for the blinking warning signal.

Genetical and Pathological Studies on the Mutant Mice as an Animal Model for Deafness Disease

  • Lee, Jeong-Woong;Lee, Eun-Ju;Lee, Hoon-Taek;Chung, Kil-Saeng;Ryoo, Zae-Young
    • Proceedings of the KSAR Conference / 2001.03a / pp.48-48 / 2001
  • A new neurological mutant has been found in the ICR outbred mouse strain. Affected mice display profound deafness together with head-tossing and bidirectional circling behavior, and show an autosomal recessive mode of inheritance. The mutant was therefore named cir/Kr, with the gene symbol cir. Auditory tests clearly identified hearing loss in the cir mice compared with wild-type mice. Pathological studies confirmed developmental defects in the middle ear, cochlea, cochlear nerve, and semicircular canal areas, which were correlated with the abnormal behavior observed in the cir mice. Thus, cir mice may be useful as a model for studying inner ear abnormalities and deafness. We constructed a genetic linkage map by positioning 14 microsatellite markers across the cir region in an intraspecific backcross between cir and C57BL/6J mice. The cir mouse harbors an autosomal recessive mutation on mouse chromosome 9. The cir gene was mapped to a region between D9Mit116 and D9Mit38. The estimated distances between cir and D9Mit116, and between cir and D9Mit38, are 0.7 and 0.2 cM, respectively. The gene order was defined as: centromere-D9Mit182-D9Mit51/D9Mit79/D9Mit310-D9Mit212/D9Mit184-D9Mit116-cir-D9Mit38-D9Mit20-D9Mit243-D9Mit16-D9Mit55/D9Mit125-D9Mit281. The map location of the cir locus in the mouse appears to be in a region homologous to human 3q21. Our present data suggest that the nearest flanking marker, D9Mit38, provides a useful anchor for the isolation of the cir gene in a yeast artificial chromosome contig.

A Novel Modeling Method for Manufacturing Hearing Aid Using 3D Medical Images (3차원 의료영상을 이용한 보청기 제작의 새로운 모델링 방법)

  • Kim, Hyeong-Gyun
    • Journal of radiological science and technology / v.39 no.2 / pp.257-262 / 2016
  • This study aimed to suggest a novel method of modeling a hearing aid ear shell based on Digital Imaging and Communications in Medicine (DICOM) images, for use in manufacturing hearing aid ear shells with a 3D printer. In the experiment, a 3D external auditory meatus was extracted using critical values in the DICOM volume images, and the modeling surface structures were compared in standard STL (STereoLithography) files that can be recognized by a 3D printer. In the conventional 3D modeling method, an ear model is prepared, and the gaps between adjacent isograms produced by a 3D scanner are filled with 3D surface fragments to express the modeling structure. In this study, the same type of triangular surface structure was prepared using the DICOM images. The results showed that the modeling surface structure based on the DICOM images provides the same environment that conventional 3D printers can recognize, ultimately making it possible to print the hearing aid ear shell shape.

Speech Enhancement System Using a Model of Auditory Mechanism (청각기강의 모델을 이용한 음성강조 시스템)

  • 최재승
    • Journal of the Institute of Electronics Engineers of Korea SP / v.41 no.6 / pp.295-302 / 2004
  • In the field of speech processing, the treatment of noise remains an important problem. In particular, background noise is known to cause a remarkable reduction in the speech recognition rate. Examples of background noise include the various non-stationary noises found in real environments, such as the driving noise of automobiles on the road or the typing noise of a printer. Such noise cannot simply be eliminated by a conventional Wiener filter and requires more sophisticated techniques. In this paper, as one such attempt, we present a speech enhancement algorithm that uses a model of mutual inhibition to reduce noise in speech contaminated by white noise or by the background noise mentioned above. Judging from spectral distortion measurements, the proposed algorithm is confirmed to be effective for speech degraded not only by white noise but also by colored noise.

Verification of Automatic PAR Control System using DEVS Formalism (DEVS 형식론을 이용한 공항 PAR 관제 시스템 자동화 방안 검증)

  • Sung, Chang-ho;Koo, Jung;Kim, Tag-Gon;Kim, Ki-Hyung
    • Journal of the Korea Society for Simulation / v.21 no.3 / pp.1-9 / 2012
  • This paper proposes an automatic precision approach radar (PAR) control system using digital signals to increase the safety of aircraft, and the discrete event systems specification (DEVS) methodology is used to verify the proposed system. Traditionally, a landing aircraft is controlled by the human voice of a final approach controller. However, the voice information can be lost during transmission, and pilots may also act improperly because of incorrect auditory signals. The proposed system enables stable operation of the aircraft regardless of the pilot's capability. Communicating DEVS (C-DEVS) is used to analyze and verify the behavior of the proposed system. A composed C-DEVS atomic model has the overall composed discrete state sets of the models, and the state sequence acquired through a full state search is used to verify the safety and liveness of the system behavior. The C-DEVS model of the proposed system shows the same behavior as the traditional PAR control system.