• Title/Summary/Keyword: Audio Analysis

Search Result 536, Processing Time 0.029 seconds

Joint Channel Coding Based on Principal Component Analysis

  • Hyun, Dong-Il;Lee, Dong-Geum;Park, Young-Cheol;Youn, Dae-Hee;Seo, Jeong-Il
    • ETRI Journal
    • /
    • v.32 no.5
    • /
    • pp.831-834
    • /
    • 2010
  • This paper proposes a new joint channel coding algorithm based on principal component analysis. A conventional joint channel coder using passive downmixing undergoes a reduction of both the primary-to-ambient energy ratio (PAR) of the downmix signal and the panning gain ratio of the primary source. The proposed system preserves the PAR of the downmix signal by using active downmixing which reflects spatial characteristic. The proposed system also improves the accuracy of the panning gain ratio estimation. Computer simulations and subjective listening tests verify the performance of the proposed system.

Optimal Design of Acoustical Characteristics of Passenger Compartment (차실 음향 최적 설계에 관한 연구)

  • 김정수;강연준
    • Proceedings of the Korean Society for Noise and Vibration Engineering Conference
    • /
    • 2003.11a
    • /
    • pp.183-188
    • /
    • 2003
  • This study is to make the fundamentals of sound quality evaluation in regard of acoustical characteristics of passenger compartment. The deviation of frequency response function level within audible frequency is evaluated at receiving point in the research of room acoustics. In this study, frequency response function is the one between speaker and driver's ear positions. The positions of driver and audio speakers are optimized by analysis of acoustic mode of acoustic cavity. The main reflection planes are determined by analysis sound ray path diffused at optimized speaker positions. Finally, designer selects acoustical material by analysis of absorption effect of acoustical materials on the main reflection planes in order to avoid to distortion and fluctuation of frequency response function..

  • PDF

Story-based Information Retrieval (스토리 기반의 정보 검색 연구)

  • You, Eun-Soon;Park, Seung-Bo
    • Journal of Intelligence and Information Systems
    • /
    • v.19 no.4
    • /
    • pp.81-96
    • /
    • 2013
  • Video information retrieval has become a very important issue because of the explosive increase in video data from Web content development. Meanwhile, content-based video analysis using visual features has been the main source for video information retrieval and browsing. Content in video can be represented with content-based analysis techniques, which can extract various features from audio-visual data such as frames, shots, colors, texture, or shape. Moreover, similarity between videos can be measured through content-based analysis. However, a movie that is one of typical types of video data is organized by story as well as audio-visual data. This causes a semantic gap between significant information recognized by people and information resulting from content-based analysis, when content-based video analysis using only audio-visual data of low level is applied to information retrieval of movie. The reason for this semantic gap is that the story line for a movie is high level information, with relationships in the content that changes as the movie progresses. Information retrieval related to the story line of a movie cannot be executed by only content-based analysis techniques. A formal model is needed, which can determine relationships among movie contents, or track meaning changes, in order to accurately retrieve the story information. Recently, story-based video analysis techniques have emerged using a social network concept for story information retrieval. These approaches represent a story by using the relationships between characters in a movie, but these approaches have problems. First, they do not express dynamic changes in relationships between characters according to story development. Second, they miss profound information, such as emotions indicating the identities and psychological states of the characters. Emotion is essential to understanding a character's motivation, conflict, and resolution. Third, they do not take account of events and background that contribute to the story. As a result, this paper reviews the importance and weaknesses of previous video analysis methods ranging from content-based approaches to story analysis based on social network. Also, we suggest necessary elements, such as character, background, and events, based on narrative structures introduced in the literature. We extract characters' emotional words from the script of the movie Pretty Woman by using the hierarchical attribute of WordNet, which is an extensive English thesaurus. WordNet offers relationships between words (e.g., synonyms, hypernyms, hyponyms, antonyms). We present a method to visualize the emotional pattern of a character over time. Second, a character's inner nature must be predetermined in order to model a character arc that can depict the character's growth and development. To this end, we analyze the amount of the character's dialogue in the script and track the character's inner nature using social network concepts, such as in-degree (incoming links) and out-degree (outgoing links). Additionally, we propose a method that can track a character's inner nature by tracing indices such as degree, in-degree, and out-degree of the character network in a movie through its progression. Finally, the spatial background where characters meet and where events take place is an important element in the story. We take advantage of the movie script to extracting significant spatial background and suggest a scene map describing spatial arrangements and distances in the movie. Important places where main characters first meet or where they stay during long periods of time can be extracted through this scene map. In view of the aforementioned three elements (character, event, background), we extract a variety of information related to the story and evaluate the performance of the proposed method. We can track story information extracted over time and detect a change in the character's emotion or inner nature, spatial movement, and conflicts and resolutions in the story.

Incomplete Cholesky Decomposition based Kernel Cross Modal Factor Analysis for Audiovisual Continuous Dimensional Emotion Recognition

  • Li, Xia;Lu, Guanming;Yan, Jingjie;Li, Haibo;Zhang, Zhengyan;Sun, Ning;Xie, Shipeng
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.13 no.2
    • /
    • pp.810-831
    • /
    • 2019
  • Recently, continuous dimensional emotion recognition from audiovisual clues has attracted increasing attention in both theory and in practice. The large amount of data involved in the recognition processing decreases the efficiency of most bimodal information fusion algorithms. A novel algorithm, namely the incomplete Cholesky decomposition based kernel cross factor analysis (ICDKCFA), is presented and employed for continuous dimensional audiovisual emotion recognition, in this paper. After the ICDKCFA feature transformation, two basic fusion strategies, namely feature-level fusion and decision-level fusion, are explored to combine the transformed visual and audio features for emotion recognition. Finally, extensive experiments are conducted to evaluate the ICDKCFA approach on the AVEC 2016 Multimodal Affect Recognition Sub-Challenge dataset. The experimental results show that the ICDKCFA method has a higher speed than the original kernel cross factor analysis with the comparable performance. Moreover, the ICDKCFA method achieves a better performance than other common information fusion methods, such as the Canonical correlation analysis, kernel canonical correlation analysis and cross-modal factor analysis based fusion methods.

A Phenomenological Study on the Restoration Experience for Suicide Ideation of Korean Elders (한국 노인의 자살생각 극복경험)

  • Jo, Kae-Hwa;Kim, Yeong-Kyeong
    • Journal of Korean Academy of Nursing
    • /
    • v.38 no.2
    • /
    • pp.258-269
    • /
    • 2008
  • Purpose: The purpose of this study was to understand and analyze the experience of restoration among Korean elders with suicide ideation. Methods: A phenomenological research method guided data collection and analysis. A total of five elders having had suicide ideation participated. Data were collected through individual in-depth interviews. All interviews were audio taped and transcribed verbatim. Coding was used to establish different concepts and categories. Results: As the results of analysis, the following three constituents have been found as a retrospective focus based on the primary suicide ideation: expanding their view and facing reality, reconstructing their view about life and death as well as self. Conclusion: The results of this study may contribute to health professionals working at various crisis settings to understand Korean elders with suicide ideation.

Medical Image Workstation Using Multimedia Technique (멀티미디어를 이용한 의료용 영상 워크스테이션)

  • 이태수;차은종
    • Journal of Biomedical Engineering Research
    • /
    • v.15 no.1
    • /
    • pp.63-70
    • /
    • 1994
  • A medical image workstation was developed using multimedia technique. The system based on PC-486DX was designed to acquire medical images produced by medical imaging instruments and related audio information, that is, doctors'reporting results. Input int'ormation was processed and analyzed, then the results were presented in the form of graph and animation. All the informations of the system were hierarchically related with the image as the apex. Processing and Analysis algorithms were implemented so that the diagnostic accuracy could be improved. The diagnosed id'ormation can be transferred for patient diagnosis through LAN (local area network).

  • PDF

L2 Learner's Perspectives of How Personal and Instructional Factors Influence Achievement in Online-incorporated Environment

  • Kim, Jeong-Yeon
    • English Language & Literature Teaching
    • /
    • v.16 no.4
    • /
    • pp.39-69
    • /
    • 2010
  • This study aims to identify how participants in online-incorporated English learning perceive interaction between achievement and factors of learning and personality. Using grounded theory analysis, this study attempts to generate a theoretical model depicting how the factors work with the L2 learners situated in the learning setting. A total of 231 college freshmen participated in online and offline EFL learning programs for the duration of one semester. In addition, all respondents completed a survey questionnaire on their learning experiences. In the investigation of the differences between low- and high-proficiency groups, audio-taped interviews with 20 selected students, 10 from each group, have revealed differences not only in the types of personal and instructional factors, but also, more importantly, in the interrelationship between these factors in each group's learning model. These models effectively explained the statistically significant differences in four questionnaire items, such as online learning and contributions of offline class sections to their L2 achievement. These findings entail L2 practitioners' shared understandings of their students' perspectives of learning in the specific L2 learning context.

  • PDF

Development of a Real-time Vehicle Driving Simulator

  • Kim, Hyun-Ju;Park, Min-Kyu;Lee, Min-Cheoul;You, Wan-Suk
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.51.2-51
    • /
    • 2001
  • A vehicle driving simulator is a virtual reality device which makes a human being feel as if the one drives a vehicle actually. The driving simulator is effectively used for studying interaction of a driver-vehicle and developing the vehicle system of new concepts. The driving simulator consists of a motion platform, a motion controller, a visual and audio system, a vehicle dynamic analysis system, a vehicle operation system and etc. The vehicle dynamic analysis system supervises overall operation of the simulator and also simulates dynamic motion of a multi-body vehicle model in real-time. In this paper, the main procedures to develop the driving simulator are classified by 4 parts. First, a vehicle motion platform and a motion controller, which generates realistic motion using a six degree of freedom Stewart platform driven hydraulically. Secondly, a visual system generates high fidelity visual scenes which are displayed on a screen ...

  • PDF

Analysis of Sound Transmitting System using Power line Communication Technique (전력선을 이용한 음향전달 시스템의 구성 및 특성 분석)

  • Kim, Ho-Soo;Lee, Myung-Sub;Koo, Kyung-Wan;Han, Sang-Ok
    • Journal of the Korean Institute of Illuminating and Electrical Installation Engineers
    • /
    • v.18 no.3
    • /
    • pp.128-134
    • /
    • 2004
  • This paper presents a result of sound transmitting system with power line communication technique. Sound transmitting system is a transmitter which transmits modulated audio signal to power to power line and receiver which is capable of detecting it with earphone or speaker. It has been evaluated with the frequency characteristics and spectrum analysis. And, from the result of evaluation on the developed system, we confirmed the superior sound transmitting characteristics, and the possibility of application on a language laboratory.

A tudy on the TV Microphonic Phenomenon (TV 마이크로포닉 현상에 관한 연구)

  • 성길주;윤경렬;이재응;이수훈;임진수
    • Journal of KSNVE
    • /
    • v.5 no.1
    • /
    • pp.123-132
    • /
    • 1995
  • The microphonic phenomenon in TV(television) is a phenomenon that a stained pattern locally appears in the screen or moves like waves. This can be observed when audio signal of TV has specific frequencies under loud volume of sound. In this study, microphonic phenomenon has been investigated, and two practical ways of circumventing this has been proposed. Based on modal analysis of several TV parts(Cathod Ray Tube, shadowmask, etc.), it was proved that the microphonic phenomenon is caused by the resonance of the shadow mask. One of the proposed ways to circumvent this phenonenon is increasing the thickness of the frame, the other is removing the middle welding points between the frame and the shadow mask. The effects of these modifications are evaluated by the finite element analysis, and the results show that the magnitude of vibration of shadow mask reduced by 10 - 20dB, which is large enough to provent microphonic phenomenon even under maximum level of sound volume.

  • PDF