• Title/Summary/Keyword: Audio Quality

Search Result 446, Processing Time 0.027 seconds

Speaker-Dependent Emotion Recognition For Audio Document Indexing

  • Hung LE Xuan;QUENOT Georges;CASTELLI Eric
    • Proceedings of the IEEK Conference
    • /
    • summer
    • /
    • pp.92-96
    • /
    • 2004
  • The researches of the emotions are currently great interest in speech processing as well as in human-machine interaction domain. In the recent years, more and more of researches relating to emotion synthesis or emotion recognition are developed for the different purposes. Each approach uses its methods and its various parameters measured on the speech signal. In this paper, we proposed using a short-time parameter: MFCC coefficients (Mel­Frequency Cepstrum Coefficients) and a simple but efficient classifying method: Vector Quantification (VQ) for speaker-dependent emotion recognition. Many other features: energy, pitch, zero crossing, phonetic rate, LPC... and their derivatives are also tested and combined with MFCC coefficients in order to find the best combination. The other models: GMM and HMM (Discrete and Continuous Hidden Markov Model) are studied as well in the hope that the usage of continuous distribution and the temporal behaviour of this set of features will improve the quality of emotion recognition. The maximum accuracy recognizing five different emotions exceeds $88\%$ by using only MFCC coefficients with VQ model. This is a simple but efficient approach, the result is even much better than those obtained with the same database in human evaluation by listening and judging without returning permission nor comparison between sentences [8]; And this result is positively comparable with the other approaches.

  • PDF

Security Method of Multimedia Data Characteristics on Video Conference System (영상회의 시스템에서 멀티미디어 데이터 특성에 따른 보안 방법)

  • Han, Kun-Hee
    • Journal of the Korea Society of Computer and Information
    • /
    • v.10 no.4 s.36
    • /
    • pp.143-148
    • /
    • 2005
  • Video conference system it is various at internet and uses the reading is become accomplished. Research of like this portion synchronization of audio, the compression technique and multimedia data, supports the video conference the research of the Mbone of the IP multicast for being active. being become accomplished the multimedia service which is various an video from internet, the line speed of communication becomes high-speed anger and to follow leads is become accomplished. The video conference from opening elder brother dispersion internet network environment the problem against the image which is an image conference data and a voice security is serious and it raises its head. To sleep it presents the security method which from the video conference it follows in quality of multimedia data from the dissertation which it sees and it does.

  • PDF

The Evolving Sound Art (Part 1): Sonic Singularities and Chronicle Traces (진화하는 사운드 아트 (1부): 소리의 특이성과 시대적 기록)

  • Lee, Irene Eunyoung
    • The Journal of the Convergence on Culture Technology
    • /
    • v.6 no.1
    • /
    • pp.395-401
    • /
    • 2020
  • Sound Art retains heterogeneous and borderless inborn-characteristics on it. Despite it is a non-mainstream art which could not foster fertile soil to bring up many established artists yet, the domestic area is keep growing and expanding. And now it will soon be that time of overcoming the debates between the art world and the music world to widely embrace de-facto artworks and practices, and bringing more quality critiques. This article talks about a concise history of sound art by addressing some singularities and chronicle traces of it which may be helpful information to lead into more opened future discussion forums in the domestic sound art field.

A Study on Evaluation of Environmental Characteristics of Maternity Room (산후관리시설의 산모실 환경특성평가에 관한 연구)

  • Hwang, Yeon-Sook;Son, Yeo-Rym
    • Korean Institute of Interior Design Journal
    • /
    • v.16 no.6
    • /
    • pp.47-55
    • /
    • 2007
  • The purpose of this study is to evaluate environmental characteristics of maternity rooms. The method of this study is a field survey on 8 samples of postpartum care centers in Seoul. The plan, colors, materials, furniture and environmental characteristics of maternity rooms are analyzed. The characteristics of maternity rooms environment were categorized into four items; comfort, privacy, communication and dwelling. The results are as follows: Western-style and rooming-separation system of maternity rooms are used. Maternity rooms are generally good for dwelling quality but insufficient for communication. There are a lack of supply to control a temperature Individually in maternity rooms. It demands to make the type of one-sided public space between maternity room and living room for privacy. All of the maternity rooms surveyed are furnished with TV, radio, and telephone but, to improve communication with visitors, it is recommended that more convenient supplies such as audio and video system, chairs, and table be equipped. There are needs for sky-light windows in maternity rooms. It is necessary to research more about the space of living room, nursing room and service area, and we need more study about baby, nurser and owner' spaces.

Visual Telephone System of Differential Task Interrupt Method (차등 태스크 인터럽트 방식의 영상단말 시스템)

  • 박배욱;정하재;오창석
    • Journal of the Korea Institute of Information and Communication Engineering
    • /
    • v.6 no.5
    • /
    • pp.739-746
    • /
    • 2002
  • In this paper, a new visual telephone system which has a differential task interrupt transfer feature for real time video phone service is presented. Owing to the result of Interrupt transfer of different speed according to the time critical degree of tasks, the flow of audio and video data stream can be kept as constant speed in other word that means video phone services are carried out in real time. The ITU-T H.32x visual telephone recommendations are first analyzed, and the unsatisfactory items of existing systems are second inquired the cause, such as performance, quality. And then the design concept and ideas which enable it to solve them are third devised, the next, the new architecture of visual telephone system for real time video phone source are designed, which make it possible to solve the existing problems by means of different tasks interrupt transfer method.

Design and Implementation of Real-Time Teleconferencing System using the Simplified Resource Reservation on Real-Time CORBA Supporting RIOP (RIOP를 이용한 실시간 CORBA 상에서의 단순화된 자원예약 메커니즘을 이용한 실시간 화상회의 시스템의 설계 및 구현)

  • Hyeon, Ho-Jae;Hong, Seong-Jun;Han, Seon-Yeong
    • The Transactions of the Korea Information Processing Society
    • /
    • v.6 no.7
    • /
    • pp.1897-1908
    • /
    • 1999
  • Multimedia services(i.e. teleconferencing and Video on demand) have been developed on MBone. The video and audio data of them require Real-Time service using QoS(Quality of Service) guarantees. RSVP(Resource reSerVation Protocol) on the Internet has been suggested to support QoS guarantees. But currently, it has two problems : complexity and scalability. To solve these problems, this paper describes the design and implement of teleconferencing system with QoS guarantees by simplifying the resource reservation processing to solve the RSVP's complexity and scalability.

  • PDF

PACS Data Transmission in Hospital Network Based on DDS Middleware (DDS 미들웨어 기반 병원 전산망 PACS 데이터의 전송)

  • Kim, Nam-Ho;Lee, Suk-Hwan;Choi, Chang Yeol;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.16 no.3
    • /
    • pp.290-301
    • /
    • 2013
  • The hospital network requires the effective transmission of multimedia PCAS data for medical treatment. But the network traffic has happened frequently in consultation hours because of the limited resources of hospital network and high capacity of PACS data. This is major interruption for the medical treatment. This problem can be solved by the adaptive QoS. In this paper, we design the middleware based QoS architecture in hospital network for controlling the contribution system. Our virtual simulation verifies that our middleware assures QoS of the priority PACS data of audio and image compared with the conventional hospital network.

Multimodal audiovisual speech recognition architecture using a three-feature multi-fusion method for noise-robust systems

  • Sanghun Jeon;Jieun Lee;Dohyeon Yeo;Yong-Ju Lee;SeungJun Kim
    • ETRI Journal
    • /
    • v.46 no.1
    • /
    • pp.22-34
    • /
    • 2024
  • Exposure to varied noisy environments impairs the recognition performance of artificial intelligence-based speech recognition technologies. Degraded-performance services can be utilized as limited systems that assure good performance in certain environments, but impair the general quality of speech recognition services. This study introduces an audiovisual speech recognition (AVSR) model robust to various noise settings, mimicking human dialogue recognition elements. The model converts word embeddings and log-Mel spectrograms into feature vectors for audio recognition. A dense spatial-temporal convolutional neural network model extracts features from log-Mel spectrograms, transformed for visual-based recognition. This approach exhibits improved aural and visual recognition capabilities. We assess the signal-to-noise ratio in nine synthesized noise environments, with the proposed model exhibiting lower average error rates. The error rate for the AVSR model using a three-feature multi-fusion method is 1.711%, compared to the general 3.939% rate. This model is applicable in noise-affected environments owing to its enhanced stability and recognition rate.

Performance Evaluation of WiBro System based on AMC and H-ARQ by Simulation (AMC와 H-ARQ 에 따른 시뮬레이션 기반의 WiBro 시스템 성능 평가)

  • Seo, Won-Kyeong;Choi, Jae-In;Cho, You-Ze
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.36 no.1A
    • /
    • pp.19-27
    • /
    • 2011
  • WiBro is a wireless mobile communication system which supports a high data rate and high mobility in anywhere and anytime. Although WiBro system provides multimedia service including data, audio and video services, customers require better service quality in WiBro system. But, many researches have theoretically evaluated the performance of WiBro system without systematic studies by simulation. Therefore, in this paper, we evaluate a performance of WiBro system using OPNET simulator. We analyze system performance according to various modulation and coding schemes, and propose Adaptive Modulation and Coding (AMC) profile to support quality of services for user requirements. Also we evaluate the performance of WiBro system using AMC and Hybrid-Automatic Repeat Request (H-ARQ) technologies, and confirm that the proposed AMC profile can be applied to WiBro system with high performance.

Design of Digital Media Protection System using Elliptic Curve Encryption (타원 곡선 암호화를 이용한 영상 저작권 보호 시스템 설계)

  • Lee, Chan-Ho
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.46 no.1
    • /
    • pp.39-44
    • /
    • 2009
  • The advance of communication and networking technology enables high bandwidth multimedia data transmission. The development of high performance compression technology such as H.264 also encourages high quality video and audio data transmission. The trend requires efficient protection system for digital media rights. We propose an efficient digital media protection system using elliptic curve cryptography. Only key parameters are encrypted to reduce the burden of complex encryption and decryption in the proposed system, and the digital media are not played back or the quality is degraded if the encrypted information is missing. We need a playback system with an ECC processor to implement the proposed system. We implement an H.264 decoding system with a configurable ECC processor to verify the proposed protection system We verify that the H.264 movie is not decoded without the decrypted information.