• Title/Summary/Keyword: MPEG Audio


A MPEG Audio-Visual Conversational Communication Terminal on the B-ISDN Environment (광대역 ISDN용 MPEG 오디오-비쥬열 대화형 통신단말의 설계 및 구현)

  • Hwang, Dae-Hwan;Cho, Kyu-Seob
    • The Transactions of the Korea Information Processing Society / v.5 no.8 / pp.1960-1971 / 1998
  • Research and development to provide multimedia communication services such as Video on Demand (VoD), real-time video phone, and multipoint video conferencing in broadband ISDN environments has been actively pursued. The Digital Audio-Visual Council (DAVIC) has already completed specifications for VoD services, covering the detailed technologies of a total service system consisting of a VoD server, a delivery network, and a Set-Top Box (STB), and ITU-T SG16 has recommended the H.300-series standards on terminal aspects for conversational multimedia services. However, because these organizations take different views of multimedia terminals and services, the terminal architectures they recommend and specify are not structured to provide retrieval, distribution, and conversational services together efficiently. In this paper, we analyze the recommendations and specifications of international public and private organizations such as ITU-T, DAVIC, and the ATM Forum. Based on this analysis, we propose an efficient terminal architecture, and we design and implement a multimedia communication terminal that offers VoD and real-time conversational services. Functional module tests for each individual communication service session confirmed the validity of the implemented terminal for use in broadband ISDN environments.


A New Tempo Feature Extraction Based on Modulation Spectrum Analysis for Music Information Retrieval Tasks

  • Kim, Hyoung-Gook
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.6 no.2 / pp.95-106 / 2007
  • This paper proposes an effective tempo feature extraction method for music information retrieval. The tempo information is modeled by the narrow-band temporal modulation components, which are decomposed into a modulation spectrum via joint frequency analysis. In the implementation, the tempo feature is extracted directly from the modified discrete cosine transform (MDCT) coefficients, which are the output of a partial MP3 (MPEG-1 Layer 3) decoder. Different features are then extracted from the amplitudes of the modulation spectrum and applied to different music information retrieval tasks. The logarithmic-scale modulation frequency coefficients are employed in automatic music emotion classification and music genre classification, and the classification precision of both systems improves significantly. The bit vectors derived from the adaptive modulation spectrum are used in an audio fingerprinting task, where they are shown to achieve high robustness. The experimental results in these tasks validate the effectiveness of the proposed tempo feature.
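To make the idea of a modulation spectrum concrete, the following is a minimal sketch rather than the paper's method. It assumes a single subband magnitude envelope (one value per frame, for example the summed MDCT magnitudes of one band) and a known frame rate. The function names `modulation_spectrum` and `dominant_tempo_bpm` are hypothetical, and the logarithmic-scale coefficients and bit-vector features described in the abstract are not reproduced.

```python
import cmath
import math


def modulation_spectrum(envelope):
    """Return DFT magnitudes (bins 0..N/2) of a mean-removed subband envelope."""
    n = len(envelope)
    mean_value = sum(envelope) / n
    centered = [x - mean_value for x in envelope]
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(centered)))
        for k in range(n // 2 + 1)
    ]


def dominant_tempo_bpm(envelope, frame_rate_hz):
    """Convert the strongest non-DC modulation frequency to beats per minute."""
    spectrum = modulation_spectrum(envelope)
    peak_bin = max(range(1, len(spectrum)), key=lambda k: spectrum[k])
    modulation_hz = peak_bin * frame_rate_hz / len(envelope)
    return 60.0 * modulation_hz


# Synthetic envelope (an assumption for illustration): 5 s at 40 frames/s,
# pulsing at 2 Hz, which corresponds to 120 beats per minute.
frame_rate = 40.0
envelope = [1.0 + math.cos(2 * math.pi * 2.0 * t / frame_rate) for t in range(200)]
tempo = dominant_tempo_bpm(envelope, frame_rate)
```

A real system would compute such spectra per subband over sliding windows; the direct DFT here is chosen for clarity, not speed.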


Effective Method to Change Multimedia Scene Configuration Information Using DOM Update (DOM update를 이용한 효율적인 멀티미디어 장면 구성 정보 변경 방안)

  • Kim, Kyuheon;Park, JungWook;Kim, Byungchul
    • Journal of Broadcast Engineering / v.18 no.1 / pp.43-58 / 2013
  • A rich media service is an interactive media service that presents viewers with several multimedia elements (such as video, audio, and text) at the same time. These elements are delivered using scene description standards such as BIFS (Binary Format for Scenes) and LASeR (Lightweight Application Scene Representation). By providing scene composition information, a rich media service can support a variety of multimedia services, so users can receive personalized services that fit their temporal and spatial preferences. In conventional technology, when the scene is changed by the user or the service, the mobile device deletes the existing scene configuration information and builds new scene configuration information, which is very inefficient. This paper proposes a scene change method that uses the DOM (Document Object Model) to transmit only the part of the configuration that changes.
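The contrast between rebuilding a scene and updating only the changed part can be sketched as follows. This is an illustration under stated assumptions, not the paper's protocol: the scene markup, the `id` attributes, and the helper `apply_update` are hypothetical and are not BIFS or LASeR commands. Only Python's standard `xml.etree.ElementTree` module is used.

```python
import xml.etree.ElementTree as ET

# Hypothetical scene description; element and attribute names are made up.
SCENE_XML = (
    '<scene>'
    '<video id="v1" x="0" y="0"/>'
    '<text id="t1" x="10" y="200">Title</text>'
    '</scene>'
)


def apply_update(root, target_id, attributes):
    """Patch one element of the parsed scene tree in place.

    Returns True if an element with the given id was found and updated.
    """
    for element in root.iter():
        if element.get("id") == target_id:
            element.attrib.update(attributes)
            return True
    return False


root = ET.fromstring(SCENE_XML)
video_before = dict(root.find("video").attrib)

# The update message carries only the changed attribute, not the whole scene.
updated = apply_update(root, "t1", {"x": "50"})
```

The rest of the tree is left untouched, so the receiver does not need to discard and re-parse the full scene description.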

A New Bandwidth Smoothing Technique for On-Line Video Services based on Multicasting (멀티캐스팅 방식의 온라인 비디오 서비스를 위한 새로운 대역폭 완화기법)

  • Jin, Seong-Gi;Kim, Jin-Seok;Gang, Seok-Ryeol;Yun, Hyeon-Su
    • Journal of KIISE:Computer Systems and Theory / v.26 no.8 / pp.939-948 / 1999
  • Multimedia applications such as video on demand (VOD), teleconferencing, and tele-lecturing generate multimedia traffic such as video and audio. The most important characteristic of this traffic is its burstiness. Burstiness is a major factor in reducing network efficiency, so an efficient bandwidth allocation policy that can cope with it is very important in network management. In this paper, we design an efficient and useful bandwidth allocation scheme, called bandwidth smoothing, for multicast on-line video applications. The proposed technique is dynamic: it adjusts the size of the smoothing window using the characteristics of the data the video server has already transmitted. In experiments with several MPEG traces, the technique reduces the network bandwidth required to transmit on-line video traffic about as much as previously proposed on-line smoothing methods, and it improves the server's resource utilization, especially for multicast on-line video services.
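As a rough illustration of why smoothing reduces the peak bandwidth of bursty video traffic, here is a minimal sketch of fixed-window smoothing. It is a simplification: the abstract describes a window whose size is adjusted dynamically from already-transmitted data, and that adjustment rule is not given here, so the window is a plain parameter. The function name `smooth` and the frame sizes are hypothetical.

```python
def smooth(frame_sizes, window):
    """Send each group of `window` frames at the group's average rate.

    Returns the per-frame-slot transmission amount after smoothing.
    """
    rates = []
    for start in range(0, len(frame_sizes), window):
        chunk = frame_sizes[start:start + window]
        average = sum(chunk) / len(chunk)
        rates.extend([average] * len(chunk))
    return rates


# Made-up frame sizes (kilobits) with large frames followed by small ones.
frames = [90, 10, 10, 90, 10, 10]
smoothed = smooth(frames, window=3)

peak_before = max(frames)
peak_after = max(smoothed)
```

The total amount of data is unchanged while the peak rate drops; the cost is buffering delay proportional to the window size, which is one reason an on-line scheme might adapt the window.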

VLSI Design of H.263 Video Codec Based on Modular Architecture (모듈화된 구조에 기반한 H.263 비디오 코덱 VLSI의 설계)

  • Kim, Myung-Jin;Lee, Sang-Hee;Kim, Keun-Bae
    • Journal of the Institute of Electronics Engineers of Korea SP / v.39 no.5 / pp.477-485 / 2002
  • In this paper, we present an efficient hardware architecture for the H.263 video codec and its VLSI implementation. The architecture is based on a unified interface through which the internal hardware engines and an internal RISC processor are connected to one another. The unified interface enables modular design of the internal blocks, efficient hardware/software partitioning, and pipelined parallel operation. The developed VLSI supports H.263 version 2 profile 3 @ level 10, as well as the H.245 control protocol and the H.223 multiplexing protocol. With the help of an external audio codec, it can therefore serve as a complete ITU-T H.324 or 3GPP 3G-324M multimedia processor. Simultaneous encoding and decoding of QCIF-format images at more than 15 frames per second is achieved at a 40 MHz clock frequency.

Abstraction Mechanism of Low-Level Video Features for Automatic Retrieval of Explosion Scenes (폭발장면 자동 검출을 위한 저급 수준 비디오 특징의 추상화)

  • Lee, Sang-Hyeok;Nang, Jong-Ho
    • Journal of KIISE:Software and Applications / v.28 no.5 / pp.389-401 / 2001
  • This paper proposes an abstraction mechanism for low-level digital video features that supports the automatic retrieval of explosion scenes from a digital video library. In the proposed mechanism, the regional dominant colors of the key frame and the motion energy of the shot are defined as the primary abstractions of a shot, because an explosion shot usually consists of frames with yellow-tone pixels and the objects in the shot move rapidly. The regional dominant colors of a shot are selected by dividing its key frame image into several regions and extracting the dominant color of each region, and the motion energy of the shot is defined as the difference between the edge images of the key frame and its neighboring frame. The edge image of the key frame makes the retrieval more precise: flames usually veil all other objects in the shot, so the edge image of the key frame in an explosion shot is comparatively simple. The proposed automatic retrieval algorithm declares an explosion scene if the scene has a shot with a yellow regional dominant color whose motion energy is several times higher than the average motion energy of the shots in that scene. The edge image of the key frame is also used to filter out false detections. Extensive experimental results indicate that the recall and precision of the proposed abstraction and detection algorithm are both about 0.8, and that they are not sensitive to the thresholds. This abstraction mechanism could be used to summarize long action videos and to extract high-level semantic information from digital video archives.
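The decision rule stated in the abstract can be sketched as below. This is a minimal sketch under assumptions: the `Shot` type, the color labels, and the `energy_ratio` default of 3.0 are hypothetical (the abstract says only "several times higher"), and the edge-image filter for false detections is omitted.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Shot:
    dominant_colors: list   # regional dominant color labels of the key frame
    motion_energy: float    # edge-image difference between key frame and neighbor


def is_explosion_scene(shots, energy_ratio=3.0):
    """Flag a scene if any shot is yellow-dominant and far above average motion.

    `energy_ratio` is an assumed threshold, not a value from the paper.
    """
    if not shots:
        return False
    average_energy = mean(shot.motion_energy for shot in shots)
    return any(
        "yellow" in shot.dominant_colors
        and shot.motion_energy > energy_ratio * average_energy
        for shot in shots
    )


calm = [Shot(["gray"], 1.0), Shot(["gray"], 1.0), Shot(["gray"], 1.0)]
explosive = calm + [Shot(["yellow", "red"], 20.0)]
fast_but_gray = calm + [Shot(["gray"], 20.0)]
```

Because the average includes the candidate shot itself, a scene with very few shots can never exceed a large ratio; a fuller implementation might compare against the average of the other shots instead.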
