• Title/Summary/Keyword: MPEG-4 scene

Search Result 79, Processing Time 0.022 seconds

MPEG-4 based XMT APIs for Scene Description (장면 기술을 위한 MPEG-4 기반 XMT API 구현)

  • 정예선;김규헌;기명석
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2001.11b
    • /
    • pp.91-94
    • /
    • 2001
  • MPEG-4 시스템은 장면 자체를 하나의 구성 요소로 여기는 기존의 시스템과는 달리, 그 장면을 구성하는 부호화 또는 복호화된 A/V 객체(Audio/visual Objects)들을 하나의 단위로 인식하여, 다양한 멀티미디어 컨텐츠의 장면을 구성(Scene Composition)하고 표현 하는 것에 그 특징이 있다. 이러한 MPEG-4 시스템의 객체 기반 특징은 다양한 사용자와의 대화성(Interactivity)을 가능하게 하며 , 또한 편리한 컨텐츠 편집 및 재사용 등이 가능하기에 차세대 디지털 방송 컨텐츠 제작에 중요하게 활용될 전망이다. 객체 기반 A/V 편집 도구는 MPEG-4를 기반으로 차세대 디지털 방송 컨텐츠 제작을 용이하게 하기 위한 제작/편집 도구로써 , 장면을 표현하기 위하여 BIFS(Binary Format for Scene description)와 XMT(eXtensible MPEG-4 Textual format) 포맷을 모두 사용하고 있다. BIFS 포맷은 저작된 결과물을 바이너리 형태로 표현하기 때문에, 저작된 결과물을 전송하는 데에는 용이하나, 중간에 저작된 결과물을 확인하기 어렵고, 또한 기존의 다른 어플리케이션과의 상호 작용(Interoperability)과 교환(Exchange)에도 어려움이 따른다. 이에 반해, XMT는 차세대 마크업 언어로 각광 받고 있는 XML 에 그 기반을 두고 있기에 저작된 결과물을 제작자가 쉽게 저작물을 이해할 수 있으며, SMIL 과 X3D 같은 다른 어플리케이션과의 상호작용과 교환 또한 용이하게 한다 XMT는 기술 방법에 따라 XMT-A 와 XMT-0 두 가지 형태가 있으며, XMT-A 포맷은 VRML에서 발전한 X3D(extensible 3D)를 바탕으로 MPEG-4 시스템의 특징들을 수용하여 구성되고 BIFS와 일대일로 대응된다. 반면에 XMT-0는 멀티미디어 문서를 웹문서로 표현하는 SMIL 2.0 을 그 기반으로 하였기에 MPEG-4 시스템의 특징보다는 컨텐츠를 저작하는 제작자의 초점에 맞추어 개발된 형태이다. XMT를 이용하여 컨텐츠를 저작하기 위해서는 사용자 인터페이스를 통해 입력되는 저작 정보들을 손쉽게 저장하고 조작할 수 있으며, 또한 XMT 파일 형태로 출력하기 위한 API 가 필요하다. 이에, 본 논문에서는 XMT 형태의 중간 자료형으로의 저장 및 조작을 위하여 XML 에서 표준 인터페이스로 사용하고 있는 DOM(Document Object Model)을 기반으로 하여 XMT 문법에 적합하게 API를 정의하였으며, 또한, XMT 파일을 생성하기 위한 API를 구현하였다. 본 논문에서 제공된 API는 객체기반 제작/편집 도구에 응용되어 다양한 멀티미디어 컨텐츠 제작에 사용되었다.

  • PDF

Scene Change Detection Using MPEG Bitstream and Sectionally Decoded Video (MPEG 비트스트림과 구간 복호 영상을 사용한 장면 전환 검출)

  • 나윤정;하명환;이상길
    • Journal of Broadcast Engineering
    • /
    • v.4 no.2
    • /
    • pp.119-126
    • /
    • 1999
  • We proposed an algorithm which detects scene changes in video with speediness and accuracy. It is a two-step approach. In the first step, we decide potential scene change segments using the compressed domain data extracted by temporal sampling of MPEG compressed video. In the second step, we determine the exact scene change positions using the pixel values of each frame in those segments by means of combining the intensity and edge changes. In addition we discuss the method to remove false detection generated from camera flash. Integrating the above methods, we introduce a structure that can detect scene changes speedily and accurately.

  • PDF

A Content-Based Synchronization Approach using Scene Keywords in Enhanced TV based on MPEG-4 (MPEG-4 기반 연동형 방송에서 장면 키워드를 이용한 내용 기반 동기화 기법)

  • Yim, Hyun-Jeong;Lim, Soon-Bum
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.6
    • /
    • pp.737-741
    • /
    • 2010
  • When implementing Enhanced TV services, the time synchronization between the video stream that forms the background and the data contents overlaid on audio/video is an important issue. Currently, however, the basic method of synchronizing the data in the MPEG-4 environment is based on absolute time values. For more efficient synchronization when developing Enhanced TV content, this paper proposes a content-based synchronization in which the data content varies depending on the video content. The proposed content-based synchronization method is implemented by defining BIFS nodes more widely, based on scene keywords, and then using the metadata of MPEG7.

Carriage of MPEG-4 over MPEG-2 Transport Stream Protocol (MPEG-2 전송 스트림 프로토콜을 이용한 MPEG-4 데이터의 전송)

  • 안상우;최진수;김용석;김문철
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2000.11b
    • /
    • pp.101-106
    • /
    • 2000
  • We propose a method of the efficient injection of the MPEG-4 data into MPEG-2 A/V stream. The proposed method is to transmit MPEG-4 data synchronized with MPEG-2 A/V stream using MPEG-2 Transport Stream Protocol so that the user can decode MPEG-4 data on time at the client side, after extracting IOD (Initia1 Object Descriptor), OD (Object Descriptor), BIFS (Binary Format for Scone) and media data from mp4 file.

  • PDF

Slide-show of Panoramic Image through a Secondary Device by using MPEG-4 LASeR PMSI (MPEG-4 LASeR PMSI를 활용한 Secondary Device 기반 파노라믹 영상 슬라이드 쇼 재생 기술)

  • Park, Yongchul;Kim, Kyuheon
    • Journal of Broadcast Engineering
    • /
    • v.17 no.6
    • /
    • pp.1014-1028
    • /
    • 2012
  • Recently, N-screen service and secondary device have gotten an attention from public. Also, we can experience N-screen service through various digital devices. N-screen means multimedia technology which can seamlessly consume multimedia content. Secondary device means auxiliary multimedia device which can consume content related to main content through adjunct connection to main device. Not only be electronic manufactures interested in N-screen technology and services but also digital devices applied for N-screen have been released. But it has limitation that user can only consume content to be purchased from content company server not device of user. This paper proposes the system that composes effective and various N-screen multimedia service through MPEG-4 LASeR (Lightweight Application Scene Representation) PMSI (Presentation and Modification of Structured Information) as international standard technology which can provide scene description including many instruction for dynamic update of scene.

BIFS Information Generation about Scene of MPEG-4 contents (MPEG-4 컨텐츠의 씬에 대한 BIFS 정보 생성)

  • Bae, Su-Young;Kim, Sang-Wook
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2001.10a
    • /
    • pp.217-220
    • /
    • 2001
  • 본 논문에서는 MPEG-4 컨텐츠 개발에 있어서 장면 그래프 표현에 소비되는 노력을 절감하기 위해 사용자에게는 기존 멀티미디어 저작도구의 직관적인 저작 환경을 제공하고, 내부적으로 MPEG-4 장면 그래프로 변환하는 방법을 제시한다. 직관적인 저작 환경은 현재 가장 널리 사용되는 매크로미디어의 디렉터와 플래쉬로부터 도입했으며, 저작 환경은 시청각 작업 공간, 시간 작업 공간, 애니메이션 작업 공간으로 구성된다. 작업 공간에 저작된 내용은 MPEG-4 장면 그래프 생성 계층과 장면 그래프 구성 규칙을 통해 BIFS 코드로 생성된다.

  • PDF

A Scene Boundary Detection Scheme using Audio Information in MPEG System Stream (MPEG 시스템 스트림상에서 오디오 정보를 이용한 장면 경계 검출 방법)

  • Kim, Jae-Hong;Nang, Jong-Ho;Park, Soo-Yong
    • Journal of KIISE:Software and Applications
    • /
    • v.27 no.8
    • /
    • pp.864-876
    • /
    • 2000
  • This paper proposes a new scene boundary detection scheme for the MPEG System stream using MPEG Audio information and proves its usefulness by extensive experiments. A scene boundary has a characteristic that the audio as well as video information are changed rapidly. This paper first classifies this scene boundary into three cases ; Radical, Gradual, Micro Changes, with respect to the audio changes. The Radical change has a large-scale changing of decibel value and pitch value at a scene boundary, the Gradual change shows the long-time transition of decibel and pitch values from max to min or vice versa, and the Micro change displays a some change of pitch or frequency distribution without decibel changes. Upon this analysis, a new scene change detection algorithm detecting these three cases is proposed in which a progressive window with a time line is used to trace the changes in the audio information. Some experiments with various movies show that proposed algorithm could produce a high detection ratio for Radical change that is the most popular scene change in the movies, while producing a moderate detection ratio for Gradual and Micro changes. The proposed scene boundary detection scheme could be used to build a database for visual information like MPEG System stream.

  • PDF

Design and Implementation of Interactive Multi-view Visual Contents Authoring System (대화형 복수시점 영상콘텐츠 저작시스템 설계 및 구현)

  • Lee, In-Jae;Choi, Jin-Soo;Ki, Myung-Seok;Jeong, Se-Yoon;Moon, Kyung-Ae;Hong, Jin-Woo
    • Journal of Broadcast Engineering
    • /
    • v.11 no.4 s.33
    • /
    • pp.458-470
    • /
    • 2006
  • This paper describes issues and consideration on authoring of interactive multi-view visual content based on MPEG-4. The issues include types of multi-view visual content; scene composition for rendering; functionalities for user-interaction; and multi-view visual content file format. The MPEG-4 standard, which aims to provide an object based audiovisual coding tool, has been developed to address the emerging needs from communications, interactive broadcasting as well as from mixed service models resulting from technological convergence. Due to the feature of object based coding, the use of MPEG-4 can resolve the format diversity problem of multi-view visual contents while providing high interactivity to users. Throughout this paper, we will present which issues need to be determined and how they can be realized by means of MPEG-4 Systems.

Rich Media Framework based on MPEG-4 LASeR PMSI Technique (MPEG-4 LASeR PMSI 기술에 기반한 리치미디어 프레임워크 설계 및 구현)

  • Lee, In-Jae;Song, Seung-Won;Lee, Han-Kyu;Cha, Ji-Hun
    • Journal of Broadcast Engineering
    • /
    • v.15 no.2
    • /
    • pp.248-264
    • /
    • 2010
  • In this paper, we presents an efficient rich media framework based on MPEG-4 LASeR PMSI. Rich media provides distinctive features such as dynamic updates and object-based interactivity over the conventional AV centric media service. It rapidly gains its popularity as the convergent multimedia services between broadcasting and telecommunication. MPEG-4 LASeR is an international standard which provides specifications for such rich media services. Recently MPEG added an extension to LASeR called PMSI. It provides an efficient technique to present SI on a scene by referencing specific portions of SI from PI. The presented rich media application is using PMSI of MPEG-4 LASeR standard to provide users a widget-like rich media application. This application utilizes MPEG-4 LASeR with PMSI technique as PI, and this PI references SI to present information that resides in SI on a scene. In this paper, we provide descriptions of technical ingredients used to build the presented application. The framework is presented followed by the implementation result. Possible impacts and applicable services are described in the conclusion.

MPEG-DASH Services for 3D Contents Based on DMB AF (DMB AF 기반 3D 콘텐츠의 MPEG-DASH 서비스)

  • Kim, Yong Han;Park, Minkyu
    • Journal of Broadcast Engineering
    • /
    • v.18 no.1
    • /
    • pp.115-121
    • /
    • 2013
  • Recently an extension to DMB AF (Digital Multimedia Broadcasting Application Format) standard has been proposed in such a way that the extended DMB AF can include stereoscopic video and stereoscopic images for interactive service data, i.e., MPEG-4 BIFS (Binary Format for Scene) data, in addition to the existing 2D video and 2D images for BIFS services. In this paper we developed a service that provides the streaming of 3D contents in DMB AF by using MPEG-DASH (Dynamic Adaptive Streaming over HTTP) standard and validated it by implementing the client software.