• Title/Summary/Keyword: Mpeg-4

Search Result 1,150, Processing Time 0.029 seconds

Implementation of MP3 decoder with TMS320C541 DSP (TMS320C541 DSP를 이용한 MP3 디코더 구현)

  • 윤병우
    • Journal of the Institute of Convergence Signal Processing
    • /
    • v.4 no.3
    • /
    • pp.7-14
    • /
    • 2003
  • MPEG-1 audio standard is the algorithm for the compression of high-qualify digital audio signals. The standard dictates the functions of encoder and decoder pair, and includes three different layers as the complexity and the performance of the encoder and decoder. In this paper, we implemented the real-time system of MPEG-1 audio layer III decoder(MP3) with the TMS320C541 fixed point DSP chip. MP3 algorithm uses psycho-acoustic characteristic of human hearing system, and it reduces the amount of data with eliminating the signals hard to be heard to the hearing system of human being. It is difficult to implement MP3 decoder with fixed Point DSP because of it's broad dynamic range. We implemented realtime system with fixed DSP chip by using weighted look-up tables to reduce the amount of calculation and solve the problem of broad dynamic range.

  • PDF

Implementation of Fast Inverse Quantization and Inverse Transform Module for VC-1 (VC-1용 고속 역양자화 및 역변환 모듈 구현)

  • Kim, Kyung Hyun;Song, Hyung Don;Sohn, Seung Il
    • Proceedings of the Korea Contents Association Conference
    • /
    • 2007.11a
    • /
    • pp.837-841
    • /
    • 2007
  • 최근 영상을 중심으로 여러 형태의 정보를 결합하여 저장하거나 전송하는 멀티미디어가 많은 관심을 받고 있다. 현재 카메라와 관련된 동영상 캡처기술은 Motion JPEG이 주류를 이루고 있으며, 텔레비전, DMB 등의 방송 분야 및 DVD, VCR 분야에서는 MPEG-2, MPEG-4, H.264 및 WMV9 등의 압축 코덱이 채용되고 사용되고 있다. 그러나 이러한 다양한 영상 표준방식은 디코딩시 호환성 문제가 발생하게 되고 이에 따라 통합 코덱 연구가 필요하다. 이에 본 논문은 일반적 스텝 양자화외에 데드존 양자화를 사용하고 "$4{\times}4$", "$4{\times}8$", "$8{\times}4$", "$8{\times}8$"의 다양한 블록크기의 변환을 지원하는 VC-1을 기반으로 한 ITIQ C언어를 통해 시뮬레이션하고 최적화된 결과를 VHDL로 구현하여 향후 통합코덱 연구에 응용 가능하도록 연구 및 분석평가 하였다. 설계결과 4:2:0의 YCbCr포맷의 최초 $16{\times}16$블록을 복원하는데 483~510클록이 소요되었고 Xilinx XCVPC100 FF1696-6 환경에서 93,128개의 게이트 수와 71.469MHz의 동작속도를 나타내었다. 이는 640*480 크기의 컬러영상을 디코딩 하는데 프레임 당 최대 0.0074초가 소요됨을 의미하며 초당 30프레임의 영상에서도 0.222초면 디코딩이 가능한 결과이다.

  • PDF

Feature Points Selection Using Block-Based Watershed Segmentation and Polygon Approximation (블록기반 워터쉐드 영역분할과 다각형 근사화를 이용한 특징점 추출)

  • 김영덕;백중환
    • Proceedings of the Korea Institute of Convergence Signal Processing
    • /
    • 2000.12a
    • /
    • pp.93-96
    • /
    • 2000
  • In this paper, we suggest a feature points selection method using block-based watershed segmentation and polygon approximation for preprocessing of MPEG-4 mesh generation. 2D natural image is segmented by 8$\times$8 or 4$\times$4 block classification method and watershed algorithm. As this result, pixels on the watershed lines represent scene's interior feature and this lines are shapes of closed contour. Continuous pixels on the watershed lines are selected out feature points using Polygon approximation and post processing.

  • PDF

A Fast IFFT Algorithm for IMDCT of AAC Decoder (AAC 디코더의 IMDCT를 위한 고속 IFFT 알고리즘)

  • Chi, Hua-Jun;Kim, Tae-Hoon;Park, Ju-Sung
    • The Journal of the Acoustical Society of Korea
    • /
    • v.26 no.5
    • /
    • pp.214-219
    • /
    • 2007
  • This paper proposes a new IFFT(Inverse Fast Fourier Transform) algorithm, which is proper for IMDCT(Inverse Modified Discrete Cosine Transform) of MPEG-2 AAC(Advanced Audio Coding) decoder. The $2^n$(N-point) type IMDCT is the most powerful among many IMDCT algorithms, however it includes IFFT that requires many calculation cycles. The IFFT used in $2^n$(N-point) type IMDCT employ the bit-reverse data arrangement of inputs and N/4-point complex IFFT to reduce the calculation cycles. We devised a new data arrangement method of IFFT input and $N/4^{n+1}$-type IFFT and thus we can reduce multiplication cycles, addition cycles, and ROM size.

Development of Multimedia Annotation and Retrieval System using MPEG-7 based Semantic Metadata Model (MPEG-7 기반 의미적 메타데이터 모델을 이용한 멀티미디어 주석 및 검색 시스템의 개발)

  • An, Hyoung-Geun;Koh, Jae-Jin
    • The KIPS Transactions:PartD
    • /
    • v.14D no.6
    • /
    • pp.573-584
    • /
    • 2007
  • As multimedia information recently increases fast, various types of retrieval of multimedia data are becoming issues of great importance. For the efficient multimedia data processing, semantics based retrieval techniques are required that can extract the meaning contents of multimedia data. Existing retrieval methods of multimedia data are annotation-based retrieval, feature-based retrieval and annotation and feature integration based retrieval. These systems take annotator a lot of efforts and time and we should perform complicated calculation for feature extraction. In addition. created data have shortcomings that we should go through static search that do not change. Also, user-friendly and semantic searching techniques are not supported. This paper proposes to develop S-MARS(Semantic Metadata-based Multimedia Annotation and Retrieval System) which can represent and extract multimedia data efficiently using MPEG-7. The system provides a graphical user interface for annotating, searching, and browsing multimedia data. It is implemented on the basis of the semantic metadata model to represent multimedia information. The semantic metadata about multimedia data is organized on the basis of multimedia description schema using XML schema that basically comply with the MPEG-7 standard. In conclusion. the proposed scheme can be easily implemented on any multimedia platforms supporting XML technology. It can be utilized to enable efficient semantic metadata sharing between systems, and it will contribute to improving the retrieval correctness and the user's satisfaction on embedding based multimedia retrieval algorithm method.

A Real-time SoC Design of Foreground Object Segmentation (Foreground 객체 추출을 위한 실시간 SoC 설계)

  • Kim Ji-Su;Lee Tae-Ho;Lee Hyuk-Jae
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.43 no.9 s.351
    • /
    • pp.44-52
    • /
    • 2006
  • Recently developed MPEG-4 Part 2 compression standard provides a novel capability to handle arbitrary video objects. To support this capability, an efficient object segmentation technique is required. This paper proposes a real-time algorithm for foreground object segmentation in video sequences. The proposed algorithm consists of two steps: the first step that segments a video frame into multiple sub-regions using Spatio-Temporal Watershed Transform and the second step in which a foreground object segment is extracted from the sub-regions generated in the first step. For real-time processing, the algorithm is partitioned into hardware and software parts so that computationally expensive parts are off-loaded from a processor and executed by hardware accelerators. Simulation results show that the proposed implementation can handle QCIF-size video at 15 fps and extracts an accurate foreground object.

DTV Interactive Advertisement Authoring Tool Using Sketch Input and Evaluation Function (사용자 스케치 입력과 평가 함수를 이용한 디지털방송용 양방향광고 생성 도구)

  • Park, Tae-Jin;Choy, Yoon-Chul
    • Journal of KIISE:Software and Applications
    • /
    • v.37 no.1
    • /
    • pp.39-50
    • /
    • 2010
  • Interactive broadcasting service using wired/wireless Internet return channel has strong ripple effect. It allows the audiences to participate actively to the program they are watching, and communicating. This paper develops an authoring tool that makes an object-formed interactive advertisement from extracted areas of the advertising object the user specified in TV programs. In the authoring tool, the advertisement producer specifies the target object subjectively and the selected object keeps moving here and there repeatedly. Therefore, it is hard to make an object-formed interactive advertisement with existing tools. This paper suggests sketch-based interface technique for extracting advertising objects, and also provide evaluation functions to correct any sketch error. This paper also converts the area of object into MPEG-4 BIFS codes for authoring the object-formed interactive advertisement.

A Study on the Performance Improvement of Image Segmentation by Selective Application of Structuring Element in MPEG-4 (MPEG-4 기반 영상 분할에서 구조요소의 선택적 적용에 의한 분할성능 개선에 관한 연구)

  • 이완범;김환용
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.41 no.5
    • /
    • pp.165-173
    • /
    • 2004
  • Since the conventional image segmentation methods using mathematical morphology tend to yield over-segmented results, they normally need postprocess which merges small regions to obtain larger ones. To solve this over-segmentation problem without postprocess had to increase size of structuring element used marker extraction. As size of structuring element is very large, edge of region segments incorrectly. Therefore, this paper selectively applies structuring element of mathematical morphology to improve performance of image segmentation and classifies input image into texture region, edge region and simple region using averaged local variance and image gradient. Proposed image segmentation method removes the cause for over-segmentation of image as selectively applies size of structuring element to each region. Simulation results show that proposed method correctly segment for pixel region of similar luminance value and more correctly search texture region and edge region than conventional methods.

An Efficient Transmission Method of Panoramic Multimedia Contents in a Limited Bandwidth Environment (제한적 네트워크 환경 하에서 효율적인 파노라마식 멀티미디어 콘텐츠 분할 전송 방법)

  • Kim, Byung-Chul;Lee, Gun-Hee;Lee, In-Jae;Kim, Kyu-Heon
    • Journal of Broadcast Engineering
    • /
    • v.16 no.5
    • /
    • pp.811-823
    • /
    • 2011
  • This paper proposes an efficient transmission method for the panoramic multimedia contents. The panoramic video provides wide sight and various view-point to the user. The traditional methods of the panoramic multimedia content transmission has several limitations, as follow; A client suffers a long initial delay time to play a panoramic video when it is transmitted through a limited bandwidth network, because the panoramic video has larger data size than a general video. And if a client's display device has limited resolution, such as mobile phone, laptop PC monitor, etc. it can not display the entire panoramic video that has a wide view video sequence. So, in order to overcome the obstacles, this paper proposes an efficient transmission of panoramic multimedia contents. This method will increase the transmission efficiency throughout the technique of the scene description in MPEG-4 system. Also we demonstrated the efficiency of the proposed method by comparison with existing methods.

Effective segmentation of non-rigid object in a still picture and video sequences (정지영상/동영상에서 non-rigid object의 효율적인 영역 분할 방식에 관한 연구)

  • Lee, In-Jae;Kim, Yong-Ho;Kim, Jung-Gyu;Lee, Myeong-Ho;An, Chi-Deuk
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.39 no.1
    • /
    • pp.17-31
    • /
    • 2002
  • The new MPEG-4 video coding standard enables content-based functionalities. Image segmentation is an indispensable process for it. This paper addresses an effective segmentation of non-rigid objects. Non-rigid objects are deformable objects with fuzzy, blurred and indefinite boundaries. So it is difficult to segment deformable objects precisely. In order to solve this problem, we propose an effective segmentation of non-rigid objects using watershed algorithms in still pictures. And we propose an automatic segmentation through intra-frame and inter-frame segmentation process in video sequences. Automatic segmentation preforms boundary-based and region-based segmentation to extract precise object boundaries.