• Title/Summary/Keyword: multimedia transcoding

Search Result 74, Processing Time 0.028 seconds

An Effective P-Frame Transcoding from H.264 to MPEG-2 (H.264 to MPEG-2 Transcoding을 위한 효율적인 P-Frame 변환 방법)

  • Kim, Gi-Hong;Son, Nam-Rye;Lee, Guee-Sang
    • The KIPS Transactions:PartB
    • /
    • v.17B no.1
    • /
    • pp.31-36
    • /
    • 2010
  • After the launch of MPEG-2, it is widely used in multimedia applications like a Digital-TV or a DVD. Then, After the launch of H.264 at 2004, it has been expected to replace MPEG-2 and services IPTV and DMB. As we have been used to MPEG-2 devices by this time, we can not access H.264 Broadcast with MPEG-2 device. So We propose a new approach to transcode H.264 video into MPEG-2 form which can facilitate to display H.264 video with MPEG-2 device. To reduce the quality loss by transcoding, we use CPDT(Cascaded Pixel Domain Transcoder) structure. And to minimize processing time, SKIP block, INTRA block and motion vectors obtain from decoding process is employed for transcoding. we use BMA(Boundary Matching Algorithm) to select only one from candidate motion vectors. Experimental results show a considerable improved PSNR with reduction in processing time compared with existing methods.

H.264/AVC to MPEG-2 Video Transcoding by using Motion Vector Clustering (움직임벡터 군집화를 이용한 H.264/AVC에서 MPEG-2로의 비디오 트랜스코딩)

  • Shin, Yoon-Jeong;Son, Nam-Rye;Nguyen, Dinh Toan;Lee, Guee-Sang
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.5 no.1
    • /
    • pp.23-30
    • /
    • 2010
  • The H.264/AVC is increasingly used in broadcast video applications such as Internet Protocol television (IPTV), digital multimedia broadcasting (DMB) because of high compression performance. But the H.264/AVC coded video can be delivered to the widespread end-user equipment for MPEG-2 after transcoding between this video standards. This paper suggests a new transcoding algorithm for H.264/AVC to MPEG-2 transcoder that uses motion vector clustering in order to reduce the complexity without loss of video quality. The proposed method is exploiting the motion information gathered during h.264 decoding stage. To reduce the search space for the MPEG-2 motion estimation, the predictive motion vector is selected with a least distortion of the candidated motion vectors. These candidate motion vectors are considering the correlation of direction and distance of motion vectors of variable blocks in H.264/AVC. And then the best predictive motion vector is refined with full-search in ${\pm}2$ pixel search area. Compared with a cascaded decoder-encoder, the proposed transcoder achieves computational complexity savings up to 64% with a similar PSNR at the constant bitrate(CBR).

Semantic Concept-based Video Transcoding Method and System (의미적 개념 기반 비디오 트랜스코딩 방법 및 시스템)

  • Jung Yong Ju;Kim Young Suk;Thang Truong Cong;Ro Yong Man;Kim Tae-hee;Kim Jea-Gon
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2004.11a
    • /
    • pp.59-63
    • /
    • 2004
  • 본 논문에서는 다양한 사용자 환경에서 비디오의 범용적인 서비스를 위한 다차원 비디오 트랜스코딩의 판단에 관하여 논한다 효율적인 판단을 위해 여러 영화 비디오 클립들을 비슷한 의미적 개념을 가지는 비디오들과 비슷한 장면 복잡도를 가지는 비디오들로 분류하고, 각 종류별로 주관적인 테스트(subjective test)를 실시하여 비디오 트랜스코딩에 있어서 사용자인지(perception)의 특성을 분석한다. 이렇게 분석된 인간의 시각 특성들을 이용해 비디오 트랜스코딩 판단 궤적(trajectory)을 만들고 이를 다차원 비디오 트랜스코딩 판단 시에 적용하기 위한 방법을 제안한다.

  • PDF

DCT-domain MPEG-2/H.264 Video Transcoder System Architecture for DMB Services (DMB 서비스를 위한 DCT 기반 MPEG-2/H.264 비디오 트랜스코더 시스템 구조)

  • Lee Joo-Kyong;Kwon Soon-Young;Park Seong-Ho;Kim Young-Ju;Chung Ki-Dong
    • The KIPS Transactions:PartB
    • /
    • v.12B no.6 s.102
    • /
    • pp.637-646
    • /
    • 2005
  • Most of the multimedia contents for DBM services art provided as MPEG-2 bit streams. However, they have to be transcoded to H.264 bit streams for practical services because the standard video codec for DMB is H.264. The existing transcoder architecture is Cascaded Pixel-Domain Transcoding Architecture, which consists of the MPEG-2 dacoding phase and the H.264 encoding phase. This architecture can be easily implemented using MPEG-2 decoder and H.264 encoder without source modifying. However. It has disadvantages in transcoding time and DCT-mismatch problem. In this paper, we propose two kinds of transcoder architecture, DCT-OPEN and DCT-CLOSED, to complement the CPDT architecture. Although DCT-OPEN has lower PSNR than CPDT due to drift problem, it is efficient for real-time transcoding. On the contrary, the DCT-CLOSED architecture has the advantage of PSNR over CPDT at the cost of transcoding time.

An Optimal Adaptation Framework for Transmission of Multiple Visual Objects (다중 시각 객체 전송을 위한 최적화 적응 프래임워크)

  • Lim, Jeong-Yeon;Kim, Mun-Churl
    • Journal of KIISE:Software and Applications
    • /
    • v.35 no.4
    • /
    • pp.207-218
    • /
    • 2008
  • With the growth of the Internet, multimedia streaming becomes an important means to deliver video contents over the Internet and the amount of the streaming multimedia contents is also getting increased. However, it becomes difficult to guarantee the quality of service in real-time over the IP network environment with instantaneously varying bandwidth. In this paper, we propose an optimal adaptation framework for streaming contents over the Internet in the sense that the perceptual quality of the multi-angie content with multiple visual objects is maximized given the constraints such as available bandwidth and transcoding cost. In the multi-angle video service framework, the user can select his/her preferred alternate views among the given multiple video streams captured at different view angles for a same event. This enhanced experience often entails streaming problems in real-time over the network, such as instantaneous bandwidth changes in the Internet. In order to cope with this problem, we assume that multi-angle video contents are encoded at different bitrates and the appropriate video streams are then selected or transcoded for delivery to meet such bandwidth constraints. For the user selective consumption of the various bitstreams in the multi-angle video service, the bitstream in each angle can be encoded in various bitrate, and the user can select a sub-bitrstream in the given bitrstreams or transcode the corresponding content in order to deliver the optimally adapted video contents to the instantaneously changing network condition. Therefore, we define the transcoding cost which means the time taken for transcoding the video stream and formulate a unified optimization framework which maximizes the perceptual quality of the multiple video objects in the given constraints such as the transcoding cost and the network bandwidth. Finally, we present plenty of the experimental results to show the effectiveness of the proposed method.

Video Watermarking Scheme for Scalable Video Coding using ROI

  • Yoon, Ji-Sun;Kwon, Seong-Geun;Lee, Suk-Hwan;Song, Yoon-Chul;Kim, Min-Hwan;Kwon, Ki-Ryong
    • Journal of Korea Multimedia Society
    • /
    • v.11 no.6
    • /
    • pp.796-806
    • /
    • 2008
  • This paper presents a blind video watermarking algorithm that has the robustness against spatial, temporal, and SNR scalability and transcoding for the copyright protection of video contents in heterogeneous multimedia service. The proposed process of watermark embedding and detecting is accomplished on base layer for considering spatial scalability. The watermark consists of the string and the ordering number of string for considering temporal scalability. Thus, each of frames has the bitstream of one character and a ordering number of its character. To robust against FGS, the proposed algorithm quantizes low and middle frequency coefficients in ROI region of each of frames and embeds its watermark bitstream into the specific bits of the quantized coefficients. Experimental results verified that the proposed algorithm satisfies the invisibility of watermark and also has the robustness against spatial scalability, temporal scalability and FGS.

  • PDF

Adaptive Motion Vector Resampling Method for Efficient Resizing Transcoding (효율적인 크기조절 트랜스코딩을 위한 적응적 움직임 벡터 재산출 방법)

  • Lee, Kyu-Chan;Kim, Seong-Hoon;Oh, Seoung-Jun;Park, Ho-Chong;Ahn, Chang-Beom;Seo, Jeong-Il
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2005.11a
    • /
    • pp.169-172
    • /
    • 2005
  • 크기조절 트랜스코딩에서 움직임 벡터 재 예측 과정은 많은 연산량을 필요로 하기 때문에, 실시간 처리를 위해서는 이 과정의 연산량을 줄이는 것이 필요하다. 본 논문에서는 여러 영상에 대해 예측 움직임 벡터를 산출하는 방법을 적응적으로 수행함으로써, 기존 방법에 비해 화질열화 없이 연산량을 줄이는 방법을 제안한다. 전체 움직임의 크기와 움직임 벡터들의 균일성(homogeneity)을 이용하여 움직임이 작을 때는 움직임 벡터 재산출 과정 없이 예측 움직임 벡터 성분을 0으로, 움직임이 크면 움직임 벡터들의 균일성의 정도에 따라 평균값 또는 중간값을 예측 움직임 벡터 성분으로 적응적으로 선택하였다. 그리고 좀 더 효율적인 움직임 벡터 수행을 위해 제안된 과정을 수평, 수직 성분에 각각 따로 적용하였다. 가중치를 부여하여 평균값을 취하는 가중평균 방법과 비효 실험한 결과, 같은 PSNR을 유지하는 조건에서 움직임 벡터 재산출 과정의 덧셈과 곱셈 연산의 수가 평균적으로 각각 96%, 42% 정도 감소하였다.

  • PDF

Performance Enhancement of Scaling Filter and Transcoder using CUDA (CUDA를 활용한 스케일링 필터 및 트랜스코더의 성능향상)

  • Han, Jae-Geun;Ko, Young-Sub;Suh, Sung-Han;Ha, Soon-Hoi
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.16 no.4
    • /
    • pp.507-511
    • /
    • 2010
  • In this paper, we propose to enhance the performance of software transcoder by using GPGPU for scaling filters. Video transcoding is a technique that translates a video file to another video file that has a different coding algorithm and/or a different frame size. Its demand increases as more multimedia devices with different specification coexist in our daily life. Since transcoding is computationally intensive, a software transcoder that runs on a CPU takes long processing time. In this paper, we achieve significant speed-up by parallelizing the scaling filter using a GPGPU that can provide significantly large computation power. Through extensive experiments with various video scripts of different size and with various scaling filter options, it is verified that the enhanced transcoder could achieve 36% performance improvement in the default option, and up to 101% in a certain option.

MHP-based SCORM Contents Trans-Coding System for DiTV Service (DiTV 서비스를 위한 MHP 기반의 SCORM 콘텐츠 트랜스코딩 시스템)

  • Im, Seung-Hyun;Lee, Si-Hwa;Hwang, Dae-Hoon
    • Journal of Korea Multimedia Society
    • /
    • v.10 no.5
    • /
    • pp.642-651
    • /
    • 2007
  • Recently, digital convergence, whose core demand is OSMU (One Sourse Multi Use),has been the main topic in e-learning domain and industry. However, the existing web learning content and the new resource developed toprovide contents to different learning environment must be processed to adapt the new learning settings, which causes the cost and time problem, So in this paper we design and implement a Java based SCORM content transcoding system which can transcode the SCORM-based learning content into MHP-based DiTV content in order to adapt t-learning environment using DiTV, which is closer to our real life. Using this system which has ability of inter-operation, reuse, highly-use, the problem mentioned above can be solved well. Moreover, it is possible for a learner who is not familiar with computer to study using DiTV instead of PC.

  • PDF

XML Document Transcoding using Dynamic Profile and Annotation (동적 프로파일과 어노테이션을 이용한 XML 문서 트랜스코딩)

  • 정쌍용;손원성;이진상;임순범;최윤철
    • Proceedings of the Korea Multimedia Society Conference
    • /
    • 2003.11b
    • /
    • pp.1023-1026
    • /
    • 2003
  • 현재 유선에서 지원되는 웹 컨텐츠를 개인용 단말기에서 지원하기에는 단말기의 성능상 한계(screen size, memory size, bandwidth 등) 때문에 여러 가지 문제가 있다. 트랜스코딩이란 이러한 기존 유선 환경에서 제공되는 웹 컨텐츠를 특정 환경에 적합한 형태로 변환 하는 것을 의미한다. 그러나 이와 관련된 기존 연구에서는 사용자가 요구하는 사항만을 변환 하거나 서비스 제공자가 일방적으로 변환하여 웹 컨텐츠를 제공하고 있어 이슈변화에 따른 사용자의 대처능력이 떨어지고 사용자의 사용성이 저하되며, 사용자에게 무의미한 정보 제공의 가능성이 있다. 이러한 문제점들을 해결하기 위해 본 논문에서는 멀티미디어 뉴스 제작을 위한 표준인 NewsML을 대상으로 사용자의 동적 프로파일과 서비스제공자의 어노테이션을 이용하여 사용자가 요구하는 기사와 서비스 제공자가 제공하는 기사를 같이 변환하는 기법을 제안한다. 본 논문의 결과 갑자기 발생하는 사회적 이슈변화에 따른 사용자의 대처능력이 향상 되고 사용자가 불필요한 정보에 과다하게 노출되는 것을 막을 수 있다.

  • PDF