• Title/Summary/Keyword: Video Parsing

Generation of Video Clips Utilizing Shot Boundary Detection (샷 경계 검출을 이용한 영상 클립 생성)

  • Kim, Hyeok-Man;Cho, Seong-Kil
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.7 no.6
    • /
    • pp.582-592
    • /
    • 2001
  • Video indexing plays an important role in applications such as digital video libraries and web VOD services, which archive large volumes of digital video. Video indexing is usually based on video segmentation. In this paper, we propose a software tool called V2Web Studio, which generates video clips using a shot boundary detection algorithm. With V2Web Studio, clip generation consists of the following four steps: 1) automatic detection of shot boundaries by parsing the video, 2) elimination of errors by manually verifying the detection results, 3) building a logical hierarchy model from the verified shots, and 4) generating multiple video clips corresponding to each logically modeled segment. These steps are performed by the shot detector, shot verifier, video modeler, and clip generator of V2Web Studio, respectively.
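
The abstract does not say which detection algorithm the shot detector uses. As an illustration of step 1 only, the following minimal sketch assumes one common approach: thresholding the difference between gray-level histograms of consecutive frames. The function names, the bin count, and the threshold are hypothetical and are not taken from V2Web Studio.

```python
def gray_histogram(frame, bins=16):
    """Normalized histogram of a frame given as a flat list of 0-255 intensities."""
    hist = [0] * bins
    for pixel in frame:
        hist[pixel * bins // 256] += 1
    total = len(frame)
    return [count / total for count in hist]


def detect_shot_boundaries(frames, threshold=0.5, bins=16):
    """Return every index i where a cut is assumed between frame i-1 and frame i.

    The threshold is an illustrative value, not one reported in the paper.
    """
    boundaries = []
    prev = None
    for i, frame in enumerate(frames):
        hist = gray_histogram(frame, bins)
        if prev is not None:
            # L1 distance between two normalized histograms lies in [0, 2].
            diff = sum(abs(a - b) for a, b in zip(hist, prev))
            if diff > threshold:
                boundaries.append(i)
        prev = hist
    return boundaries


# Toy input: three dark frames followed by two bright frames.
dark = [10] * 64
bright = [240] * 64
frames = [dark, dark, dark, bright, bright]
print(detect_shot_boundaries(frames))  # prints [3]
```

A detector of this kind produces false positives on flashes and misses gradual transitions, which is consistent with the manual verification step the abstract lists next.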

Video Quality Metric Using One-Dimensional Histograms of Motion Vectors (움직임 벡터의 1차원 히스토그램을 이용한 비디오 화질 평가 척도)

  • Han, Ho-Sung;Kim, Dong-O;Park, Bae-Hong;Sim, Dong-Gyu
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.2
    • /
    • pp.21-28
    • /
    • 2008
  • This paper proposes a novel reduced-reference method for video quality assessment in which one-dimensional (1-D) histograms of motion vectors (MVs) are used as features of the video. The proposed method requires less computation time than conventional methods because the quality metric decodes MVs directly from the video stream during parsing instead of reconstructing the distorted video at the receiver. It is also efficient in terms of data size, because the sender transmits only 1-D histograms of MVs accumulated over the whole input video sequence; this differs from conventional methods, which assess each image independently. To measure the similarity between histograms, we use the histogram intersection and histogram difference methods. We compare the proposed method with conventional methods on 52 video clips coded under varying bit rates, image sizes, and frame rates. Experimental results show that the proposed method is more efficient than the conventional methods and that its scores agree more closely with the mean opinion score (MOS) than those of conventional algorithms.
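
The abstract names histogram intersection and histogram difference but gives no formulas. The sketch below assumes the usual definitions on normalized histograms (sum of bin-wise minima, and sum of bin-wise absolute differences). The clamping range `max_abs` and all function names are hypothetical, and only the horizontal MV component is compared in the example.

```python
def mv_histograms(motion_vectors, max_abs=4):
    """Build separate 1-D histograms of the x and y MV components.

    Components are clamped to [-max_abs, max_abs]; this range is an assumption.
    """
    size = 2 * max_abs + 1
    hist_x, hist_y = [0] * size, [0] * size
    for dx, dy in motion_vectors:
        hist_x[max(-max_abs, min(max_abs, dx)) + max_abs] += 1
        hist_y[max(-max_abs, min(max_abs, dy)) + max_abs] += 1
    return hist_x, hist_y


def normalize(hist):
    total = sum(hist)
    return [h / total for h in hist] if total else list(hist)


def histogram_intersection(h1, h2):
    """1.0 for identical normalized histograms, 0.0 for disjoint ones."""
    return sum(min(a, b) for a, b in zip(normalize(h1), normalize(h2)))


def histogram_difference(h1, h2):
    """0.0 for identical normalized histograms, 2.0 for disjoint ones."""
    return sum(abs(a - b) for a, b in zip(normalize(h1), normalize(h2)))


# Toy MVs accumulated over a reference and a distorted sequence.
reference = [(0, 0), (1, 0), (1, 1), (2, -1)]
distorted = [(0, 0), (1, 0), (0, 1), (2, -1)]
ref_x, _ = mv_histograms(reference)
dis_x, _ = mv_histograms(distorted)
print(histogram_intersection(ref_x, dis_x))  # prints 0.75
print(histogram_difference(ref_x, dis_x))    # prints 0.5
```

Because only the histograms are compared, the sender needs to transmit a few dozen numbers per sequence rather than any pixel data, which is the data-size advantage the abstract describes.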

A Practical RTP Packetization Scheme for SVC Video Transport over IP Networks

  • Seo, Kwang-Deok;Kim, Jin-Soo;Jung, Soon-Heung;Yoo, Jeong-Ju
    • ETRI Journal
    • /
    • v.32 no.2
    • /
    • pp.281-291
    • /
    • 2010
  • Scalable video coding (SVC) has been standardized as an extension of the H.264/AVC standard. This paper proposes a practical real-time transport protocol (RTP) packetization scheme to transport SVC video over IP networks. In the combined scalability of SVC, a coded picture of a base or scalable enhancement layer is produced as one or more video layers consisting of network abstraction layer (NAL) units. The SVC NAL unit header contains a (DID, TID, QID) field that identifies the scalable enhancement layer to which each SVC NAL unit belongs, without parsing the payload of the NAL unit. In this paper, we use the (DID, TID, QID) information to derive the hierarchical spatio-temporal relationship among the SVC NAL units. Based on this derivation, we propose a practical RTP packetization scheme for generating single RTP sessions in unicast and multicast transport of SVC video. The experimental results indicate that the proposed packetization scheme can be efficiently applied to transport SVC video over IP networks with little induced delay, jitter, and computational load.
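
To illustrate how (DID, TID, QID) can be read without touching the payload, here is a minimal sketch that assumes the three-byte NAL unit header extension layout of H.264 Annex G (NAL unit types 14 and 20). It covers only the header read, not the paper's packetization scheme; the function and class names are hypothetical.

```python
from typing import NamedTuple


class SvcLayerId(NamedTuple):
    dependency_id: int  # DID: spatial / coarse-grain layer
    temporal_id: int    # TID: temporal layer
    quality_id: int     # QID: quality layer


# NAL unit types that carry the SVC header extension:
# 14 = prefix NAL unit, 20 = coded slice in scalable extension.
SVC_NAL_TYPES = (14, 20)


def parse_svc_layer_id(nal: bytes) -> SvcLayerId:
    """Read (DID, TID, QID) from the first four bytes of an SVC NAL unit."""
    if len(nal) < 4:
        raise ValueError("SVC NAL unit header needs at least 4 bytes")
    nal_unit_type = nal[0] & 0x1F
    if nal_unit_type not in SVC_NAL_TYPES:
        raise ValueError(f"NAL unit type {nal_unit_type} has no SVC extension")
    if not nal[1] & 0x80:
        raise ValueError("svc_extension_flag is not set")
    # Byte 2: no_inter_layer_pred_flag(1) | dependency_id(3) | quality_id(4)
    dependency_id = (nal[2] >> 4) & 0x07
    quality_id = nal[2] & 0x0F
    # Byte 3: temporal_id(3) | use_ref_base_pic(1) | discardable(1) | output(1) | reserved(2)
    temporal_id = (nal[3] >> 5) & 0x07
    return SvcLayerId(dependency_id, temporal_id, quality_id)


# Hand-built header: type 20, DID=1, QID=2, TID=2.
header = bytes([0x74, 0x80, 0x12, 0x47])
print(parse_svc_layer_id(header))
```

Since the three identifiers sit at fixed bit positions, a packetizer or a network node can sort or drop NAL units by layer with a few shifts and masks.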

Implementation of AR Remote Rendering Techniques for Real-time Volumetric 3D Video

  • Lee, Daehyeon;Lee, Munyong;Lee, Sang-ha;Lee, Jaehyun;Kwon, Soonchul
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.12 no.2
    • /
    • pp.90-97
    • /
    • 2020
  • Recently, with the growth of mixed reality industrial infrastructure, related convergence research has been proposed. Real-time mixed reality services such as remote video conferencing require research on real-time acquisition, processing, and transfer methods. This paper implements an AR remote rendering method for volumetric 3D video data. We propose and implement two modules: a module that parses the volumetric 3D video into a game engine, and a server rendering module. In the experiment, volumetric 3D video sequence data of about 15 MB was compressed by 6-7%, and the remote module was streamed at 27 fps at a resolution of 1200 by 1200. The results of this paper are expected to be applicable to AR cloud services.

Semantic Scenes Classification of Sports News Video for Sports Genre Analysis (스포츠 장르 분석을 위한 스포츠 뉴스 비디오의 의미적 장면 분류)

  • Song, Mi-Young
    • Journal of Korea Multimedia Society
    • /
    • v.10 no.5
    • /
    • pp.559-568
    • /
    • 2007
  • Anchor-person scene detection is important for semantic parsing of video shots and for extracting indexing clues in content-based news video indexing and retrieval systems. This paper proposes an efficient algorithm that extracts the anchor ranges in sports news video in order to structure sports news into units. To detect anchor-person scenes, candidate anchor-person scenes are first selected using DCT coefficients and motion vector information in the MPEG-4 compressed video. Then, image processing is applied to the candidate scenes to classify the news video into anchor-person scenes and non-anchor (sports) scenes. The proposed scheme achieves a mean precision and recall of 98% in the anchor-person scene detection experiment.

Network-Adaptive HD Video Streaming with Cross-Layered WLAN Channel Monitoring (Cross Layer 기반의 무선랜 채널 모니터링을 적용한 네트워크 적응형 HD 비디오 스트리밍)

  • Park Sang-Hoon;Yoon Ha-Young;Kim Jong-Won;Cho Chang-Sik
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.31 no.4A
    • /
    • pp.421-430
    • /
    • 2006
  • In this paper, we propose a practical implementation of network-adaptive HD (high definition) MPEG-2 video streaming with cross-layered channel monitoring (CLM) over an IEEE 802.11a WLAN (wireless local area network). For wireless channel monitoring, the AP (access point) periodically measures MAC (medium access control) layer transmission information and sends the monitoring information to a streaming server. This allows the streaming server to react to fluctuations in the wireless channel more quickly and efficiently than the end-to-end monitoring (E2EM) scheme for video adaptation. The streaming server dynamically performs priority-based frame dropping to adjust the video sending rate according to the measured wireless channel condition. For this purpose, our streaming system provides frame-based prioritized packetization using a real-time stream parsing module. Various evaluation results over an IEEE 802.11a WLAN testbed are provided to verify the intended QoS adaptation capability. The experimental results show that the proposed system can effectively mitigate the quality degradation of video streaming caused by fluctuations in the time-varying wireless channel condition.
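
The abstract does not give the dropping rule. The sketch below assumes the conventional MPEG-2 priority order (I frames first, then P, then B) and a per-GOP byte budget derived from the measured channel condition. The function name, the budget parameter, and the frame sizes are hypothetical.

```python
# Lower number = higher priority. This ordering is an assumption based on
# MPEG-2 frame dependencies, not a rule quoted from the paper.
PRIORITY = {"I": 0, "P": 1, "B": 2}


def select_frames(gop, budget_bytes):
    """Return the indices of frames to send for one GOP.

    gop is a list of (frame_type, size_in_bytes) in display order.
    Frames are admitted in priority order until the budget is exhausted;
    stopping at the first frame that does not fit keeps the P-frame
    reference chain unbroken.
    """
    order = sorted(range(len(gop)), key=lambda i: (PRIORITY[gop[i][0]], i))
    kept = set()
    used = 0
    for i in order:
        size = gop[i][1]
        if used + size > budget_bytes:
            break  # drop this frame and everything of lower priority
        kept.add(i)
        used += size
    return sorted(kept)


gop = [("I", 100), ("B", 20), ("B", 20), ("P", 50),
       ("B", 20), ("B", 20), ("P", 50)]
print(select_frames(gop, 230))  # prints [0, 1, 3, 6]
print(select_frames(gop, 160))  # prints [0, 3]
```

As the budget shrinks, B frames disappear first and the later P frame follows, so the sending rate drops in steps while every transmitted frame remains decodable.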

Parallel Parsing of MPEG Video in Heterogeneous Distributed Environment (이질 분산환경에서 MPEG 비디오의 병렬 파싱)

  • 남윤영;황인준
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2003.04a
    • /
    • pp.91-93
    • /
    • 2003
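  • The growing use of digital video has created a need for services such as video indexing, browsing, and retrieval. Among the steps required to build such services efficiently, video parsing is an important part of multimedia applications. Because video parsing generally involves complex computation, the traditional approach of parsing on a single computer takes a long time. This problem can be addressed by using parallel and distributed computing to reduce the time required for parsing. In this paper, we propose three parallel scheduling algorithms for MPEG video that use multiple computers in a heterogeneous distributed environment. In the performance evaluation, we experimentally compared the speedup and load balancing of the proposed scheduling algorithms and found that they yield a very good performance improvement.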
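
The abstract does not describe the three proposed scheduling algorithms. As a generic illustration of scheduling on nodes of unequal speed, the sketch below uses greedy earliest-finish-time assignment, which is a swapped-in textbook technique and not one of the paper's algorithms. Chunk costs, node speeds, and the speedup baseline (the fastest single node) are assumptions.

```python
def schedule(chunk_costs, node_speeds):
    """Assign each video chunk to the node that would finish it earliest.

    chunk_costs: parsing cost of each chunk in abstract work units.
    node_speeds: work units per second for each node.
    Returns (assignment, finish_times), where assignment[i] is the node
    index chosen for chunk i.
    """
    finish = [0.0] * len(node_speeds)
    assignment = []
    for cost in chunk_costs:
        node = min(range(len(node_speeds)),
                   key=lambda n: finish[n] + cost / node_speeds[n])
        finish[node] += cost / node_speeds[node]
        assignment.append(node)
    return assignment, finish


def speedup(chunk_costs, node_speeds):
    """Speedup relative to running every chunk on the fastest single node."""
    _, finish = schedule(chunk_costs, node_speeds)
    serial = sum(chunk_costs) / max(node_speeds)
    return serial / max(finish)


chunks = [4, 4, 4, 4, 4, 4]   # e.g. six GOPs of equal cost
speeds = [2.0, 1.0]           # one fast node, one slow node
assignment, finish = schedule(chunks, speeds)
print(assignment)               # prints [0, 0, 1, 0, 0, 1]
print(finish)                   # prints [8.0, 8.0]
print(speedup(chunks, speeds))  # prints 1.5
```

Equal finish times on both nodes indicate a balanced load, and the speedup of 1.5 matches the ratio of total speed (3.0) to the fastest node's speed (2.0), the best this pair of nodes can achieve.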

Design of Video Analytics Information Management System for Optimizing the Storage Load (스토리지 부하 최적화를 위한 영상분석정보 관리시스템 설계)

  • Shim, Jae-Sung;Park, Seok-Cheon
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2014.04a
    • /
    • pp.941-942
    • /
    • 2014
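  • In existing intelligent video surveillance systems, video analysis generates an unnecessarily large amount of XML data (30 XML records per second), which increases the storage and management load. To reduce this load, this paper designs a video analytics information management system for optimizing the storage load. The system parses and filters the analyzed XML data before storing it, which reduces the amount of data generated and enables efficient management.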
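
The abstract does not describe the XML schema or the filtering rule. The following minimal sketch assumes a hypothetical per-frame record with `object` elements and keeps only the records that report an event; element names, attribute names, and the rule itself are illustrative.

```python
import xml.etree.ElementTree as ET


def filter_records(xml_docs):
    """Parse per-frame XML records and keep only those containing an event.

    The schema (<frame time=...><object id=... event=.../></frame>) is
    hypothetical; records without any event attribute are discarded.
    """
    kept = []
    for doc in xml_docs:
        root = ET.fromstring(doc)
        events = [dict(obj.attrib) for obj in root.iter("object")
                  if obj.get("event")]
        if events:
            kept.append({"time": root.get("time"), "events": events})
    return kept


records = [
    '<frame time="0.000"><object id="1"/></frame>',
    '<frame time="0.033"><object id="1" event="intrusion"/></frame>',
    '<frame time="0.066"></frame>',
]
stored = filter_records(records)
print(len(stored))         # prints 1
print(stored[0]["time"])   # prints 0.033
```

With a rule of this kind, frames in which nothing happens never reach storage, so the stored volume follows the number of events rather than the frame rate.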

Design and Implementation of the Feature Information Parsing System for Video Image (동영상 이미지의 특징정보 분석 시스템 설계 및 구현)

  • 최내원;지정규
    • Journal of the Korea Society of Computer and Information
    • /
    • v.7 no.3
    • /
    • pp.1-8
    • /
    • 2002
  • Owing to the rapid development of computer application technologies, video is now used more widely than ever in many areas. Current information analysis systems are basically built to process text-based data. As a result, they run into problems when they need to correctly represent the ambiguity of a video, when they have to process a large amount of comments, or when they lack the objectivity that the task requires. We propose a method that can analyze a large amount of video efficiently. To extract color information, we convert colors from RGB to HSI and use the information that matches the representative colors. To extract shape information, we use improved moment invariants (IMI), which resolve many of the problems of histogram intersection.
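
The RGB-to-HSI step can be sketched with the standard textbook conversion. This is the common formulation, assumed here because the abstract does not give the paper's exact variant; matching against representative colors is not shown, and the function name is hypothetical.

```python
import math


def rgb_to_hsi(r, g, b):
    """Convert 8-bit RGB to (hue in degrees, saturation, intensity).

    Saturation and intensity are in [0, 1]. Hue is set to 0 for gray
    pixels, where it is undefined.
    """
    r, g, b = r / 255.0, g / 255.0, b / 255.0
    intensity = (r + g + b) / 3.0
    saturation = 0.0 if intensity == 0 else 1.0 - min(r, g, b) / intensity
    numerator = 0.5 * ((r - g) + (r - b))
    denominator = math.sqrt((r - g) ** 2 + (r - b) * (g - b))
    if denominator == 0:
        hue = 0.0
    else:
        # Clamp to guard against rounding slightly outside [-1, 1].
        ratio = max(-1.0, min(1.0, numerator / denominator))
        hue = math.degrees(math.acos(ratio))
        if b > g:
            hue = 360.0 - hue
    return hue, saturation, intensity


for name, rgb in [("red", (255, 0, 0)), ("green", (0, 255, 0)),
                  ("blue", (0, 0, 255))]:
    hue, sat, inten = rgb_to_hsi(*rgb)
    print(name, round(hue), round(sat, 3), round(inten, 3))
```

Separating hue from intensity is what makes a comparison against a small set of representative colors less sensitive to lighting changes than a comparison in RGB.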

News Video Shot Boundary Detection using Singular Value Decomposition and Incremental Clustering (특이값 분해와 점증적 클러스터링을 이용한 뉴스 비디오 샷 경계 탐지)

  • Lee, Han-Sung;Im, Young-Hee;Park, Dai-Hee;Lee, Seong-Whan
    • Journal of KIISE:Software and Applications
    • /
    • v.36 no.2
    • /
    • pp.169-177
    • /
    • 2009
  • In this paper, we propose a new shot boundary detection method optimized for news video story parsing. The method was designed to satisfy all of the following requirements: 1) minimizing incorrect data in the data set for anchor shot detection by improving the recall ratio; 2) detecting abrupt cuts and gradual transitions with a single algorithm, so that news video is divided into shots in one scan of the data set; and 3) classifying shots as static or dynamic, thereby reducing the search space for the subsequent anchor shot detection stage. The proposed method, based on singular value decomposition with incremental clustering and a Mercer kernel, has additional desirable features. Applying singular value decomposition removes noise and trivial variations in the video sequence, which improves separability. The Mercer kernel improves the chance of detecting shots that are not separable in the input space by mapping the data to a high-dimensional feature space. The experimental results demonstrate the superiority of the proposed method with respect to the recall criterion and search space reduction for anchor shot detection.