• Title/Abstract/Keyword: Video modeling

Statistical Model for Emotional Video Shot Characterization

  • 박현재;강행봉
    • The Journal of Korean Institute of Communications and Information Sciences / v.28 no.12C / pp.1200-1208 / 2003
  • Affective computing plays an important role in intelligent Human-Computer Interaction (HCI). To detect emotional events, it is desirable to construct a computing model for extracting emotion-related features from video. In this paper, we propose a statistical model based on the probabilistic distribution of low-level features in video shots. The proposed method extracts low-level features from video shots and then forms a GMM (Gaussian Mixture Model) for them to detect emotional shots. As low-level features, we use color, camera motion, and the sequence of shot lengths. The features are modeled as a GMM using the EM (Expectation Maximization) algorithm, and the relations between time and emotions are estimated by MLE (Maximum Likelihood Estimation). Finally, the two statistical models are combined within a Bayesian framework to detect emotional events in video.
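
The GMM/EM step named in this abstract can be illustrated with a minimal sketch. It assumes a single scalar feature per shot and two mixture components. The initialisation, the function names (`fit_gmm_1d`, `log_likelihood`), and the synthetic data are illustrative assumptions, not the authors' implementation, which the abstract does not specify.

```python
import math
import random


def gaussian_pdf(x, mean, var):
    """Density of a 1-D Gaussian with the given mean and variance."""
    return math.exp(-((x - mean) ** 2) / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)


def fit_gmm_1d(data, k=2, iters=100, min_var=1e-6):
    """Fit a k-component 1-D GMM with EM. Returns (weights, means, variances)."""
    n = len(data)
    ordered = sorted(data)
    # Initialisation (an assumption): means at evenly spaced quantiles,
    # a shared variance equal to the sample variance, and equal weights.
    means = [ordered[int((j + 0.5) * n / k)] for j in range(k)]
    mu = sum(data) / n
    shared_var = max(sum((x - mu) ** 2 for x in data) / n, min_var)
    variances = [shared_var] * k
    weights = [1.0 / k] * k

    for _ in range(iters):
        # E-step: posterior probability (responsibility) of each component.
        resp = []
        for x in data:
            p = [weights[j] * gaussian_pdf(x, means[j], variances[j]) for j in range(k)]
            total = sum(p) or 1e-300  # guard against underflow to zero
            resp.append([pj / total for pj in p])
        # M-step: re-estimate weights, means, and variances.
        for j in range(k):
            nj = sum(r[j] for r in resp)
            if nj < 1e-12:
                continue  # leave an empty component unchanged
            weights[j] = nj / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nj
            variances[j] = max(
                sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nj,
                min_var,
            )
    return weights, means, variances


def log_likelihood(x, weights, means, variances):
    """Log-likelihood of one feature value under the fitted mixture."""
    p = sum(w * gaussian_pdf(x, m, v) for w, m, v in zip(weights, means, variances))
    return math.log(max(p, 1e-300))


# Synthetic stand-in for a low-level feature (for example, shot length).
rng = random.Random(0)
samples = [rng.gauss(0.0, 1.0) for _ in range(200)] + [rng.gauss(5.0, 1.0) for _ in range(200)]
weights, means, variances = fit_gmm_1d(samples, k=2)
```

A shot could then be scored by comparing `log_likelihood` under models fitted per emotion class. That comparison is one plausible use, not something the abstract spells out.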

A Functional Modeling of Composition Manager for Service Composition Based on TINA

  • 신영석;임선환
    • Journal of the Korea Institute of Information and Communication Engineering / v.8 no.2 / pp.344-351 / 2004
  • This paper describes the modeling of a service composition manager based on TINA (Telecommunication Information Networking Architecture). The service composition function is mainly motivated by the desire to easily generate new services using existing services from retailers or third-party service providers. The TINA-C (Consortium) specification for service composition does not include detailed composition procedures or object models. In this paper, we propose a model of components for service composition, which adapts a static composition feature in a single provider domain. To validate the proposed model, we implemented a prototype service composition function that combines two multimedia services: a VOD (Video On Demand) service and a VCS (Video Conference Service). As a result, we obtain the specification of the detailed composition architecture between a retailer domain and a third-party service provider domain.

Motion Segmentation from Color Video Sequences based on AMF

  • Kim, Alla;Kim, Yoon-Ho
    • The Journal of Korea Institute of Information, Electronics, and Communication Technology / v.2 no.3 / pp.31-38 / 2009
  • Identifying moving objects in data is a typical task in many computer vision applications. In this paper, we propose a motion segmentation method that consists of background subtraction and foreground pixel segmentation. The Approximated Median Filter (AMF) was chosen to perform background modeling. To demonstrate the effectiveness of the proposed approach, we tested it on gray-scale video data as well as in the RGB color space.
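
As a rough illustration of AMF background modeling, the sketch below uses the commonly described update rule: each background pixel moves one step toward the current frame, and a pixel is marked foreground when it differs from the background by more than a threshold. Frames are plain nested lists of gray levels. The step size, the threshold, and the function names are assumptions, since the abstract gives no parameters.

```python
def foreground_mask(background, frame, threshold=25):
    """Mark pixels whose distance from the background exceeds the threshold."""
    return [
        [abs(p - b) > threshold for b, p in zip(bg_row, fr_row)]
        for bg_row, fr_row in zip(background, frame)
    ]


def amf_update(background, frame, step=1):
    """Move each background pixel one step toward the current frame.

    Repeating this over many frames drives the estimate toward the
    temporal median of each pixel.
    """
    return [
        [b + step if p > b else b - step if p < b else b for b, p in zip(bg_row, fr_row)]
        for bg_row, fr_row in zip(background, frame)
    ]


# Tiny 2x2 gray-scale example (hypothetical values).
background = [[100, 100], [50, 50]]
frame = [[100, 200], [40, 50]]
mask = foreground_mask(background, frame)
background = amf_update(background, frame)
```

For RGB input, one simple option is to run the same update on each channel separately and mark a pixel as foreground when any channel exceeds the threshold. Whether the paper does this is not stated.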

Data Model, Query Language, and Indexing Scheme for Structured Video Documents

  • 류은숙;이규철
    • Journal of Korea Multimedia Society / v.1 no.1 / pp.1-17 / 1998
  • Video information is an important component of multimedia systems such as digital libraries, the World-Wide Web (WWW), and Video-On-Demand (VOD) service systems. Video information inherently has a hierarchical document structure, so it is called a "structured video document" in this paper. This paper proposes a data model, a query language, and an indexing scheme for structured video documents in order to store, retrieve, and share video documents efficiently. In representing structured video documents, the object-oriented data modeling technique is used, since the hierarchical structure information can be modeled as complex objects. We also define object types for the structure information. Our query language supports not only content-based retrieval but also structure queries, meaning queries based on the structure of video documents, and queries on spatial/temporal relations in video documents. In order to perform structure queries efficiently and to reduce the storage overhead of indices, an optimized inverted index structure is proposed.
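
To make the idea of a hierarchical video document with an inverted index concrete, here is a small generic sketch. The node kinds (`video`, `scene`, `shot`), the `Node` class, and the plain keyword-to-node index are hypothetical. The abstract does not describe the paper's object types or its optimized index layout, so this shows only the general technique.

```python
from collections import defaultdict
from dataclasses import dataclass, field


@dataclass
class Node:
    """One element of a hierarchical video document (hypothetical type)."""
    node_id: str
    kind: str  # for example "video", "scene", or "shot"
    keywords: tuple = ()
    children: list = field(default_factory=list)


def build_index(root):
    """Return (keyword -> node ids, child id -> parent id) for the tree."""
    index = defaultdict(set)
    parent = {}
    stack = [root]
    while stack:
        node = stack.pop()
        for keyword in node.keywords:
            index[keyword].add(node.node_id)
        for child in node.children:
            parent[child.node_id] = node.node_id
            stack.append(child)
    return index, parent


def ancestors(node_id, parent):
    """Walk up the hierarchy, nearest ancestor first (a structure query)."""
    chain = []
    while node_id in parent:
        node_id = parent[node_id]
        chain.append(node_id)
    return chain


shot1 = Node("shot1", "shot", ("kickoff",))
shot2 = Node("shot2", "shot", ("goal",))
scene1 = Node("scene1", "scene", ("first-half",), [shot1, shot2])
video1 = Node("video1", "video", ("soccer",), [scene1])
index, parent = build_index(video1)
```

A content query such as `index["goal"]` returns matching node ids, and `ancestors` answers a structure query such as "which scene and video contain this shot".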

A Water Model Study on Molten Metal Flow in Die Cavity of Die Casting

  • Kim, Myung-Jae;Choi, Hee-Ho;Cho, Nam-Don
    • Journal of Korea Foundry Society / v.14 no.6 / pp.576-589 / 1994
  • Water modeling experiments and computer simulation for predicting defects in die castings are very important for producing high-quality castings at lower cost. The relation between the variable air vent system and the characteristics of the fluid flow in the die cavity is studied using water modeling tests, which give insight into the reasonable design of the die cavity, vent arrangement, and gating system. For the water modeling tests, injection is performed using water containing NaCl. Flow behavior in the cavities is visualized with a high-speed camera and a video tape recorder, and local filling time is measured with electrode sensors. Special attention is paid to the configuration of the die cavity. The computer simulation results are examined and compared with the results of the water modeling experiments. There are close correlations between the simulated results and the water modeling results.

City-Scale Modeling for Street Navigation

  • Huang, Fay;Klette, Reinhard
    • Journal of information and communication convergence engineering / v.10 no.4 / pp.411-419 / 2012
  • This paper proposes a semi-automatic image-based approach for 3-dimensional (3D) modeling of buildings along streets. Image-based urban 3D modeling techniques are typically based on the use of aerial and ground-level images. The aerial image of the relevant area is extracted from publicly available sources in Google Maps by stitching together different patches of the map. Panoramic images are common for ground-level recording because they have advantages for 3D modeling. A panoramic video recorder is used in the proposed approach for recording sequences of ground-level spherical panoramic images. The proposed approach has two advantages. First, detected camera trajectories are more accurate and stable (compared to methods using multi-view planar images only) due to the use of spherical panoramic images. Second, we extract the texture of a facade of a building from a single panoramic image. Thus, there is no need to deal with color blending problems that typically occur when using overlapping textures.

Adaptive Background Modeling for Crowded Scenes

  • Lee, Gwang-Gook;Song, Su-Han;Ka, Kee-Hwan;Yoon, Ja-Young;Kim, Jae-Jun;Kim, Whoi-Yul
    • Journal of Korea Multimedia Society / v.11 no.5 / pp.597-609 / 2008
  • Due to the recursive updating nature of the background model, previous background modeling methods are often perturbed by crowded scenes, where foreground pixels occur more frequently than background pixels. To resolve this problem, an adaptive background modeling method based on the well-known Gaussian mixture background model is proposed. In the proposed method, the learning rate of the background model is adaptively adjusted with respect to the crowdedness of the scene. Consequently, the learning process is suppressed in crowded scenes to maintain a proper background model. Experiments on a real dataset showed that the proposed method performs background subtraction effectively even in crowded situations, while its performance in normal scenes is almost the same as that of the previous method. In addition, the F-measure increased by 5-10% compared with previous background modeling methods on videos of crowded situations.
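
The idea of suppressing learning in crowded scenes can be sketched as follows. For brevity this uses a single running-average background rather than the Gaussian mixture model the paper builds on. The crowdedness measure (fraction of foreground pixels) and the linear rule `base_alpha * (1 - crowdedness)` are assumptions, because the abstract does not give the actual adjustment function.

```python
def crowdedness(mask):
    """Fraction of pixels currently labeled foreground (assumed measure)."""
    total = sum(len(row) for row in mask)
    return sum(sum(1 for v in row if v) for row in mask) / total


def adaptive_alpha(base_alpha, crowd):
    """Shrink the learning rate as the scene gets more crowded (assumed rule)."""
    return base_alpha * max(0.0, 1.0 - crowd)


def update_background(background, frame, alpha):
    """Recursive running-average update with learning rate alpha."""
    return [
        [(1.0 - alpha) * b + alpha * p for b, p in zip(bg_row, fr_row)]
        for bg_row, fr_row in zip(background, frame)
    ]


mask = [[True, False], [False, False]]
alpha = adaptive_alpha(0.1, crowdedness(mask))
new_background = update_background([[100.0, 100.0]], [[200, 100]], alpha)
```

With a fully crowded mask the rate drops to zero and the background is left unchanged, which matches the abstract's point that learning is suppressed in crowded scenes.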

Auto-Analysis of Traffic Flow through Semantic Modeling of Moving Objects

  • Choi, Chang;Cho, Mi-Young;Choi, Jun-Ho;Choi, Dong-Jin;Kim, Pan-Koo
    • The Journal of The Korea Institute of Intelligent Transport Systems / v.8 no.6 / pp.36-45 / 2009
  • Recently, there has been growing interest in automatic traffic flow analysis and accident detection using various low-level information from road video. In this paper, we study an algorithm for automatic traffic flow analysis and its application to traffic accident detection in traffic management systems. To this end, spatio-temporal relation models based on topological and directional relations are built, and the proposed models are then matched with the directional motion verbs in Levin's verbs of inherently directed motion. Finally, synonyms and antonyms are added using WordNet. To measure the similarity between the proposed models and the trajectory of a moving object in the video, the objects are extracted and their trajectories are compared with the proposed models. Because each proposed model has different features, the generated rules are applied to similarity measurement by TSR (Tangent Space Representation). This research can be extended to automatic vehicle accident detection using CCTV.

Transform domain Wyner-Ziv Coding based on the frequency-adaptive channel noise modeling

  • Kim, Byung-Hee;Ko, Bong-Hyuck;Jeon, Byeung-Woo
    • Journal of Broadcast Engineering / v.14 no.2 / pp.144-153 / 2009
  • Recently, as the need for lightweight video encoding techniques has risen for applications such as UCC (User Created Contents) and multiview video, Distributed Video Coding (DVC) has been vigorously investigated. In DVC, the decoder rather than the encoder performs motion estimation/compensation, which accounts for most of the computational complexity. Wyner-Ziv coding reconstructs an image by using a channel code to eliminate the noise in the side information, which is the decoder-side prediction of the original image. Generally, the side information of Wyner-Ziv coding is generated by frame interpolation between key frames. A channel code such as a Turbo code or an LDPC code, which performs close to the Shannon limit, is employed. The noise model used for channel decoding in Wyner-Ziv coding is called virtual channel noise and is generally modeled by a Laplacian or Gaussian distribution. In this paper, we propose a Wyner-Ziv coding method based on frequency-adaptive channel noise modeling in the transform domain. Experimental results with various sequences show that the proposed method makes the channel noise model more accurate than the conventional scheme, improving the rate-distortion performance by up to 0.52 dB.
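
A Laplacian virtual-channel noise model with one parameter per frequency band can be sketched as below. It relies on the standard relation between a zero-mean Laplacian density f(x) = (alpha/2) exp(-alpha |x|) and its variance, sigma^2 = 2 / alpha^2. How the paper obtains the per-band residuals, and the band names used here, are assumptions.

```python
import math


def laplacian_alpha(residuals, min_var=1e-12):
    """Estimate alpha from zero-mean residuals via alpha = sqrt(2 / variance)."""
    variance = max(sum(r * r for r in residuals) / len(residuals), min_var)
    return math.sqrt(2.0 / variance)


def laplacian_pdf(x, alpha):
    """Zero-mean Laplacian density with scale parameter alpha."""
    return 0.5 * alpha * math.exp(-alpha * abs(x))


def band_alphas(residuals_by_band):
    """One alpha per transform band, so each band gets its own noise model."""
    return {band: laplacian_alpha(values) for band, values in residuals_by_band.items()}


# Hypothetical residuals between side information and a reference, per band.
alphas = band_alphas({"dc": [4.0, -4.0, 4.0, -4.0], "ac1": [1.0, -1.0, 1.0, -1.0]})
```

A band with larger residual energy gets a smaller alpha, that is, a wider noise distribution, which is the sense in which the model adapts to frequency.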

A Frame-based Coding Mode Decision for Temporally Active Video Sequence in Distributed Video Coding

  • Hoangvan, Xiem;Park, Jong-Bin;Shim, Hiuk-Jae;Jeon, Byeung-Woo
    • Journal of Broadcast Engineering / v.16 no.3 / pp.510-519 / 2011
  • Intra mode decision is a useful coding tool in Distributed Video Coding (DVC) for improving coding efficiency for video sequences with fast motion. A major limitation of existing intra mode decision methods, however, is that their efficiency depends heavily on user-specified thresholds or modeling parameters. This paper proposes an entropy-based method to address this problem. The probabilities of the intra and Wyner-Ziv (WZ) modes are first determined by examining the correlation of pixels in the spatial and temporal directions. Based on these probabilities, the entropies of the intra and WZ modes are computed. A comparison of the entropy values decides between intra coding and WZ coding without relying on any user-specified thresholds or modeling parameters. Experimental results show superior rate-distortion performance, with PSNR improvements of up to 2 dB over conventional Wyner-Ziv coding without intra mode decision. Furthermore, since the proposed method does not require any thresholds or modeling parameters from users, it is very attractive for real-life applications.