• Title/Summary/Keyword: motion vector estimation

Search Result 365, Processing Time 0.024 seconds

Fast Block Mode Decision of Spatial Enhancement Layer using Interlayer Motion Vector Estimation in Scalable Video Coding (스케일러블 비디오 부호화에서 공간 계층간 움직임 벡터 예측를 이용한 고속 모드 결정)

  • Lee, Bum-Shik;Kim, Mun-Chul;Hahm, Sang-Jin;Lee, Keun-Sik;Park, Chang-Seob
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2007.02a
    • /
    • pp.13-17
    • /
    • 2007
  • 스케일러블 비디오 코딩(SVC, Scalable Video Coding)은 MPEG(Moving Picture Expert Group)과 VCEG (Video Coding Expert Group)의 JVT(Joint VIdeo Team)에 의해 현재 표준화 되고 있는 새로운 압축 표준 기술이며 시간, 공간 및 화질의 스케일러빌리티를 지원하기 위해 계층 구조를 가지고 있다. 공간적 스케일러빌리티를 위해 기본 계층으로부터 텍스처, 움직임 그리고 잔차신호 정보를 예측하여 사용한다. 그러나 고효율의 압축효과를 얻기 위해 기존의 방식에서는 기본계층에서 얻은 세가지 정보이외에 현재 향상 계층에서 자체적으로 얻은 부호화 정보를 비교하여 최소의 RD(Rate Distortion) 비용을 가지는 정보를 이용하여 부호화 하도록 되어 있다. 하지만 이러한 방식은 향상 계층에서 인터 모드 결정 시 $16\times16,\;16\times8,\;8\times16,\;8\times8,\;4\times4,\;4\times8,\;4\times4$ 블록 모드에 대한 움직임 벡터 예측 및 보상 과정을 거쳐야 하기 때문에 향상 계층에서의 부호화 복잡도는 기본 계층에 비해 상당히 증가하게 된다. 본 논문에서는 기본계층에서 예측한 움직임 벡터 정보를 이용하여 항상 계층에서 모드 결정을 고속화하는 방법에 대해 소개한다. 제안된 방법은 기본 계층에서 예측한 블록모드 중에서 큰 블록인 $16\times16$ 블록에서 움직임 벡터가 (0, 0) 일 경우에 대하여 향상 계층에서는 $16\times16$매크로 블록에 대해서만 움직임 예측 및 보상을 수행함으로써 향상 계층에서 움직임 모드 결정을 조기에 완료하게 된다. 이것은 하위 공간 계층에서 예측한 움직임 벡터 정보가 아주 작을 때는 큰 블록 크기로 모드로 결정되는 일반적인 원리를 이용한 것이고 이 제안 방법을 이용하였을 경우 향상계층에의 모드 결정과정을 고속화함으로써 전체 스케일러빌 비디오 부호하기의 연산량 및 복잡도를 최대 70%까지 감소 시켰다. 그러나 연산량 감소에 따른 비트율의 증가와 화질 열화는 각각 최대 1.32%와 최대 0.11dB로 무시할 수 있을 정도로 작음을 확인 하였다.

  • PDF

Parallel Gaussian Processes for Gait and Phase Analysis (보행 방향 및 상태 분석을 위한 병렬 가우스 과정)

  • Sin, Bong-Kee
    • Journal of KIISE
    • /
    • v.42 no.6
    • /
    • pp.748-754
    • /
    • 2015
  • This paper proposes a sequential state estimation model consisting of continuous and discrete variables, as a way of generalizing all discrete-state factorial HMM, and gives a design of gait motion model based on the idea. The discrete state variable implements a Markov chain that models the gait dynamics, and for each state of the Markov chain, we created a Gaussian process over the space of the continuous variable. The Markov chain controls the switching among Gaussian processes, each of which models the rotation or various views of a gait state. Then a particle filter-based algorithm is presented to give an approximate filtering solution. Given an input vector sequence presented over time, this finds a trajectory that follows a Gaussian process and occasionally switches to another dynamically. Experimental results show that the proposed model can provide a very intuitive interpretation of video-based gait into a sequence of poses and a sequence of posture states.

Optimal Facial Emotion Feature Analysis Method based on ASM-LK Optical Flow (ASM-LK Optical Flow 기반 최적 얼굴정서 특징분석 기법)

  • Ko, Kwang-Eun;Park, Seung-Min;Park, Jun-Heong;Sim, Kwee-Bo
    • Journal of the Korean Institute of Intelligent Systems
    • /
    • v.21 no.4
    • /
    • pp.512-517
    • /
    • 2011
  • In this paper, we propose an Active Shape Model (ASM) and Lucas-Kanade (LK) optical flow-based feature extraction and analysis method for analyzing the emotional features from facial images. Considering the facial emotion feature regions are described by Facial Action Coding System, we construct the feature-related shape models based on the combination of landmarks and extract the LK optical flow vectors at each landmarks based on the centre pixels of motion vector window. The facial emotion features are modelled by the combination of the optical flow vectors and the emotional states of facial image can be estimated by the probabilistic estimation technique, such as Bayesian classifier. Also, we extract the optimal emotional features that are considered the high correlation between feature points and emotional states by using common spatial pattern (CSP) analysis in order to improvise the operational efficiency and accuracy of emotional feature extraction process.

An estimation of implied volatility for KOSPI200 option (KOSPI200 옵션의 내재변동성 추정)

  • Choi, Jieun;Lee, Jang Taek
    • Journal of the Korean Data and Information Science Society
    • /
    • v.25 no.3
    • /
    • pp.513-522
    • /
    • 2014
  • Using the assumption that the price of a stock follows a geometric Brownian motion with constant volatility, Black and Scholes (BS) derived a formula that gives the price of a European call option on the stock as a function of the stock price, the strike price, the time to maturity, the risk-free interest rate, the dividend rate paid by the stock, and the volatility of the stock's return. However, implied volatilities of BS method tend to depend on the stock prices and the time to maturity in practice. To address this shortcoming, we estimate the implied volatility function as a function of the strike priceand the time to maturity for data consisting of the daily prices for KOSPI200 call options from January 2007 to May 2009 using support vector regression (SVR), the multiple additive regression trees (MART) algorithm, and ordinary least squaress (OLS) regression. In conclusion, use of MART or SVR in the BS pricing model reduced both RMSE and MAE, compared to the OLS-based BS pricing model.

Voice Activity Detection using Motion and Variation of Intensity in The Mouth Region (입술 영역의 움직임과 밝기 변화를 이용한 음성구간 검출 알고리즘 개발)

  • Kim, Gi-Bak;Ryu, Je-Woong;Cho, Nam-Ik
    • Journal of Broadcast Engineering
    • /
    • v.17 no.3
    • /
    • pp.519-528
    • /
    • 2012
  • Voice activity detection (VAD) is generally conducted by extracting features from the acoustic signal and a decision rule. The performance of such VAD algorithms driven by the input acoustic signal highly depends on the acoustic noise. When video signals are available as well, the performance of VAD can be enhanced by using the visual information which is not affected by the acoustic noise. Previous visual VAD algorithms usually use single visual feature to detect the lip activity, such as active appearance models, optical flow or intensity variation. Based on the analysis of the weakness of each feature, we propose to combine intensity change measure and the optical flow in the mouth region, which can compensate for each other's weakness. In order to minimize the computational complexity, we develop simple measures that avoid statistical estimation or modeling. Specifically, the optical flow is the averaged motion vector of some grid regions and the intensity variation is detected by simple thresholding. To extract the mouth region, we propose a simple algorithm which first detects two eyes and uses the profile of intensity to detect the center of mouth. Experiments show that the proposed combination of two simple measures show higher detection rates for the given false positive rate than the methods that use a single feature.