• Title/Summary/Keyword: Motion similarity

Search Result 157, Processing Time 0.026 seconds

FRACTAL CODING OF VIDEO SEQUENCE USING CPM AND NCIM

  • Kim, Chang-Su;Kim, Rin-Chul;Lee, Sang-Uk
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 1996.06b
    • /
    • pp.72-76
    • /
    • 1996
  • We propose a novel algorithm for fractal video sequence coding, based on the circular prediction mapping (CPM), in which each range block is approximated by a domain block in the circularly previous frame. In our approach, the size of the domain block is set to be same as that of the range block for exploiting the high temporal correlation between the adjacent frames, while most other fractal coders use the domain block larger than the range block. Therefore the domain-range mapping in the CPM is similar to the block matching algorithm in the motion compensation techniques, and the advantages of this similarity are discussed. Also we show that the CPM can be combined with non-contractive inter-frame mapping (NCIM), improving the performance of the fractal sequence coder further. The computer simulation results on real image sequences demonstrate that the proposed algorithm provides very promising performance at low bit-rate, ranging from 40 Kbps to 250 Kbps.

  • PDF

Facial Animation Generation by Korean Text Input (한글 문자 입력에 따른 얼굴 에니메이션)

  • Kim, Tae-Eun;Park, You-Shin
    • The Journal of the Korea institute of electronic communication sciences
    • /
    • v.4 no.2
    • /
    • pp.116-122
    • /
    • 2009
  • In this paper, we propose a new method which generates the trajectory of the mouth shape for the characters by the user inputs. It is based on the character at a basis syllable and can be suitable to the mouth shape generation. In this paper, we understand the principle of the Korean language creation and find the similarity for the form of the mouth shape and select it as a basic syllable. We also consider the articulation of this phoneme for it and create a new mouth shape trajectory and apply at face of an 3D avatar.

  • PDF

Optimization of Railway Alignment Using GIS (GIS를 이용한 철도선형최적화)

  • 강인준;이준석;김수성
    • Proceedings of the KSR Conference
    • /
    • 2002.10a
    • /
    • pp.727-732
    • /
    • 2002
  • This study is to develop the model of alignment optimization based on design criteria by approaching through alignment of railway design and problems in economy, environment and technology for satisfying traffic volume of the main roads caused by economical and social developments. Now, Geographic Information System isn't applied when designing a present railway in home. And the design of railway alignment is still set on importance of transition curves and cant according to passenger comfort in abroad so tile study of railway alignment is at initiation phase so far. This paper is about decision of optimal alignment between two stations such as starting point and ending point automatically using GIS in optimization of railway alignment. A route between Sungsan city and Shinpung city is the training area and the study compared and evaluated optimal railway route by GIS automatically with present railway route designed. Present optimal fomulas was used in this study for optimization of railway alignment. The model of optimization of railway alignment was developed through topographical elements and it was mentioned by the model of road alignment because of the similarity in design of alignment. But the design of lateral track irregularities, cant fur passenger comfort and motion sickness fellowed by train rolling have to be considered more. Anyway, this study farmed the basis of using GIS and the study should be keep going on in the future.

  • PDF

Detection of Objects Temporally Stop Moving with Spatio-Temporal Segmentation (시공간 영상분할을 이용한 이동 및 이동 중 정지물체 검출)

  • Kim, Do-Hyung;Kim, Gyeong-Hwan
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.40 no.1
    • /
    • pp.142-151
    • /
    • 2015
  • This paper proposes a method for detection of objects temporally stop moving in video sequences taken by a moving camera. Even though the consequence of missed detection of those objects could be catastrophic in terms of application level requirements, not much attention has been paid in conventional approaches. In the proposed method, we introduce cues for consistent detection and tracking of objects: motion potential, position potential, and color distribution similarity. Integration of the three cues in the graph-cut algorithm makes possible to detect objects that temporally stop moving and are newly appearing. Experiment results prove that the proposed method can not only detect moving objects but also track objects stop moving.

Joint Spatial-Temporal Quality Improvement Scheme for H.264 Low Bit Rate Video Coding via Adaptive Frameskip

  • Cui, Ziguan;Gan, Zongliang;Zhu, Xiuchang
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.6 no.1
    • /
    • pp.426-445
    • /
    • 2012
  • Conventional rate control (RC) schemes for H.264 video coding usually regulate output bit rate to match channel bandwidth by adjusting quantization parameter (QP) at fixed full frame rate, and the passive frame skipping to avoid buffer overflow usually occurs when scene changes or high motions exist in video sequences especially at low bit rate, which degrades spatial-temporal quality and causes jerky effect. In this paper, an active content adaptive frame skipping scheme is proposed instead of passive methods, which skips subjectively trivial frames by structural similarity (SSIM) measurement between the original frame and the interpolated frame via motion vector (MV) copy scheme. The saved bits from skipped frames are allocated to coded key ones to enhance their spatial quality, and the skipped frames are well recovered based on MV copy scheme from adjacent key ones at the decoder side to maintain constant frame rate. Experimental results show that the proposed active SSIM-based frameskip scheme acquires better and more consistent spatial-temporal quality both in objective (PSNR) and subjective (SSIM) sense with low complexity compared to classic fixed frame rate control method JVT-G012 and prior objective metric based frameskip method.

U-net with vision transformer encoder for polyp segmentation in colonoscopy images (비전 트랜스포머 인코더가 포함된 U-net을 이용한 대장 내시경 이미지의 폴립 분할)

  • Ayana, Gelan;Choe, Se-woon
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2022.10a
    • /
    • pp.97-99
    • /
    • 2022
  • For the early identification and treatment of colorectal cancer, accurate polyp segmentation is crucial. However, polyp segmentation is a challenging task, and the majority of current approaches struggle with two issues. First, the position, size, and shape of each individual polyp varies greatly (intra-class inconsistency). Second, there is a significant degree of similarity between polyps and their surroundings under certain circumstances, such as motion blur and light reflection (inter-class indistinction). U-net, which is composed of convolutional neural networks as encoder and decoder, is considered as a standard for tackling this task. We propose an updated U-net architecture replacing the encoder part with vision transformer network for polyp segmentation. The proposed architecture performed better than the standard U-net architecture for the task of polyp segmentation.

  • PDF

Design of intelligent computing networks for a two-phase fluid flow with dusty particles hanging above a stretched cylinder

  • Tayyab Zamir;Farooq Ahmed Shah;Muhammad Shoaib;Atta Ullah
    • Computers and Concrete
    • /
    • v.32 no.4
    • /
    • pp.399-410
    • /
    • 2023
  • This study proposes a novel use of backpropagated Levenberg-Marquardt neural networks based on computational intelligence heuristics to comprehend the examination of hybrid nanoparticles on the flow of dusty liquid via stretched cylinder. A two-phase model is employed in the present work to describe the fluid flow. The use of desulphated nanoparticles of silver and molybdenum suspended in water as base fluid. The mathematical model represented in terms of partial differential equations, Implementing similarity transformationsis model is converted to ordinary differential equations for the analysis . By adjusting the particle mass concentration and curvature parameter, a unique technique is utilized to generate a dataset for the proposed Levenberg-Marquardt neural networks in various nanoparticle circumstances on the flow of dusty liquid via stretched cylinder. The intelligent solver Levenberg-Marquardt neural networks is trained, tested and verified to identify the nanoparticles on the flow of dusty liquid solution for various situations. The Levenberg-Marquardt neural networks approach is applied for the solution of the hybrid nanoparticles on the flow of dusty liquid via stretched cylinder model. It is validated by comparison with the standard solution, regression analysis, histograms, and absolute error analysis. Strong agreement between proposed results and reference solutions as well as accuracy provide an evidence of the framework's validity.

A Study on the Guided Search Method for Transcoding MPEG2 P frame to H.263 P frame in a Compressed Domain (압축상태에서 MPEG2 P 프레임을 H.263 P 프레임으로 변환하기 위한 가이드 탐색 방법 연구)

  • Um, Sung-Min;Kang, Eui-Seon;Lim, Young-Wan;Hwang, Jae-Gak
    • The KIPS Transactions:PartB
    • /
    • v.9B no.6
    • /
    • pp.745-752
    • /
    • 2002
  • The purpose of the paper is to enable a format transcoding between a heterogeneous compression format in a real time, and to enhance the compression ratio using characteristics of the compressed frame. In this paper, for the heterogeneous format transcoding, we tried to transcode from MPEG2 having a lower compression ratio to H.263 having a higher compression ratio. After analyzing MPEG 2 bit stream and H.263 bit stream of the same original video, we found that the number of intra coded macro blocks in MPEG 2 data is much higher than the number of the intra coded macro blocks in H.263 data. In the process of P frame generation, a intra coded macro block is generated when a motion estimation value representing the similarity between the previous frame and current frame does not meet a threshold. Especially the intra coded macro block has a great impact on the compression ratio. Hence the paper, we tried to minimize the number of intra coded macro blocks in H.263 data stream which is transcoded from MPEG 2 in a compressed domain. For the purpose, we propose a guided search method for transcoding the INTRA coded block into INTER coded block using the information about motion vectors surrounding the intra macro block in order to minimize the complexity of the motion estimation process. The experimental results show that the transcoding of MPEG 2 into H.263 can be done in a real time successfully.

Multi-View Image Deblurring for 3D Shape Reconstruction (3차원 형상 복원을 위한 다중시점 영상 디블러링)

  • Choi, Ho Yeol;Park, In Kyu
    • Journal of the Institute of Electronics and Information Engineers
    • /
    • v.49 no.11
    • /
    • pp.47-55
    • /
    • 2012
  • In this paper, we propose a method to reconstruct accurate 3D shape object by using multi-view images which are disturbed by motion blur. In multi-view deblurring, more precise PSF estimation can be done by using the geometric relationship between multi-view images. The proposed method first estimates initial 2D PSFs from individual input images. Then 3D PSF candidates are projected on the input images one by one to find the best one which are mostly consistent with the initial 2D PSFs. 3D PSF consists with direction and density and it represents the 3D trajectory of object motion. 야to restore 3D shape by using multi-view images computes the similarity map and estimates the position of 3D point. The estimated 3D PSF is again projected to input images and they replaces the intial 2D PSFs which are finally used in image deblurring. Experimental result shows that the quality of image deblurring and 3D reconstruction improves significantly compared with the result when the input images are independently deblurred.

Video Quality Metric Using One-Dimensional Histograms of Motion Vectors (움직임 벡터의 1차원 히스토그램을 이용한 비디오 화질 평가 척도)

  • Han, Ho-Sung;Kim, Dong-O;Park, Bae-Hong;Sim, Dong-Gyu
    • Journal of the Institute of Electronics Engineers of Korea SP
    • /
    • v.45 no.2
    • /
    • pp.21-28
    • /
    • 2008
  • This paper proposes a novel reduced-reference assessment method for video quality assessment, in which one-dimensional (1-D) histograms of motion vectors (MVs) are used as features of videos. The proposed method is more efficient than the conventional methods in view of computation time, because the proposed quality metric decodes MVs directly from video stream in the parsing process instead of reconstructing the distorted video at the receiver. Moreover, in view of data size, the propose method is efficient because a sender transmits 1-D histograms of MVs accumulated over whole input video sequences. Here, we use 1-D histograms of MVs accumulated over the whole video sequences, which is different from the conventional methods that assessed each image independently. For testing the similarity between histograms, we use histogram intersection and histogram difference methods. We compare the proposed method with the conventional methods for 52 video clips, which are coded under varying bit rate, image size, and frame rate. Experimental results show that the proposed method is more efficient than the conventional methods and that the proposed method is more similar to the mean opinion score (MOS) than conventional algorithms.