• Title/Summary/Keyword: Invariant feature

Search Result 433, Processing Time 0.027 seconds

Content-based Image Retrieval using Color Ratio and Moment of Object Region (객체영역의 컬러비와 모멘트를 이용한 내용기반 영상검색)

  • Kim, Eun-Kyong;Oh, Jun-Taek;Kim, Wook-Hyun
    • The KIPS Transactions:PartB
    • /
    • v.9B no.4
    • /
    • pp.501-508
    • /
    • 2002
  • In this paper, we propose a content-based image retrieval using the color ratio and moment of object region. We acquire an optimal spatial information by the region splitting that utilizes horizontal-vertical projection and dominant color. It is based on hypothesis that an object locates in the center of image. We use color ratio and moment as feature informations. Those are extracted from the splitted regions and have the invariant property for various transformation, and besides, similarity measure utilizes a modified histogram intersection to acquire correlation information between bins in a color histogram. In experimental results, the proposed method shows more flexible and efficient performance than existing methods based on region splitting.

Recognition and Pose Estimation of 3-D Objects for Visual Servoing (Visual Servoing을 위한 3차원 물체의 인식 및 자세 추정)

  • Yang, Jae-Ho;Jeong, Moon-Ho;Park, Mig-Non
    • Proceedings of the KIEE Conference
    • /
    • 2006.07d
    • /
    • pp.1931-1932
    • /
    • 2006
  • 로봇이 어떤 물체를 인지하고 그 물체에 대해 어떤 작업을 하고자 할 때 특정 물체의 인식 문제, 3차원 정보를 획득하는 문제, 자세를 추정하는 문제 등 해결해야 될 문제들이 있다. 물체를 인식하는 과정에서는 주위 배경과 물체의 크기의 변화, 회전, 가려짐 등으로 인해 물체 인식을 어렵게 만드는 요소들이 있다. 2차원 이미지를 통해 3차원 정보를 추출하는 과정은 일반적으로 두 대의 카메라를 이용하여 스테레오 이미지를 통해 얻는다. 이 때 좌우 영상간의 매칭의 과정이 필요하다. 자세 추정의 문제는 카메라 좌표와 물체의 좌표간의 관계를 알아야 한다. Visual Servoing을 어렵게 만드는 많은 요인들이 있으며 본 논문에서는 물체의 크기, 회전, 이동에 불변인 디스크립터(descriptor)를 사용하는 SIFT(Scale Invariant Feature Transform)를 통해 3차원 물체의 인식과 자세를 추정하는 방법을 제시한다. 또한 자세 추정을 위해 2차원 Keypoint들의 매칭을 3차원 정보를 통해 검증하는 방법을 제시한다. (SIFT에 의해 추출된 point를 Keypoint라 명한다.)

  • PDF

Parallel Model Feature Extraction to Improve Performance of a BCI System (BCI 시스템의 성능 개선을 위한 병렬 모델 특징 추출)

  • Chum, Pharino;Park, Seung-Min;Sim, Kwee-Bo
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.19 no.11
    • /
    • pp.1022-1028
    • /
    • 2013
  • It is well knowns that based on the CSP (Common Spatial Pattern) algorithm, the linear projection of an EEG (Electroencephalography) signal can be made to spaces that optimize the discriminant between two patterns. Sharing disadvantages from linear time invariant systems, CSP suffers from the non-stationary nature of EEGs causing the performance of the classification in a BCI (Brain-Computer Interface) system to drop significantly when comparing the training data and test data. The author has suggested a simple idea based on the parallel model of CSP filters to improve the performance of BCI systems. The model was tested with a simple CSP algorithm (without any elaborate regularizing methods) and a perceptron learning algorithm as a classifier to determine the improvement of the system. The simulation showed that the parallel model could improve classification performance by over 10% compared to conventional CSP methods.

3D Workspace Modeling Based on Context Understanding for Robotic Manipulation (컨텍스트 이해를 통한 로봇의 작업을 위해 필요한 3D 작업공간 모델링)

  • Kim, Eun-Young;Lee, Suk-Han;Jang, Dae-Sik;Han, Jung-Hyun
    • Annual Conference of KIPS
    • /
    • 2005.05a
    • /
    • pp.1619-1622
    • /
    • 2005
  • 본 논문에서는 로봇이 작업을 계획하기 위해 필요한 3차원 작업 공간을 세 가지의 컨텍스트(context)들을 이해함으로써 빠르게 모델링하는 새로운 기법을 소개 하고 있다. 로봇이 사람과 비슷한 속도와 정확도로 작업 공간을 이해하고 모델링하는 것에 초점을 두고 있으며 이를 위해 작업 공간상의 특징적인 세 가지의 컨텍스트(작업공간의 간략화를 위한 전체 공간상의 평면특징, 데이터베이스에 미리 정의된 물체 그리고 로봇의 주어진 작업에 따라 다양한 상세함을 갖는 그 외의 장애물)를 정의하였고, 그것들을 빠르게 이해함으로써 어떻게 3차원 작업 공간을 형성하는지 설명하고 있다. 본 논문에서 3 차원 정보를 갖는 scale invariant feature transformation(SIFT)를 stereo-sis SIFT 로 간주했으며 이를 이용하여 위에서 언급한 컨텍스트들을 이해하였고 다양한 카메라의 위치로부터 얻어지는 여러 개의 장면들을 정합하였다. 또한, 실험을 통해 제안한 방법의 타당성도 검증하였다.

  • PDF

A New Shape-Based Object Category Recognition Technique using Affine Category Shape Model (Affine Category Shape Model을 이용한 형태 기반 범주 물체 인식 기법)

  • Kim, Dong-Hwan;Choi, Yu-Kyung;Park, Sung-Kee
    • The Journal of Korea Robotics Society
    • /
    • v.4 no.3
    • /
    • pp.185-191
    • /
    • 2009
  • This paper presents a new shape-based algorithm using affine category shape model for object category recognition and model learning. Affine category shape model is a graph of interconnected nodes whose geometric interactions are modeled using pairwise potentials. In its learning phase, it can efficiently handle large pose variations of objects in training images by estimating 2-D homography transformation between the model and the training images. Since the pairwise potentials are defined on only relative geometric relationship betweenfeatures, the proposed matching algorithm is translation and in-plane rotation invariant and robust to affine transformation. We apply spectral matching algorithm to find feature correspondences, which are then used as initial correspondences for RANSAC algorithm. The 2-D homography transformation and the inlier correspondences which are consistent with this estimate can be efficiently estimated through RANSAC, and new correspondences also can be detected by using the estimated 2-D homography transformation. Experimental results on object category database show that the proposed algorithm is robust to pose variation of objects and provides good recognition performance.

  • PDF

A Novel Multifocus Image Fusion Algorithm Based on Nonsubsampled Contourlet Transform

  • Liu, Cuiyin;Cheng, Peng;Chen, Shu-Qing;Wang, Cuiwei;Xiang, Fenghong
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.7 no.3
    • /
    • pp.539-557
    • /
    • 2013
  • A novel multifocus image fusion algorithm based on NSCT is proposed in this paper. In order to not only attain the image focusing properties and more visual information in the fused image, but also sensitive to the human visual perception, a local multidirection variance (LEOV) fusion rule is proposed for lowpass subband coefficient. In order to introduce more visual saliency, a modified local contrast is defined. In addition, according to the feature of distribution of highpass subband coefficients, a direction vector is proposed to constrain the modified local contrast and construct the new fusion rule for highpass subband coefficients selection The NSCT is a flexible multiscale, multidirection, and shift-invariant tool for image decomposition, which can be implemented via the atrous algorithm. The proposed fusion algorithm based on NSCT not only can prevent artifacts and erroneous from introducing into the fused image, but also can eliminate 'block effect' and 'frequency aliasing' phenomenon. Experimental results show that the proposed method achieved better fusion results than wavelet-based and CT-based fusion method in contrast and clarity.

Feature Based Techniques for a Driver's Distraction Detection using Supervised Learning Algorithms based on Fixed Monocular Video Camera

  • Ali, Syed Farooq;Hassan, Malik Tahir
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.12 no.8
    • /
    • pp.3820-3841
    • /
    • 2018
  • Most of the accidents occur due to drowsiness while driving, avoiding road signs and due to driver's distraction. Driver's distraction depends on various factors which include talking with passengers while driving, mood disorder, nervousness, anger, over-excitement, anxiety, loud music, illness, fatigue and different driver's head rotations due to change in yaw, pitch and roll angle. The contribution of this paper is two-fold. Firstly, a data set is generated for conducting different experiments on driver's distraction. Secondly, novel approaches are presented that use features based on facial points; especially the features computed using motion vectors and interpolation to detect a special type of driver's distraction, i.e., driver's head rotation due to change in yaw angle. These facial points are detected by Active Shape Model (ASM) and Boosted Regression with Markov Networks (BoRMaN). Various types of classifiers are trained and tested on different frames to decide about a driver's distraction. These approaches are also scale invariant. The results show that the approach that uses the novel ideas of motion vectors and interpolation outperforms other approaches in detection of driver's head rotation. We are able to achieve a percentage accuracy of 98.45 using Neural Network.

Image Registration Based On Statistical Descriptors In Frequency Domain

  • Chang, Min-hyuk;Ahmad, Muhammad-Bilal;Lee, Cheul-hee;Chun, Jong-hoon;Park, Seung-jin;Park, Jong-an
    • Proceedings of the IEEK Conference
    • /
    • 2002.07c
    • /
    • pp.1531-1534
    • /
    • 2002
  • Shape description and its corresponding matching algorithm is one of the main concerns in MPEG-7. In this paper, a new method is proposed for shape registration of 2D objects for MPEG-7 Shapes are recognized using the Hu statistical moments in frequency domain. The Hu moments are moment-based descriptors of planar shapes, which are invariant under general translation, rotational, scaling, and reflection transformation. The image is transformed into frequency domain using Fourier Transform. Annular and radial wedge distributions fur the power spectra are extracted. Different statistical features (Hu moments) are found f3r the power spectrum of each selected transformed individual feature. The Euclidean distance of the extracted moment descriptors of the features are found with respect to the shapes in the database. The minimum Euclidean distance is the candidate for the matched shape. The simulation results are performed on the test shapes of MPEG-7.

  • PDF

A novel hardware design for SIFT generation with reduced memory requirement

  • Kim, Eung Sup;Lee, Hyuk-Jae
    • JSTS:Journal of Semiconductor Technology and Science
    • /
    • v.13 no.2
    • /
    • pp.157-169
    • /
    • 2013
  • Scale Invariant Feature Transform (SIFT) generates image features widely used to match objects in different images. Previous work on hardware-based SIFT implementation requires excessive internal memory and hardware logic [1]. In this paper, a new hardware organization is proposed to implement SIFT with less memory and hardware cost than the previous work. To this end, a parallel Gaussian filter bank is adopted to eliminate the buffers that store intermediate results because parallel operations allow all intermediate results available at the same time. Furthermore, the processing order is changed from the raster-scan order to the block-by-block order so that the line buffer size storing the source image is also reduced. These techniques trade the reduction of memory size with a slight increase of the execution time and external memory bandwidth. As a result, the memory size is reduced by 94.4%. The proposed hardware for SIFT implementation includes the Descriptor generation block, which is omitted in the previous work [1]. The addition of the hardwired descriptor generation improves the computation speed by about 30 times when compared with the previous work.

An Image Retrieving Scheme Using Salient Features and Annotation Watermarking

  • Wang, Jenq-Haur;Liu, Chuan-Ming;Syu, Jhih-Siang;Chen, Yen-Lin
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.8 no.1
    • /
    • pp.213-231
    • /
    • 2014
  • Existing image search systems allow users to search images by keywords, or by example images through content-based image retrieval (CBIR). On the other hand, users might learn more relevant textual information about an image from its text captions or surrounding contexts within documents or Web pages. Without such contexts, it's difficult to extract semantic description directly from the image content. In this paper, we propose an annotation watermarking system for users to embed text descriptions, and retrieve more relevant textual information from similar images. First, tags associated with an image are converted by two-dimensional code and embedded into the image by discrete wavelet transform (DWT). Next, for images without annotations, similar images can be obtained by CBIR techniques and embedded annotations can be extracted. Specifically, we use global features such as color ratios and dominant sub-image colors for preliminary filtering. Then, local features such as Scale-Invariant Feature Transform (SIFT) descriptors are extracted for similarity matching. This design can achieve good effectiveness with reasonable processing time in practical systems. Our experimental results showed good accuracy in retrieving similar images and extracting relevant tags from similar images.