• Title/Summary/Keyword: Visual Feature Extraction

Search Result 141, Processing Time 0.027 seconds

Scene Change Detection and Key Frame Selection Using Fast Feature Extraction in the MPEG-Compressed Domain (MPEG 압축 영상에서의 고속 특징 요소 추출을 이용한 장면 전환 검출과 키 프레임 선택)

  • 송병철;김명준;나종범
    • Journal of Broadcast Engineering
    • /
    • v.4 no.2
    • /
    • pp.155-163
    • /
    • 1999
  • In this paper, we propose novel scene change detection and key frame selection techniques, which use two feature images, i.e., DC and edge images, extracted directly from MPEG compressed video. For fast edge image extraction. we suggest to utilize 5 lower AC coefficients of each DCT. Based on this scheme, we present another edge image extraction technique using AC prediction. Although the former is superior to the latter in terms of visual quality, both methods all can extract important edge features well. Simulation results indicate that scene changes such as cut. fades, and dissolves can be correctly detected by using the edge energy diagram obtained from edge images and histograms from DC images. In addition. we find that our edge images are comparable to those obtained in the spatial domain while keeping much lower computational cost. And based on HVS, a key frame of each scene can also be selected. In comparison with an existing method using optical flow. our scheme can select semantic key frames because we only use the above edge and DC images.

  • PDF

Contend Base Image Retrieval using Color Feature of Central Region and Optimized Comparing Bin (중앙 영역의 컬러 특징과 최적화된 빈 수를 이용한 내용기 반 영상검색)

  • Ryu, Eun-Ju;Song, Young-Jun;Park, Won-Bae;Ahn, Jae-Hyeong
    • The KIPS Transactions:PartB
    • /
    • v.11B no.5
    • /
    • pp.581-586
    • /
    • 2004
  • In this paper, we proposed a content-based image retrieval using a color feature for central region and its optimized comparing bin method. Human's visual characteristic is influenced by existent of central object. So we supposed that object is centrally located in image and then we extract color feature at central region. When the background of image is simple, the retrieval result can be bad affected by major color of background. Our method overcome this drawback as a result of the human visual characteristic. After we transform Image into HSV color space, we extract color feature from the quantized image with 16 level. The experimental results showed that the method using the eight high rank bin is better than using the 16 bin The case which extracts the feature with image's central region was superior compare with the case which extracts the feature with the whole image about 5%.

A Recognition Framework for Facial Expression by Expression HMM and Posterior Probability (표정 HMM과 사후 확률을 이용한 얼굴 표정 인식 프레임워크)

  • Kim, Jin-Ok
    • Journal of KIISE:Computing Practices and Letters
    • /
    • v.11 no.3
    • /
    • pp.284-291
    • /
    • 2005
  • I propose a framework for detecting, recognizing and classifying facial features based on learned expression patterns. The framework recognizes facial expressions by using PCA and expression HMM(EHMM) which is Hidden Markov Model (HMM) approach to represent the spatial information and the temporal dynamics of the time varying visual expression patterns. Because the low level spatial feature extraction is fused with the temporal analysis, a unified spatio-temporal approach of HMM to common detection, tracking and classification problems is effective. The proposed recognition framework is accomplished by applying posterior probability between current visual observations and previous visual evidences. Consequently, the framework shows accurate and robust results of recognition on as well simple expressions as basic 6 facial feature patterns. The method allows us to perform a set of important tasks such as facial-expression recognition, HCI and key-frame extraction.

Korean Web Content Extraction using Tag Rank Position and Gradient Boosting (태그 서열 위치와 경사 부스팅을 활용한 한국어 웹 본문 추출)

  • Mo, Jonghoon;Yu, Jae-Myung
    • Journal of KIISE
    • /
    • v.44 no.6
    • /
    • pp.581-586
    • /
    • 2017
  • For automatic web scraping, unnecessary components such as menus and advertisements need to be removed from web pages and main contents should be extracted automatically. A content block tends to be located in the middle of a web page. In particular, Korean web documents rarely include metadata and have a complex design; a suitable method of content extraction is therefore needed. Existing content extraction algorithms use the textual and structural features of content blocks because processing visual features requires heavy computation for rendering and image processing. In this paper, we propose a new content extraction method using the tag positions in HTML as a quasi-visual feature. In addition, we develop a tag rank position, a type of tag position not affected by text length, and show that gradient boosting with the tag rank position is a very accurate content extraction method. The result of this paper shows that the content extraction method can be used to collect high-quality text data automatically from various web pages.

Automatic Face Identification System Using Adaptive Face Region Detection and Facial Feature Vector Classification

  • Kim, Jung-Hoon;Do, Kyeong-Hoon;Lee, Eung-Joo
    • Proceedings of the IEEK Conference
    • /
    • 2002.07b
    • /
    • pp.1252-1255
    • /
    • 2002
  • In this paper, face recognition algorithm, by using skin color information of HSI color coordinate collected from face images, elliptical mask, fratures of face including eyes, nose and mouth, and geometrical feature vectors of face and facial angles, is proposed. The proposed algorithm improved face region extraction efficacy by using HSI information relatively similar to human's visual system along with color tone information about skin colors of face, elliptical mask and intensity information. Moreover, it improved face recognition efficacy with using feature information of eyes, nose and mouth, and Θ1(ACRED), Θ2(AMRED) and Θ 3(ANRED), which are geometrical face angles of face. In the proposed algorithm, it enables exact face reading by using color tone information, elliptical mask, brightness information and structural characteristic angle together, not like using only brightness information in existing algorithm. Moreover, it uses structural related value of characteristics and certain vectors together for the recognition method.

  • PDF

Sensor Fusion-Based Semantic Map Building (센서융합을 통한 시맨틱 지도의 작성)

  • Park, Joong-Tae;Song, Jae-Bok
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.17 no.3
    • /
    • pp.277-282
    • /
    • 2011
  • This paper describes a sensor fusion-based semantic map building which can improve the capabilities of a mobile robot in various domains including localization, path-planning and mapping. To build a semantic map, various environmental information, such as doors and cliff areas, should be extracted autonomously. Therefore, we propose a method to detect doors, cliff areas and robust visual features using a laser scanner and a vision sensor. The GHT (General Hough Transform) based recognition of door handles and the geometrical features of a door are used to detect doors. To detect the cliff area and robust visual features, the tilting laser scanner and SIFT features are used, respectively. The proposed method was verified by various experiments and showed that the robot could build a semantic map autonomously in various indoor environments.

Similar Image Retrieval Technique based on Semantics through Automatic Labeling Extraction of Personalized Images

  • Jung-Hee, Seo
    • Journal of information and communication convergence engineering
    • /
    • v.22 no.1
    • /
    • pp.56-63
    • /
    • 2024
  • Despite the rapid strides in content-based image retrieval, a notable disparity persists between the visual features of images and the semantic features discerned by humans. Hence, image retrieval based on the association of semantic similarities recognized by humans with visual similarities is a difficult task for most image-retrieval systems. Our study endeavors to bridge this gap by refining image semantics, aligning them more closely with human perception. Deep learning techniques are used to semantically classify images and retrieve those that are semantically similar to personalized images. Moreover, we introduce a keyword-based image retrieval, enabling automatic labeling of images in mobile environments. The proposed approach can improve the performance of a mobile device with limited resources and bandwidth by performing retrieval based on the visual features and keywords of the image on the mobile device.

Development of Inspect Algorithm for Pallets Using Vision System

  • Lee, Man-Hyung;Hong, Suh-Il
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2001.10a
    • /
    • pp.101.6-101
    • /
    • 2001
  • This paper deals with inspect algorithm using visual system. One of the major problems that arise during polymer production is the estimation of the noise of the product(bad pallets). An erroneous output can cause a lot of losses (production and financial losses). Therefore new methods for real-time inspection of the noise are demanded. For this reason, we have presented a development of vision system algorithm for the defect inspection of PE pallets. First of all, in order to detect the edge of object, the differential filter is used. And we apply to the labeling algorithm for feature extraction. This algorithm is designed for the defect inspection of pallets ...

  • PDF

Data Classification of Visual Quality for Image Recognition (영상인식을 위한 화질의 데이터 분류성)

  • Cho, Jae-Hyun
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2021.01a
    • /
    • pp.279-280
    • /
    • 2021
  • 패턴 또는 영상을 인식하기 위하여 먼저 기계 학습 모델을 선택하고, 선택된 모델은 여러 단계의 처리 단계 과정으로써, 학습 데이터 구성과 특징 추출 그리고 분류기 등으로 크게 나눌 수 있다. 기존의 학습 모델의 처리 단계 중 학습 데이터 구성은 첫 번째 중요한 단계이다. 본 논문에서는 학습 데이터들의 특징을 분석하여 데이터 분류성의 척도로 사용될 수 있는지를 검토하여 차후 기계 학습 및 딥 러닝의 인식을 높이고자 한다.

  • PDF

Visual Information Processing and Image Recognition (시각정보처리와 영상인식)

  • Cho, Jae-Hyun
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2022.01a
    • /
    • pp.387-388
    • /
    • 2022
  • 기계학습은 문제가 복잡하여 수학적으로 정의를 하는 것이 어려울 때 유용하게 쓸 수 있는 방법으로 최근 패턴 또는 영상을 인식하기 위하여 급속도록 많이 사용되고 있다. 본 논문에서는 기존의 학습 모델과는 다르게 인간의 시각정보처리과정 중 망막의 특성과 시각피질의 특성을 고려한 모델을 제시하여 학습의 첫 단계인 원시 데이터를 가공하는 과정을 개선함으로써 좀 더 효율적인 인식모델을 제안하고자 한다.

  • PDF