• Title/Summary/Keyword: Feature detector

Speaker Identification Using Higher-Order Statistics In Noisy Environment (고차 통계를 이용한 잡음 환경에서의 화자식별)

  • Shin, Tae-Young;Kim, Gi-Sung;Kwon, Young-Uk;Kim, Hyung-Soon
    • The Journal of the Acoustical Society of Korea / v.16 no.6 / pp.25-35 / 1997
  • Most speech analysis methods developed to date are based on second-order statistics, and one of their biggest drawbacks is that their performance degrades dramatically in noisy environments. In contrast, methods based on higher-order statistics (HOS), which have the property of suppressing Gaussian noise, enable robust feature extraction in noisy environments. In this paper we propose a text-independent speaker identification system using higher-order statistics and compare its performance with that of the conventional second-order-statistics-based method in both white and colored noise environments. The proposed speaker identification system is based on the vector quantization approach and employs an HOS-based voiced/unvoiced detector in order to extract feature parameters for voiced speech only, which has a non-Gaussian distribution and is known to contain most of the speaker-specific characteristics. Experimental results using a 50-speaker database show that the higher-order-statistics-based method gives better identification performance than the conventional second-order-statistics-based method in noisy environments.
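The Gaussian-suppression property mentioned in the abstract can be illustrated with one of the simplest higher-order statistics, the excess kurtosis (the normalized fourth-order cumulant), which is zero for Gaussian data. The sketch below is not the paper's voiced/unvoiced detector. The function name `excess_kurtosis` and the synthetic Laplacian signal standing in for a non-Gaussian source are assumptions made for illustration.

```python
import random


def excess_kurtosis(frame):
    """Normalized fourth-order cumulant: m4 / m2^2 - 3 (zero for Gaussian data)."""
    n = len(frame)
    mean = sum(frame) / n
    centered = [x - mean for x in frame]
    m2 = sum(c * c for c in centered) / n   # second central moment (variance)
    m4 = sum(c ** 4 for c in centered) / n  # fourth central moment
    if m2 == 0.0:
        return 0.0                          # constant frame: no shape information
    return m4 / (m2 * m2) - 3.0


rng = random.Random(0)
# Gaussian "noise": excess kurtosis should be close to 0.
gaussian = [rng.gauss(0.0, 1.0) for _ in range(20000)]
# Hypothetical non-Gaussian source: the difference of two exponentials is
# Laplace-distributed, whose theoretical excess kurtosis is 3.
laplacian = [rng.expovariate(1.0) - rng.expovariate(1.0) for _ in range(20000)]

print("gaussian :", round(excess_kurtosis(gaussian), 3))
print("laplacian:", round(excess_kurtosis(laplacian), 3))
```

A frame-level detector could threshold such a statistic, but the abstract does not give the statistic or the threshold actually used, so that step is left out.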

Facial Expression Recognition using Face Alignment and AdaBoost (얼굴정렬과 AdaBoost를 이용한 얼굴 표정 인식)

  • Jeong, Kyungjoong;Choi, Jaesik;Jang, Gil-Jin
    • Journal of the Institute of Electronics and Information Engineers / v.51 no.11 / pp.193-201 / 2014
  • This paper proposes a facial expression recognition system using face detection, face alignment, facial unit extraction, and training and testing algorithms based on AdaBoost classifiers. First, a face detector finds the face region. From that region, a face alignment algorithm extracts feature points. The facial units are a subset of action units generated by combining the obtained feature points. The facial units are generally more effective for smaller databases, represent facial expressions more efficiently, and reduce computation time, and hence can be applied to real-time scenarios. Experimental results in real scenarios showed that the proposed system achieves recognition rates above 90%.

Text Region Extraction using Pattern Histogram of Character-Edge Map in Natural Images (문자-에지 맵의 패턴 히스토그램을 이용한 자연이미지에서의 텍스트 영역 추출)

  • Park, Jong-Cheon;Hwang, Dong-Guk;Lee, Woo-Ram;Kwon, Kyo-Hyun;Jun, Byoung-Min
    • Proceedings of the KAIS Fall Conference / 2006.11a / pp.220-224 / 2006
  • Text contained in natural images carries much important information about the image. Therefore, if the text in natural images can be extracted, it can be applied to many important applications. In this paper, we propose a text region extraction method using the pattern histogram of a character-edge map. We extract edges with the Canny edge detector and create 16 kinds of edge maps from the extracted edges. We then build 8 kinds of character-edge maps, which capture character features, by combining the edge maps. We extract text regions using the 8 kinds of character-edge maps and the 16 kinds of edge maps. Text candidate regions are verified by analyzing the character-edge map pattern histogram and the structural features of the text region. The proposed method was tested on various kinds of natural images. It effectively extracted text regions from natural images with complex backgrounds, various letters, and various text colors.

Head Pose Estimation with Accumulated Histogram and Random Forest (누적 히스토그램과 랜덤 포레스트를 이용한 머리방향 추정)

  • Mun, Sung Hee;Lee, Chil woo
    • Smart Media Journal / v.5 no.1 / pp.38-43 / 2016
  • As smart environments spread through our living spaces, the need for approaches related to Human Computer Interaction (HCI) is increasing. One of them is head pose estimation. It is related to gaze direction estimation, since the head is closely tied to the eyes by the structure of the body. It is a key factor in identifying a person's intention or target of interest, and hence an essential research topic in HCI. In this paper, we propose an approach that estimates head pose among several pre-defined directions using a random forest classifier. We use the Canny edge detector to extract features from the difference image between the input image and an averaged frontal facial image, in order to extract the rotation information of the input image. From that, we obtain a binary edge image and build two accumulated histograms by counting the number of non-zero pixels along each of the axes. These two accumulated histograms are used as the features of the facial image. We used the CAS-PEAL-R1 dataset to train and test the random forest classifier and obtained 80.6% accuracy.
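The histogram step described above can be sketched as follows, assuming that "accumulated histogram" means the per-row and per-column counts of non-zero pixels in the binary edge image. The function name `axis_histograms` and the toy edge image are hypothetical; the edge detection and the random forest classifier are not shown.

```python
def axis_histograms(edge_image):
    """Count non-zero pixels in every row and every column of a binary edge image."""
    row_counts = [sum(1 for v in row if v != 0) for row in edge_image]
    col_counts = [sum(1 for v in col if v != 0) for col in zip(*edge_image)]
    return row_counts, col_counts


# Toy 3x4 binary edge image (255 marks an edge pixel).
edges = [
    [0, 255, 0, 0],
    [0, 255, 255, 0],
    [0, 0, 0, 0],
]

rows, cols = axis_histograms(edges)
feature = rows + cols  # concatenated vector that a classifier could consume
print(rows, cols)      # [1, 2, 0] [0, 2, 1, 0]
```

Concatenating the two histograms gives a feature whose length is height plus width, far smaller than the image itself.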

A Study on Image Recognition based on the Characteristics of Retinal Cells (망막 세포 특성에 의한 영상인식에 관한 연구)

  • Cho, Jae-Hyun;Kim, Do-Hyeon;Kim, Kwang-Baek
    • Journal of the Korea Institute of Information and Communication Engineering / v.11 no.11 / pp.2143-2149 / 2007
  • A visual cortex stimulator is one type of artificial retinal prosthesis for blind people; it stimulates brain cells directly, bypassing the processing of information from the retina to the visual cortex. In this paper, we propose an image construction and recognition model that resembles human visual processing by recognizing feature data with orientation information, which is a characteristic of the visual cortex. Image features are extracted with the Kirsch edge detector and then recognized with a back-propagation algorithm based on delta-bar-delta. Various numerical patterns are used to analyze the performance of the proposed method. In experiments, the proposed recognition model, which extracts image characteristics with orientation information along the path from retinal cells to the visual cortex, makes little difference in recognition rate but is insensitive to a variety of learning rates, similar to the human visual system.
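For reference, the Kirsch operator applies eight 3x3 compass masks (each a 45-degree rotation of the same ring of weights) and keeps the strongest response together with its direction index, which is one way to obtain orientation information. The sketch below is a minimal pure-Python version. The helper names, the toy image, and the choice to leave border pixels at zero are assumptions, and it is not claimed to match the paper's implementation.

```python
# Border positions of a 3x3 window, clockwise from the top-left corner.
RING = [(0, 0), (0, 1), (0, 2), (1, 2), (2, 2), (2, 1), (2, 0), (1, 0)]
BASE = [5, 5, 5, -3, -3, -3, -3, -3]  # ring weights of the "north" mask


def kirsch_masks():
    """Build the eight compass masks by rotating the ring weights in 45-degree steps."""
    masks = []
    for shift in range(8):
        mask = [[0] * 3 for _ in range(3)]
        for i, (r, c) in enumerate(RING):
            mask[r][c] = BASE[(i - shift) % 8]
        masks.append(mask)
    return masks


def kirsch(image):
    """Return (magnitude, direction) maps; border pixels are left at 0."""
    h, w = len(image), len(image[0])
    masks = kirsch_masks()
    magnitude = [[0] * w for _ in range(h)]
    direction = [[0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            responses = [
                sum(m[dy][dx] * image[y + dy - 1][x + dx - 1]
                    for dy in range(3) for dx in range(3))
                for m in masks
            ]
            best = max(range(8), key=lambda k: responses[k])  # first maximum wins ties
            magnitude[y][x] = responses[best]
            direction[y][x] = best
    return magnitude, direction


# Toy image: bright upper half, dark lower half (a horizontal edge).
image = [[10] * 4, [10] * 4, [0] * 4, [0] * 4]
mag, ang = kirsch(image)
print(mag[1][1], ang[1][1])
```

Because each mask sums to zero, a flat region yields a zero response in every direction.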

Detection of LSB Matching Revisited Using Pixel Difference Feature

  • Li, Wenxiang;Zhang, Tao;Zhu, Zhenhao;Zhang, Yan;Ping, Xin
    • KSII Transactions on Internet and Information Systems (TIIS) / v.7 no.10 / pp.2514-2526 / 2013
  • This paper presents a detection method for least significant bit matching revisited (LSBMR) steganography. Previous research shows that adjacent pixels of natural images are highly correlated and that the value 0 appears most frequently in the pixel difference. Because the message embedding process of LSBMR steganography has a weighted-smoothing effect on the distribution of the pixel difference, the frequency of value 0 in the pixel difference changes most significantly during message embedding, whereas the frequencies of other values remain approximately unchanged. By analyzing the effect of LSBMR steganography on the pixel difference distribution, an equation is derived to estimate the frequency of difference value 0 from the frequencies of difference values 1 and 2. The sum of two ratios is used as the steganalytic detector: the ratio of the estimated value to the actual value, and the ratio of the frequency of difference value 1 to that of difference value 0. Experimental results show that the proposed method can effectively detect LSBMR steganography and can outperform previously proposed methods.
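The quantities the detector is built from can be sketched as below: a histogram of differences between adjacent pixels, and the ratio of the frequency of difference value 1 to that of difference value 0. The abstract does not give the estimation equation for the frequency of value 0, so that term is omitted. Using horizontally adjacent pixels and absolute differences is an assumption, as are the function names and the toy image.

```python
from collections import Counter


def difference_histogram(image):
    """Histogram of absolute differences between horizontally adjacent pixels."""
    counts = Counter()
    for row in image:
        for left, right in zip(row, row[1:]):
            counts[abs(left - right)] += 1
    return counts


def ratio_h1_to_h0(image):
    """Frequency of difference value 1 divided by frequency of difference value 0."""
    hist = difference_histogram(image)
    if hist[0] == 0:
        return float("inf")  # no equal neighbours at all
    return hist[1] / hist[0]


# Toy grayscale image (two rows of four pixels).
image = [
    [10, 10, 11, 13],
    [5, 5, 5, 6],
]
hist = difference_histogram(image)
print(dict(hist), ratio_h1_to_h0(image))
```

If embedding reduces the count of zero differences, as the abstract describes, this ratio would be expected to rise for stego images relative to cover images.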

Sign Language Translation Using Deep Convolutional Neural Networks

  • Abiyev, Rahib H.;Arslan, Murat;Idoko, John Bush
    • KSII Transactions on Internet and Information Systems (TIIS) / v.14 no.2 / pp.631-653 / 2020
  • Sign language is a natural, visually oriented and non-verbal communication channel between people that facilitates communication through facial/bodily expressions, postures and a set of gestures. It is basically used for communication with people who are deaf or hard of hearing. In order to understand such communication quickly and accurately, the design of a successful sign language translation system is considered in this paper. The proposed system includes object detection and classification stages. Firstly, Single Shot Multi Box Detection (SSD) architecture is utilized for hand detection, then a deep learning structure based on the Inception v3 plus Support Vector Machine (SVM) that combines feature extraction and classification stages is proposed to constructively translate the detected hand gestures. A sign language fingerspelling dataset is used for the design of the proposed model. The obtained results and comparative analysis demonstrate the efficiency of using the proposed hybrid structure in sign language translation.

NON-UNIFORMITY CORRECTION- SYSTEM ANALYSIS FOR MULTI-SPECTRAL CAMERA

  • Park Jong-Euk;Kong Jong-Pil;Heo Haeng-Pal;Kim Young Sun;Chang Young Jun
    • Proceedings of the KSRS Conference / 2005.10a / pp.478-481 / 2005
  • The PMU (Payload Management Unit) is the main subsystem for the management, control, and power supply of MSC (Multi-Spectral Camera) payload operation. One of its most important functions for the electro-optical camera system is to perform Non-Uniformity Correction (NUC) on the raw imagery data, rearrange the data from the CCD (Charge Coupled Device) detector, and output it to the Data Compression and Storage Unit (DCSU). The NUC board in the PMU performs this function. In this paper, the NUC board system is described in terms of its configuration and function, the efficiency of non-uniformity correction, and the influence of data compression on the peculiar features of the CCD pixels. The NUC board is an image-processing unit within the PMU that receives video data from the CEU (Camera Electronic Unit) boards via a hotlink and performs non-uniformity corrections on the pixels according to commands received from the SBC (Single Board Computer) in the PMU. The lossy compression in the DCSU requires NUC under on-orbit conditions.
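The abstract does not state which correction the NUC board applies. As a generic illustration only, a common two-point scheme derives a per-pixel gain and offset from a dark frame and a uniformly illuminated (flat) frame, then rescales each raw pixel. All names and calibration values below are hypothetical.

```python
def two_point_coefficients(dark, flat):
    """Per-pixel gain and offset from a dark frame and a flat-field frame.

    Assumes every pixel responds to light (flat > dark); dead pixels would
    need separate handling to avoid division by zero.
    """
    spans = [f - d for d, f in zip(dark, flat)]
    target = sum(spans) / len(spans)      # mean response across the line
    gains = [target / s for s in spans]   # equalize each pixel to the mean
    offsets = list(dark)                  # dark level subtracted before scaling
    return gains, offsets


def correct(raw, gains, offsets):
    """Apply the correction: gain * (raw - offset) for each pixel."""
    return [g * (r - o) for r, g, o in zip(raw, gains, offsets)]


# Hypothetical calibration data for a three-pixel detector line.
dark = [10.0, 12.0, 8.0]
flat = [110.0, 92.0, 128.0]

gains, offsets = two_point_coefficients(dark, flat)
print(correct(flat, gains, offsets))  # the flat frame becomes (nearly) uniform
```

Applying the coefficients to the flat frame itself is a quick sanity check: every pixel should map to the same value, here the mean response of 100.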

Rotated object recognition based on corner feature points in mobile environment (모바일 환경 응용을 위한 코너 특징점 기반의 회전 객체 검출)

  • Kim, Dae-Hwan;Piao, Jin-Chun;Kim, Shin-Dug
    • Proceedings of the Korean Society of Computer Information Conference / 2013.07a / pp.23-26 / 2013
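  • With the recent growth in the image-processing capability of mobile devices, research is needed on efficient recognition techniques for the variety of image data that users demand. Unlike a high-performance PC environment, a mobile environment has a low-spec CPU and limited memory, so existing methods for recognizing a desired object in an image have difficulty meeting user requirements in real time. An object recognition method suited to the mobile environment is therefore needed. To recognize objects in real time in a mobile environment, this paper proposes extracting object feature points with the Harris corner detector[1], which uses object corner information, and recognizing object information in the image based on those points. With the proposed method, the corner information of an object is quickly extracted from the input image and compared with existing feature points to recognize the object in the image. In general, the color information of the corner pixels of a rotated feature point changes with the degree of rotation of the object. The color value of a feature point is affected by the degree of rotation and tends to blend with the surrounding pixel values. This paper analyzes the effect of rotation on pixel color values and reflects it in feature point extraction and object detection for rotated objects, so that it can be used effectively to detect rotated objects in an image. Recognizing objects from the corner information of feature points enables real-time use in a mobile environment through reduced computation, at the cost of a somewhat lower recognition rate. This property is quite effective for recognizing rotated objects even with a low-performance CPU and memory.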
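As background for the corner extraction step, the standard Harris response is R = det(M) - k * trace(M)^2, where M sums the products of image gradients over a small window. R is positive at corners, negative along edges, and near zero in flat regions. The sketch below is a minimal pure-Python version, assuming central-difference gradients, an unweighted 3x3 window, and k = 0.04. These choices and the toy image are illustrative and are not taken from the paper.

```python
def harris_response(image, k=0.04):
    """Harris corner response for interior pixels; a one-pixel border stays 0."""
    h, w = len(image), len(image[0])
    ix = [[0.0] * w for _ in range(h)]
    iy = [[0.0] * w for _ in range(h)]
    # Central-difference gradients on interior pixels.
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            ix[y][x] = (image[y][x + 1] - image[y][x - 1]) / 2.0
            iy[y][x] = (image[y + 1][x] - image[y - 1][x]) / 2.0
    response = [[0.0] * w for _ in range(h)]
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            sxx = syy = sxy = 0.0
            # Structure tensor entries summed over an unweighted 3x3 window.
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    gx, gy = ix[y + dy][x + dx], iy[y + dy][x + dx]
                    sxx += gx * gx
                    syy += gy * gy
                    sxy += gx * gy
            response[y][x] = (sxx * syy - sxy * sxy) - k * (sxx + syy) ** 2
    return response


# Toy 7x7 image: a bright square in the lower-right quadrant, corner at (3, 3).
image = [[1.0 if (y >= 3 and x >= 3) else 0.0 for x in range(7)] for y in range(7)]
r = harris_response(image)
print(r[3][3], r[5][3], r[1][1])  # corner, edge, flat region
```

Feature points would then be taken as local maxima of R above a threshold. How the paper adapts this to rotated objects is not specified beyond the abstract.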

Merging Features and Optical-NIR Color Gradient of Early-type Galaxies

  • Kim, Du-Ho;Im, Myeong-Sin
    • The Bulletin of The Korean Astronomical Society / v.35 no.2 / pp.41.1-41.1 / 2010
  • It has been suggested that merging plays an important role in the formation and evolution of early-type galaxies. Optical-NIR color gradients of early-type galaxies in high-density environments are found to be less steep than those in low-density environments, hinting at frequent merger activity in early-type galaxies in high-density environments. To confirm whether the flat color gradient is the result of dry mergers, we decided to take deep images to find merging features and relate them to the color gradient. We selected a sample showing extreme values of the optical-NIR color gradient based on data from a previous study and observed them with long exposures on the 1.5 m telescope at Maidanak Observatory. After masking out overlapping sources, our analysis reveals that these galaxies do not have extreme color gradient values. A high-degree sky flat technique was used during the observations to aid the discovery of faint, extended features. However, the flatness of the detector (SNUCAM) was already good enough that we could not see any marked improvement in image quality compared to images using normal sky flats. Additionally, we noticed a feature that looks like a merging tidal tail in the CFHT archival image, but it does not show up in the image we obtained. This demonstrates that flatness and correct sky estimation are very important when looking for faint merging features. In the future we plan to enlarge the sample.
