• Title/Summary/Keyword: Image Pattern Matching

Search Result 225, Processing Time 0.028 seconds

Hand Shape Recognition with Disparity Pattern of Multiple Model Images (복수 모델영상의 상위도 패턴을 이용한 손형상 인식)

  • 이칠우
    • Journal of Korea Multimedia Society
    • /
    • v.2 no.4
    • /
    • pp.400-408
    • /
    • 1999
  • This paper describes a method for making the "disparity pattern" which is basis of image matching with brightness difference; called disparity, between multiple model images, and an algorithm which recognizes hand shape by utilizing the pattern in measuring the distance between a input image and model images. The virtue of the algorithm is that only simple brightness difference calculated from multiple images by managing a whole image as the fundamental processing unit is patterned in two dimensional shape and then is used in the recognition process. Consequently, this method is very useful for other recognition algorithm requiring comparison of large scale image since correlation among multiple model images is applied simultaneously in recognition process.

  • PDF

Robust Facial Expression Recognition Based on Local Directional Pattern

  • Jabid, Taskeed;Kabir, Md. Hasanul;Chae, Oksam
    • ETRI Journal
    • /
    • v.32 no.5
    • /
    • pp.784-794
    • /
    • 2010
  • Automatic facial expression recognition has many potential applications in different areas of human computer interaction. However, they are not yet fully realized due to the lack of an effective facial feature descriptor. In this paper, we present a new appearance-based feature descriptor, the local directional pattern (LDP), to represent facial geometry and analyze its performance in expression recognition. An LDP feature is obtained by computing the edge response values in 8 directions at each pixel and encoding them into an 8 bit binary number using the relative strength of these edge responses. The LDP descriptor, a distribution of LDP codes within an image or image patch, is used to describe each expression image. The effectiveness of dimensionality reduction techniques, such as principal component analysis and AdaBoost, is also analyzed in terms of computational cost saving and classification accuracy. Two well-known machine learning methods, template matching and support vector machine, are used for classification using the Cohn-Kanade and Japanese female facial expression databases. Better classification accuracy shows the superiority of LDP descriptor against other appearance-based feature descriptors.

Detection of Mammographic Microcalcifications by Statistical Pattern Classification 81 Pattern Matching (통계적 패턴 분류법과 패턴 매칭을 이용한 유방영상의 미세석회화 검출)

  • 양윤석;김덕원;김은경
    • Journal of Biomedical Engineering Research
    • /
    • v.18 no.4
    • /
    • pp.357-364
    • /
    • 1997
  • The early detection of breast cancer is clearly a key ingredient for reducing breast cancer mortality. Microcalcification is the only visible feature of the DCIS's(ductal carcinoma in situ) which consist 15 ~ 20% of screening-detected breast cancer. Therefore, the analysis of the shapes and distributions of microcalcifications is very significant for the early detection. The automatic detection procedures have b(:on the concern of digital image processing for many years. We proposed here one efficient method which is essentially statistical pattern classification accelerated by one representative feature, correlation coefficient. We compared the results by this additional feature with results by a simple gray level thresholding. The average detection rate was increased from 48% by gray level feature only to 83% by the proposed method The performances were evaluated with TP rates and FP counts, and also with Bayes errors.

  • PDF

Semantic Image Retrieval Using Color Distribution and Similarity Measurement in WordNet (컬러 분포와 WordNet상의 유사도 측정을 이용한 의미적 이미지 검색)

  • Choi, Jun-Ho;Cho, Mi-Young;Kim, Pan-Koo
    • The KIPS Transactions:PartB
    • /
    • v.11B no.4
    • /
    • pp.509-516
    • /
    • 2004
  • Semantic interpretation of image is incomplete without some mechanism for understanding semantic content that is not directly visible. For this reason, human assisted content-annotation through natural language is an attachment of textual description to image. However, keyword-based retrieval is in the level of syntactic pattern matching. In other words, dissimilarity computation among terms is usually done by using string matching not concept matching. In this paper, we propose a method for computerized semantic similarity calculation In WordNet space. We consider the edge, depth, link type and density as well as existence of common ancestors. Also, we have introduced method that applied similarity measurement on semantic image retrieval. To combine wi#h the low level features, we use the spatial color distribution model. When tested on a image set of Microsoft's 'Design Gallery Line', proposed method outperforms other approach.

Self-localization of a Mobile Robot for Decreasing the Error and VRML Image Overlay (오차 감소를 위한 이동로봇 Self-Localization과 VRML 영상오버레이 기법)

  • Kwon Bang-Hyun;Shon Eun-Ho;Kim Young-Chul;Chong Kil-To
    • Journal of Institute of Control, Robotics and Systems
    • /
    • v.12 no.4
    • /
    • pp.389-394
    • /
    • 2006
  • Inaccurate localization exposes a robot to many dangerous conditions. It could make a robot be moved to wrong direction or damaged by collision with surrounding obstacles. There are numerous approaches to self-localization, and there are different modalities as well (vision, laser range finders, ultrasonic sonars). Since sensor information is generally uncertain and contains noise, there are many researches to reduce the noise. But, the correctness is limited because most researches are based on statistical approach. The goal of our research is to measure more exact robot location by matching between built VRML 3D model and real vision image. To determine the position of mobile robot, landmark-localization technique has been applied. Landmarks are any detectable structure in the physical environment. Some use vertical lines, others use specially designed markers, In this paper, specially designed markers are used as landmarks. Given known focal length and a single image of three landmarks it is possible to compute the angular separation between the lines of sight of the landmarks. The image-processing and neural network pattern matching techniques are employed to recognize landmarks placed in a robot working environment. After self-localization, the 2D scene of the vision is overlaid with the VRML scene.

VRML image overlay method for Robot's Self-Localization (VRML 영상오버레이기법을 이용한 로봇의 Self-Localization)

  • Sohn, Eun-Ho;Kwon, Bang-Hyun;Kim, Young-Chul;Chong, Kil-To
    • Proceedings of the KIEE Conference
    • /
    • 2006.04a
    • /
    • pp.318-320
    • /
    • 2006
  • Inaccurate localization exposes a robot to many dangerous conditions. It could make a robot be moved to wrong direction or damaged by collision with surrounding obstacles. There are numerous approaches to self-localization, and there are different modalities as well (vision, laser range finders, ultrasonic sonars). Since sensor information is generally uncertain and contains noise, there are many researches to reduce the noise. But, the correctness is limited because most researches are based on statistical approach. The goal of our research is to measure more exact robot location by matching between built VRML 3D model and real vision image. To determine the position of mobile robot, landmark-localitzation technique has been applied. Landmarks are any detectable structure in the physical environment. Some use vertical lines, others use specially designed markers, In this paper, specially designed markers are used as landmarks. Given known focal length and a single image of three landmarks it is possible to compute the angular separation between the lines of sight of the landmarks. The image-processing and neural network pattern matching techniques are employed to recognize landmarks placed in a robot working environment. After self-localization, the 2D scene of the vision is overlaid with the VRML scene.

  • PDF

A Study on a Ginseng Grade Decision Making Algorithm Using a Pattern Recognition Method (패턴인식을 이용한 수삼 등급판정 알고리즘에 관한 연구)

  • Jeong, Seokhoon;Ko, Kuk Won;Kang, Je-Yong;Jang, Suwon;Lee, Sangjoon
    • KIPS Transactions on Software and Data Engineering
    • /
    • v.5 no.7
    • /
    • pp.327-332
    • /
    • 2016
  • This study is a leading research project to develop an automatic grade decision making algorithm of a 6-years-old fresh ginseng. For this work, we developed a Ginseng image acquiring instrument which can take 4-direction's images of a Ginseng at the same time and obtained 245 jingen images using the instrument. The 12 parameters were extracted for each image by a manual way. Lastly, 4 parameters were selected depending on a Ginseng grade classification criteria of KGC Ginseng research institute and a survey result which a distribution of averaging 12 parameters. A pattern recognition classifier was used as a support vector machine, designed to "k-class classifier" using the OpenCV library which is a open-source platform. We had been surveyed the algorithm performance(Correct Matching Ratio, False Acceptance Ratio, False Reject Ratio) when the training data number was controlled 10 to 20. The result of the correct matching ratio is 94% of the $1^{st}$ ginseng grade, 98% of the $2^{nd}$ ginseng grade, 90% of the $3^{rd}$ ginseng grade, overall, showed high recognition performance with all grades when the number of training data are 10.

A Covariance-matching-based Model for Musical Symbol Recognition

  • Do, Luu-Ngoc;Yang, Hyung-Jeong;Kim, Soo-Hyung;Lee, Guee-Sang;Dinh, Cong Minh
    • Smart Media Journal
    • /
    • v.7 no.2
    • /
    • pp.23-33
    • /
    • 2018
  • A musical sheet is read by optical music recognition (OMR) systems that automatically recognize and reconstruct the read data to convert them into a machine-readable format such as XML so that the music can be played. This process, however, is very challenging due to the large variety of musical styles, symbol notation, and other distortions. In this paper, we present a model for the recognition of musical symbols through the use of a mobile application, whereby a camera is used to capture the input image; therefore, additional difficulties arise due to variations of the illumination and distortions. For our proposed model, we first generate a line adjacency graph (LAG) to remove the staff lines and to perform primitive detection. After symbol segmentation using the primitive information, we use a covariance-matching method to estimate the similarity between every symbol and pre-defined templates. This method generates the three hypotheses with the highest scores for likelihood measurement. We also add a global consistency (time measurements) to verify the three hypotheses in accordance with the structure of the musical sheets; one of the three hypotheses is chosen through a final decision. The results of the experiment show that our proposed method leads to promising results.

Hybrid Neural Classifier Combined with H-ART2 and F-LVQ for Face Recognition

  • Kim, Do-Hyeon;Cha, Eui-Young;Kim, Kwang-Baek
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 2005.06a
    • /
    • pp.1287-1292
    • /
    • 2005
  • This paper presents an effective pattern classification model by designing an artificial neural network based pattern classifiers for face recognition. First, a RGB image inputted from a frame grabber is converted into a HSV image which is similar to the human beings' vision system. Then, the coarse facial region is extracted using the hue(H) and saturation(S) components except intensity(V) component which is sensitive to the environmental illumination. Next, the fine facial region extraction process is performed by matching with the edge and gray based templates. To make a light-invariant and qualified facial image, histogram equalization and intensity compensation processing using illumination plane are performed. The finally extracted and enhanced facial images are used for training the pattern classification models. The proposed H-ART2 model which has the hierarchical ART2 layers and F-LVQ model which is optimized by fuzzy membership make it possible to classify facial patterns by optimizing relations of clusters and searching clustered reference patterns effectively. Experimental results show that the proposed face recognition system is as good as the SVM model which is famous for face recognition field in recognition rate and even better in classification speed. Moreover high recognition rate could be acquired by combining the proposed neural classification models.

  • PDF

Correction of Perspective Distortion Image Using Depth Information (깊이 정보를 이용한 원근 왜곡 영상의 보정)

  • Kwon, Soon-Kak;Lee, Dong-Seok
    • Journal of Korea Multimedia Society
    • /
    • v.18 no.2
    • /
    • pp.106-112
    • /
    • 2015
  • In this paper, we propose a method for correction of perspective distortion on a taken image. An image taken by a camera is caused perspective distortion depending on the direction of the camera when objects are projected onto the image. The proposed method in this paper is to obtain the normal vector of the plane through the depth information using a depth camera and calculate the direction of the camera based on this normal vector. Then the method corrects the perspective distortion to the view taken from the front side by performing a rotation transformation on the image according to the direction of the camera. Through the proposed method, it is possible to increase the processing speed than the conventional method such as correction of perspective distortion based on color information.