• Title/Summary/Keyword: Classification for Each

Search Result 3,953, Processing Time 0.034 seconds

Region Analysis of Business Card Images Acquired in PDA Using DCT and Information Pixel Density (DCT와 정보 화소 밀도를 이용한 PDA로 획득한 명함 영상에서의 영역 해석)

  • 김종흔;장익훈;김남철
    • The Journal of Korean Institute of Communications and Information Sciences
    • /
    • v.29 no.8C
    • /
    • pp.1159-1174
    • /
    • 2004
  • In this paper, we present an efficient algorithm for region analysis of business card images acquired in a PDA by using DCT and information pixel density. The proposed method consists of three parts: region segmentation, information region classification, and text region classification. In the region segmentation, an input business card image is partitioned into 8 f8 blocks and the blocks are classified into information and background blocks using the normalized DCT energy in their low frequency bands. The input image is then segmented into information and background regions by region labeling on the classified blocks. In the information region classification, each information region is classified into picture region or text region by using a ratio of the DCT energy of horizontal and vertical edge components to that in low frequency band and a density of information pixels, that are black pixels in its binarized region. In the text region classification, each text region is classified into large character region or small character region by using the density of information pixels and an averaged horizontal and vertical run-lengths of information pixels. Experimental results show that the proposed method yields good performance of region segmentation, information region classification, and text region classification for test images of several types of business cards acquired by a PDA under various surrounding conditions. In addition, the error rates of the proposed region segmentation are about 2.2-10.1% lower than those of the conventional region segmentation methods. It is also shown that the error rates of the proposed information region classification is about 1.7% lower than that of the conventional information region classification method.

An Anomaly Detection Framework Based on ICA and Bayesian Classification for IaaS Platforms

  • Wang, GuiPing;Yang, JianXi;Li, Ren
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • v.10 no.8
    • /
    • pp.3865-3883
    • /
    • 2016
  • Infrastructure as a Service (IaaS) encapsulates computer hardware into a large amount of virtual and manageable instances mainly in the form of virtual machine (VM), and provides rental service for users. Currently, VM anomaly incidents occasionally occur, which leads to performance issues and even downtime. This paper aims at detecting anomalous VMs based on performance metrics data of VMs. Due to the dynamic nature and increasing scale of IaaS, detecting anomalous VMs from voluminous correlated and non-Gaussian monitored performance data is a challenging task. This paper designs an anomaly detection framework to solve this challenge. First, it collects 53 performance metrics to reflect the running state of each VM. The collected performance metrics are testified not to follow the Gaussian distribution. Then, it employs independent components analysis (ICA) instead of principal component analysis (PCA) to extract independent components from collected non-Gaussian performance metric data. For anomaly detection, it employs multi-class Bayesian classification to determine the current state of each VM. To evaluate the performance of the designed detection framework, four types of anomalies are separately or jointly injected into randomly selected VMs in a campus-wide testbed. The experimental results show that ICA-based detection mechanism outperforms PCA-based and LDA-based detection mechanisms in terms of sensitivity and specificity.

HKIB-20000 & HKIB-40075: Hangul Benchmark Collections for Text Categorization Research

  • Kim, Jin-Suk;Choe, Ho-Seop;You, Beom-Jong;Seo, Jeong-Hyun;Lee, Suk-Hoon;Ra, Dong-Yul
    • Journal of Computing Science and Engineering
    • /
    • v.3 no.3
    • /
    • pp.165-180
    • /
    • 2009
  • The HKIB, or Hankookilbo, test collections are two archives of Korean newswire stories manually categorized with semi-hierarchical or hierarchical category taxonomies. The base newswire stories were made available by the Hankook Ilbo (The Korea Daily) for research purposes. At first, Chungnam National University and KISTI collaborated to manually tag 40,075 news stories with categories by semi-hierarchical and balanced three-level classification scheme, where each news story has only one level-3 category (single-labeling). We refer to this original data set as HKIB-40075 test collection. And then Yonsei University and KISTI collaborated to select 20,000 newswire stories from the HKIB-40075 test collection, to rearrange the classification scheme to be fully hierarchical but unbalanced, and to assign one or more categories to each news story (multi-labeling). We refer to this modified data set as HKIB-20000 test collection. We benchmark a k-NN categorization algorithm both on HKIB-20000 and on HKIB-40075, illustrating properties of the collections, providing baseline results for future studies, and suggesting new directions for further research on Korean text categorization problem.

Classification and Analysis of the Somatotype of Middle-aged Women through Side View Silhouette (우리나라 중년여성의 측면체형 분류)

  • 김순자
    • Journal of the Korean Society of Clothing and Textiles
    • /
    • v.20 no.2
    • /
    • pp.373-389
    • /
    • 1996
  • The purpose of this study was to classify the somatotype based on the side view and to analyze the characteristics of each somatotype. The subjects were 201 middle-aged women aged from 35 to 54. Data were collected through anthropometry and photometry and analyzed by factor analysis, cluster analysis, analysis of variance, and discriminant analysis. As the result of factor analysis for the classification of somatotypes, 6 factors which explain 80.8% of variance were extracted from 35 photometric measurement. Using factor scores cluster analysis was carried out and the subjects were classified into 4 cluster Each cluster was classified as straight type, turning over type, bending type and swayback according to its position to the relative plumb line and their side view contour. And 4 somatotypes were analyzed by theirs direct anthropometric and indirect Photometric measurment to represent physical characteristics of each group.

  • PDF

The methodology on the application of EEG as a diagonostic measures in Korean Traditional Medicine (뇌파의 한의학적 진단 지표로의 활용 방안에 대한 연구초안)

  • Seo, Young-Hyo;Kim, Gyeong-Cheol;Kim, Bo-Kyung
    • Journal of Oriental Neuropsychiatry
    • /
    • v.18 no.1
    • /
    • pp.37-61
    • /
    • 2007
  • Objective : By examining EEG status in Korean Traditional Medicine (KTM) from the viewpoint of 'form-qi theory(形氣論)', We wish to prepare for the fundamentals of applicability of KTM diagnoses to EEG. In addition, through reinterpretation of existing Western Medicine reports from the viewpoint of KTM, We tried to find out interrelationship between them. Method : In this paper, a methodology applicable to KTM diagnoses of EEG is presented from the EEG features in waveform characteristics, personalized diversity, and cognitive activity reflection. Results : Frequency bands are assigned to corresponding one of the eight trigrams in terms of yin/yang balance, which is analogous with EEG spectrum analysis mostly used in EEG quantification. The amplitude ratio of each EEG for each frequency band gives meaningful index numbers which can be used in EEG data interpretation, and every index number is named after the sixty four hexagrams. These approaches are adopted through both '4-band classification system and '6-band classification system', and applied to pre-existing reported EEG data obtained from normal adults. These analyses show that changes and distribution pattern in the index numbers are observed as a whole on both left-right line and front-back line connecting EEG measurement cephalic electrodes. And differences in distribution pattern of three index numbers deduced from '6-band classification system' are discussed according to constitution. Conclusion : The index numbers introduced here, which are the spectral power ratio for each EEG, are based on KTM yin/yang balance. These index numbers vary according to cephalic location, so its application in terms of traditional meridian theory is strongly expected. The index number distribution also shows different patterns according to constitution.

  • PDF

A new pattern classification algorithm for two-dimensional objects

  • You, Bum-Jae;Bien, Zeungnam
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1990.10b
    • /
    • pp.917-922
    • /
    • 1990
  • Pattern classification is an essential step in automatic robotic assembly which joins together finite number of seperated industrial parts. In this paper, a fast and systematic algorithm for classifying occlusion-free objects is proposed, using the notion of incremental circle transform which describes the boundary contour of an object as a parametric vector function of incremental elements. With similarity transform and line integral, normalized determinant curve of an object classifies each object, independent of position, orientation, scaling of an object and cyclic shift of the stating point for the boundary description.

  • PDF

Adoption of Support Vector Machine and Independent Component Analysis for Implementation of Speech Recognizer (음성인식기 구현을 위한 SVM과 독립성분분석 기법의 적용)

  • 박정원;김평환;김창근;허강인
    • Proceedings of the IEEK Conference
    • /
    • 2003.07e
    • /
    • pp.2164-2167
    • /
    • 2003
  • In this paper we propose effective speech recognizer through recognition experiments for three feature parameters(PCA, ICA and MFCC) using SVM(Support Vector Machine) classifier In general, SVM is classification method which classify two class set by finding voluntary nonlinear boundary in vector space and possesses high classification performance under few training data number. In this paper we compare recognition result for each feature parameter and propose ICA feature as the most effective parameter

  • PDF

Binary classification by the combination of Adaboost and feature extraction methods (특징 추출 알고리즘과 Adaboost를 이용한 이진분류기)

  • Ham, Seaung-Lok;Kwak, No-Jun
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.49 no.4
    • /
    • pp.42-53
    • /
    • 2012
  • In pattern recognition and machine learning society, classification has been a classical problem and the most widely researched area. Adaptive boosting also known as Adaboost has been successfully applied to binary classification problems. It is a kind of boosting algorithm capable of constructing a strong classifier through a weighted combination of weak classifiers. On the other hand, the PCA and LDA algorithms are the most popular linear feature extraction methods used mainly for dimensionality reduction. In this paper, the combination of Adaboost and feature extraction methods is proposed for efficient classification of two class data. Conventionally, in classification problems, the roles of feature extraction and classification have been distinct, i.e., a feature extraction method and a classifier are applied sequentially to classify input variable into several categories. In this paper, these two steps are combined into one resulting in a good classification performance. More specifically, each projection vector is treated as a weak classifier in Adaboost algorithm to constitute a strong classifier for binary classification problems. The proposed algorithm is applied to UCI dataset and FRGC dataset and showed better recognition rates than sequential application of feature extraction and classification methods.

The Construction of Semantic Networks for Korean "Cooking Verb" Based on the Argument Information. (논항 정보 기반 "요리 동사"의 어휘의미망 구축 방안)

  • Lee, Sukeui
    • Korean Linguistics
    • /
    • v.48
    • /
    • pp.223-268
    • /
    • 2010
  • The purpose of this paper is to build a semantic networks of the 'cooking class' verb (based on 'CoreNet' of KAIST). This proceedings needs to adjust the concept classification. Then sub-categories of [Cooking] and [Foodstuff] hierarchy of CoreNet was adjusted for the construction of verb semantic networks. For the building a semantic networks, each meaning of 'Cooking verbs' of Korean has to be analyzed. This paper focused on the Korean 'heating' verbs and 'non-heating'verbs. Case frame structure and argument information were inserted for the describing verb information. This paper use a Propege 3.3 as a tool for building "cooking verb" semantic networks. Each verb and noun was inserted into it's class, and connected by property relation marker 'HasThemeAs', 'IsMaterialOf'.

The Petrological and Geomechanical Studies of Rock Masses in the Site Area of the 3rd and 4th Seoul Subway Lines for an Engineering Classification of Rock Masses (서울 지하철(地下鐵) 부지일대(敷地一帶) 암석(岩石)의 암석학적(岩石學的) 및 암석역학적(岩石力學的) 기준설정(基準設定)을 위(爲)한 연구(硏究))

  • Kim, Ok Joon;Lee, Dai Sung;Jeong, Bong Il
    • Economic and Environmental Geology
    • /
    • v.17 no.1
    • /
    • pp.57-78
    • /
    • 1984
  • The object of this study is to offer the standarized data for the design and calculating engineering cost of the rock excavation an the construction of the 3rd and 4th Seoul Subway lines From Jnauary to March in 1983, this study was carried out by the both methods of the field and laboratary studies. In the field, the geological survey in the entire area of Seoul City and sites on the subway lines were carried out and also a site measure of uniaxial compressional strength of rock masses by using Schmidt hammer was done. The labartory studies were carsied out by a study of preuions surveyes, microscopic studies of the mineral composition and degree of weathering of rocks, and measure of uniaxial compressional strengths Finally an engineering classification of each rock masses of South Africa council for Scientific and Industrial Research, CSIR, after Bieniawski, 1974. was done. In this method of classification 6 parameters such as strength of intact rock material, rock quality designation, spacing of fractures, condition of fractures, groundwater conditions, and the effect of fracture strike and dip orientation in tunnelling were used to evaluate rating of each rock mass.

  • PDF