• Title/Summary/Keyword: supervised and unsupervised classification

Search Result 100, Processing Time 0.025 seconds

Supervised Classification Using Training Parameters and Prior Probability Generated from VITD - The Case of QuickBird Multispectral Imagery

  • Eo, Yang-Dam;Lee, Gyeong-Wook;Park, Doo-Youl;Park, Wang-Yong;Lee, Chang-No
    • Korean Journal of Remote Sensing
    • /
    • v.24 no.5
    • /
    • pp.517-524
    • /
    • 2008
  • In order to classify an satellite imagery into geospatial features of interest, the supervised classification needs to be trained to distinguish these features through training sampling. However, even though an imagery is classified, different results of classification could be generated according to operator's experience and expertise in training process. Users who practically exploit an classification result to their applications need the research accomplishment for the consistent result as well as the accuracy improvement. The experiment includes the classification results for training process used VITD polygons as a prior probability and training parameter, instead of manual sampling. As results, classification accuracy using VITD polygons as prior probabilities shows the highest results in several methods. The training using unsupervised classification with VITD have produced similar classification results as manual training and/or with prior probability.

Unsupervised feature learning for classification

  • Abdullaev, Mamur;Alikhanov, Jumabek;Ko, Seunghyun;Jo, Geun Sik
    • Proceedings of the Korean Society of Computer Information Conference
    • /
    • 2016.07a
    • /
    • pp.51-54
    • /
    • 2016
  • In computer vision especially in image processing, it has become popular to apply deep convolutional networks for supervised learning. Convolutional networks have shown a state of the art results in classification, object recognition, detection as well as semantic segmentation. However, supervised learning has two major disadvantages. One is it requires huge amount of labeled data to get high accuracy, the second one is to train so much data takes quite a bit long time. On the other hand, unsupervised learning can handle these problems more cheaper way. In this paper we show efficient way to learn features for classification in an unsupervised way. The network trained layer-wise, used backpropagation and our network learns features from unlabeled data. Our approach shows better results on Caltech-256 and STL-10 dataset.

  • PDF

Damage classification of concrete structures based on grey level co-occurrence matrix using Haar's discrete wavelet transform

  • Kabir, Shahid;Rivard, Patrice
    • Computers and Concrete
    • /
    • v.4 no.3
    • /
    • pp.243-257
    • /
    • 2007
  • A novel method for recognition, characterization, and quantification of deterioration in bridge components and laboratory concrete samples is presented in this paper. The proposed scheme is based on grey level co-occurrence matrix texture analysis using Haar's discrete wavelet transform on concrete imagery. Each image is described by a subset of band-filtered images containing wavelet coefficients, and then reconstructed images are employed in characterizing the texture, using grey level co-occurrence matrices, of the different types and degrees of damage: map-cracking, spalling and steel corrosion. A comparative study was conducted to evaluate the efficiency of the supervised maximum likelihood and unsupervised K-means classification techniques, in order to classify and quantify the deterioration and its extent. Experimental results show both methods are relatively effective in characterizing and quantifying damage; however, the supervised technique produced more accurate results, with overall classification accuracies ranging from 76.8% to 79.1%.

Severity-based Fault Prediction using Unsupervised Learning (비감독형 학습 기법을 사용한 심각도 기반 결함 예측)

  • Hong, Euyseok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.18 no.3
    • /
    • pp.151-157
    • /
    • 2018
  • Most previous studies of software fault prediction have focused on supervised learning models for binary classification that determines whether an input module has faults or not. However, binary classification model determines only the presence or absence of faults in the module without considering the complex characteristics of the fault, and supervised model has the limitation that it requires a training data set that most development groups do not have. To solve these two problems, this paper proposes severity-based ternary classification model using unsupervised learning algorithms, and experimental results show that the proposed model has comparable performance to the supervised models.

Vegetation Mapping of Hawaiian Coastal Lowland Using Remotely Sensed Data (원격탐사 자료를 이용한 하와이 해안지역 식생 분류)

  • Park, Sun-Yurp
    • Journal of the Korean association of regional geographers
    • /
    • v.12 no.4
    • /
    • pp.496-507
    • /
    • 2006
  • A hybrid approach integrating both high-resolution and hyperspectral data sets was used to map vegetation cover of a coastal lowland area in the Hawaii Volcanoes National Park. Three common grass species (broomsedge, natal redtop, and pili) and other non-grass species, primarily shrubs, were focused in the study. A 3-step, hybrid approach, combining an unsupervised and a supervised classification schemes, was applied to the vegetation mapping. First, the IKONOS 1-m high-resolution data were classified to create a binary image (vegetated vs. non--vegetated) and converted to 20-meter resolution percent cover vegetation data to match AVIRIS data pixels. Second, the minimum noise fraction (MNF) transformation was used to extract a coherent dimensionality from the original AVIRIS data. Since the grasses and shubs were sparsely distributed and most image pixels were intermingled with lava surfaces, the reflectance component of lava was filtered out with a binary fractional cover analysis assuming that tile total reflectance of a pixel was a linear combination of the reflectance spectra of vegetation and the lava surface. Finally, a supervised approach was used to classify the plant species based on tile maximum likelihood algorithm.

  • PDF

A Sliding Window-based Multivariate Stream Data Classification (슬라이딩 윈도우 기반 다변량 스트림 데이타 분류 기법)

  • Seo, Sung-Bo;Kang, Jae-Woo;Nam, Kwang-Woo;Ryu, Keun-Ho
    • Journal of KIISE:Databases
    • /
    • v.33 no.2
    • /
    • pp.163-174
    • /
    • 2006
  • In distributed wireless sensor network, it is difficult to transmit and analyze the entire stream data depending on limited networks, power and processor. Therefore it is suitable to use alternative stream data processing after classifying the continuous stream data. We propose a classification framework for continuous multivariate stream data. The proposed approach works in two steps. In the preprocessing step, it takes input as a sliding window of multivariate stream data and discretizes the data in the window into a string of symbols that characterize the signal changes. In the classification step, it uses a standard text classification algorithm to classify the discretized data in the window. We evaluated both supervised and unsupervised classification algorithms. For supervised, we tested Bayesian classifier and SVM, and for unsupervised, we tested Jaccard, TFIDF Jaro and Jaro Winkler. In our experiments, SVM and TFIDF outperformed other classification methods. In particular, we observed that classification accuracy is improved when the correlation of attributes is also considered along with the n-gram tokens of symbols.

Temporal Analysis on the Transition of Land Cover Change and Growth of Mining Area Using Landsat TM/+ETM Satellite Imagery in Tuv, Mongolia (Landsat TM/+ETM 위성영상을 이용한 몽골 Tuv지역의 토지피복변화 및 광산지역확대 추이분석)

  • Erdenesumbee, Suld;Cho, Misu;Cho, Gisung
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.32 no.5
    • /
    • pp.451-457
    • /
    • 2014
  • Recently, the land degradation and pasture erosion in Tuv, located around Ulaanbaatar of Mongolia, have been increasing sharply due to escalating developments of mining sectors, well as the density of populations. Because of that, we have chosen the urban and mining area of Tuv for our study target. During the study, the temporal changes of land cover in Tuv, Mongolia were observed by the Landsat TM/+ETM satellite images from 2001 to 2009 that provided the fundamental dataset to apply NDVI and K-Mean algorithm of Unsupervised Classification and Maximum likelihood classification(MLC) of Supervised Classification in order to conclude in land cover change analyzation. The result of our study implies that the growth of mining area, the climate change, and the density of population led the land degradation to desertification.

A Comparison Study of Classification Algorithms in Data Mining

  • Lee, Seung-Joo;Jun, Sung-Rae
    • International Journal of Fuzzy Logic and Intelligent Systems
    • /
    • v.8 no.1
    • /
    • pp.1-5
    • /
    • 2008
  • Generally the analytical tools of data mining have two learning types which are supervised and unsupervised learning algorithms. Classification and prediction are main analysis tools for supervised learning. In this paper, we perform a comparison study of classification algorithms in data mining. We make comparative studies between popular classification algorithms which are LDA, QDA, kernel method, K-nearest neighbor, naive Bayesian, SVM, and CART. Also, we use almost all classification data sets of UCI machine learning repository for our experiments. According to our results, we are able to select proper algorithms for given classification data sets.

Statistical bioinformatics for gene expression data

  • Lee, Jae-K.
    • Proceedings of the Korean Society for Bioinformatics Conference
    • /
    • 2001.08a
    • /
    • pp.103-127
    • /
    • 2001
  • Gene expression studies require statistical experimental designs and validation before laboratory confirmation. Various clustering approaches, such as hierarchical, Kmeans, SOM are commonly used for unsupervised learning in gene expression data. Several classification methods, such as gene voting, SVM, or discriminant analysis are used for supervised lerning, where well-defined response classification is possible. Estimating gene-condition interaction effects require advanced, computationally-intensive statistical approaches.

  • PDF

A Clustering-based Semi-Supervised Learning through Initial Prediction of Unlabeled Data (미분류 데이터의 초기예측을 통한 군집기반의 부분지도 학습방법)

  • Kim, Eung-Ku;Jun, Chi-Hyuck
    • Journal of the Korean Operations Research and Management Science Society
    • /
    • v.33 no.3
    • /
    • pp.93-105
    • /
    • 2008
  • Semi-supervised learning uses a small amount of labeled data to predict labels of unlabeled data as well as to improve clustering performance, whereas unsupervised learning analyzes only unlabeled data for clustering purpose. We propose a new clustering-based semi-supervised learning method by reflecting the initial predicted labels of unlabeled data on the objective function. The initial prediction should be done in terms of a discrete probability distribution through a classification method using labeled data. As a result, clusters are formed and labels of unlabeled data are predicted according to the Information of labeled data in the same cluster. We evaluate and compare the performance of the proposed method in terms of classification errors through numerical experiments with blinded labeled data.