Search | Korea Science

Classification of High Dimensionality Data through Feature Selection Using Markov Blanket

Lee, Junghye;Jun, Chi-Hyuck
- Industrial Engineering and Management Systems
- /
- v.14 no.2
- /
- pp.210-219
- /
- 2015
A classification task requires an exponentially growing amount of computation time and number of observations as the variable dimensionality increases. Thus, reducing the dimensionality of the data is essential when the number of observations is limited. Often, dimensionality reduction or feature selection leads to better classification performance than using the whole number of features. In this paper, we study the possibility of utilizing the Markov blanket discovery algorithm as a new feature selection method. The Markov blanket of a target variable is the minimal variable set for explaining the target variable on the basis of conditional independence of all the variables to be connected in a Bayesian network. We apply several Markov blanket discovery algorithms to some high-dimensional categorical and continuous data sets, and compare their classification performance with other feature selection methods using well-known classifiers.
https://doi.org/10.7232/iems.2015.14.2.210 인용 PDF KSCI

Efficient Markov Feature Extraction Method for Image Splicing Detection (접합영상 검출을 위한 효율적인 마코프 특징 추출 방법)

Han, Jong-Goo;Park, Tae-Hee;Eom, Il-Kyu
- Journal of the Institute of Electronics and Information Engineers
- /
- v.51 no.9
- /
- pp.111-118
- /
- 2014
This paper presents an efficient Markov feature extraction method for detecting splicing forged images. The Markov states used in our method are composed of the difference between DCT coefficients in the adjacent blocks. Various first-order Markov state transition probabilities are extracted as features for splicing detection. In addition, we propose a feature reduction algorithm by analysing the distribution of the Markov probability. After training the extracted feature vectors using the SVM classifier, we determine whether the presence of the image splicing forgery. Experimental results verify that the proposed method shows good detection performance with a smaller number of features compared to existing methods.
https://doi.org/10.5573/ieie.2014.51.9.111 인용 PDF KSCI

Video Summarization Using Hidden Markov Model (은닉 마르코브 모델을 이용한 비디오 요약 시스템)

박호식;배철수
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.8 no.6
- /
- pp.1175-1181
- /
- 2004
This paper proposes a system to analyze and summarize the video shots of baseball game TV program into fifteen categories. Our System consists of three modules: feature extraction, Hidden Markov Model (HMM) training, and video shot categorization. Video Shots belongs to the same class are not necessarily similar, so we require that the training set is large enough to include video shot with all possible variations to create a robust Hidden Markov Model. In the experiments, we have illustrated that our system can recognize the 15 different shot classes with a success ratio of 84.72%.
PDF KSCI

Selective Feature Extraction Method Between Markov Transition Probability and Co-occurrence Probability for Image Splicing Detection (접합 영상 검출을 위한 마르코프 천이 확률 및 동시발생 확률에 대한 선택적 특징 추출 방법)

Han, Jong-Goo;Eom, Il-Kyu;Moon, Yong-Ho;Ha, Seok-Wun
- Journal of the Korea Institute of Information and Communication Engineering
- /
- v.20 no.4
- /
- pp.833-839
- /
- 2016
In this paper, we propose a selective feature extraction algorithm between Markov transition probability and co-occurrence probability for an effective image splicing detection. The Features used in our method are composed of the difference values between DCT coefficients in the adjacent blocks and the value of Kullback-Leibler divergence(KLD) is calculated to evaluate the differences between the distribution of original image features and spliced image features. KLD value is an efficient measure for selecting Markov feature or Co-occurrence feature because KLD shows non-similarity of the two distributions. After training the extracted feature vectors using the SVM classifier, we determine whether the presence of the image splicing forgery. To verify our algorithm we used grid search and 6-folds cross-validation. Based on the experimental results it shows that the proposed method has good detection performance with a limited number of features compared to conventional methods.
https://doi.org/10.6109/jkiice.2016.20.4.833 인용 PDF KSCI

A Method for Short Text Classification using SNS Feature Information based on Markov Logic Networks (SNS 특징정보를 활용한 마르코프 논리 네트워크 기반의 단문 텍스트 분류 방법)

Lee, Eunji;Kim, Pankoo
- Journal of Korea Multimedia Society
- /
- v.20 no.7
- /
- pp.1065-1072
- /
- 2017
As smart devices and social network services (SNSs) become increasingly pervasive, individuals produce large amounts of data in real time. Accordingly, studies on unstructured data analysis are actively being conducted to solve the resultant problem of information overload and to facilitate effective data processing. Many such studies are conducted for filtering inappropriate information. In this paper, a feature-weighting method considering SNS-message features is proposed for the classification of short text messages generated on SNSs, using Markov logic networks for category inference. The performance of the proposed method is verified through a comparison with an existing frequency-based classification methods.
https://doi.org/10.9717/kmms.2017.20.7.1065 인용 PDF KSCI

The Use of MSVM and HMM for Sentence Alignment

Fattah, Mohamed Abdel
- Journal of Information Processing Systems
- /
- v.8 no.2
- /
- pp.301-314
- /
- 2012
In this paper, two new approaches to align English-Arabic sentences in bilingual parallel corpora based on the Multi-Class Support Vector Machine (MSVM) and the Hidden Markov Model (HMM) classifiers are presented. A feature vector is extracted from the text pair that is under consideration. This vector contains text features such as length, punctuation score, and cognate score values. A set of manually prepared training data was assigned to train the Multi-Class Support Vector Machine and Hidden Markov Model. Another set of data was used for testing. The results of the MSVM and HMM outperform the results of the length based approach. Moreover these new approaches are valid for any language pairs and are quite flexible since the feature vector may contain less, more, or different features, such as a lexical matching feature and Hanzi characters in Japanese-Chinese texts, than the ones used in the current research.
https://doi.org/10.3745/JIPS.2012.8.2.301 인용 PDF KSCI

Investigating the Performance of Bayesian-based Feature Selection and Classification Approach to Social Media Sentiment Analysis (소셜미디어 감성분석을 위한 베이지안 속성 선택과 분류에 대한 연구)

Chang Min Kang;Kyun Sun Eo;Kun Chang Lee
- Information Systems Review
- /
- v.24 no.1
- /
- pp.1-19
- /
- 2022
Social media-based communication has become crucial part of our personal and official lives. Therefore, it is no surprise that social media sentiment analysis has emerged an important way of detecting potential customers' sentiment trends for all kinds of companies. However, social media sentiment analysis suffers from huge number of sentiment features obtained in the process of conducting the sentiment analysis. In this sense, this study proposes a novel method by using Bayesian Network. In this model MBFS (Markov Blanket-based Feature Selection) is used to reduce the number of sentiment features. To show the validity of our proposed model, we utilized online review data from Yelp, a famous social media about restaurant, bars, beauty salons evaluation and recommendation. We used a number of benchmarking feature selection methods like correlation-based feature selection, information gain, and gain ratio. A number of machine learning classifiers were also used for our validation tasks, like TAN, NBN, Sons & Spouses BN (Bayesian Network), Augmented Markov Blanket. Furthermore, we conducted Bayesian Network-based what-if analysis to see how the knowledge map between target node and related explanatory nodes could yield meaningful glimpse into what is going on in sentiments underlying the target dataset.
https://doi.org/10.14329/isr.2022.24.1.001 인용 PDF

Combination Tandem Architecture with Segmental Features for Robust Speech Recognition (강인한 음성 인식을 위한 탠덤 구조와 분절 특징의 결합)

Yun, Young-Sun;Lee, Yun-Keun
- MALSORI
- /
- no.62
- /
- pp.113-131
- /
- 2007
It is reported that the segmental feature based recognition system shows better results than conventional feature based system in the previous studies. On the other hand, the various studies of combining neural network and hidden Markov models within a single system are done with expectations that it may potentially combine the advantages of both systems. With the influence of these studies, tandem approach was presented to use neural network as the classifier and hidden Markov models as the decoder. In this paper, we applied the trend information of segmental features to tandem architecture and used posterior probabilities, which are the output of neural network, as inputs of recognition system. The experiments are performed on Auroral database to examine the potentiality of the trend feature based tandem architecture. From the results, the proposed system outperforms on very low SNR environments. Consequently, we argue that the trend information on tandem architecture can be additionally used for traditional MFCC features.
PDF

Emotion recognition from speech using Gammatone auditory filterbank

Le, Ba-Vui;Lee, Young-Koo;Lee, Sung-Young
- Proceedings of the Korean Information Science Society Conference
- /
- 2011.06a
- /
- pp.255-258
- /
- 2011
An application of Gammatone auditory filterbank for emotion recognition from speech is described in this paper. Gammatone filterbank is a bank of Gammatone filters which are used as a preprocessing stage before applying feature extraction methods to get the most relevant features for emotion recognition from speech. In the feature extraction step, the energy value of output signal of each filter is computed and combined with other of all filters to produce a feature vector for the learning step. A feature vector is estimated in a short time period of input speech signal to take the advantage of dependence on time domain. Finally, in the learning step, Hidden Markov Model (HMM) is used to create a model for each emotion class and recognize a particular input emotional speech. In the experiment, feature extraction based on Gammatone filterbank (GTF) shows the better outcomes in comparison with features based on Mel-Frequency Cepstral Coefficient (MFCC) which is a well-known feature extraction for speech recognition as well as emotion recognition from speech.

Enhanced Independent Component Analysis of Temporal Human Expressions Using Hidden Markov model

Lee, J.J.;Uddin, Zia;Kim, T.S.
- 한국HCI학회:학술대회논문집
- /
- 2008.02a
- /
- pp.487-492
- /
- 2008
Facial expression recognition is an intensive research area for designing Human Computer Interfaces. In this work, we present a new facial expression recognition system utilizing Enhanced Independent Component Analysis (EICA) for feature extraction and discrete Hidden Markov Model (HMM) for recognition. Our proposed approach for the first time deals with sequential images of emotion-specific facial data analyzed with EICA and recognized with HMM. Performance of our proposed system has been compared to the conventional approaches where Principal and Independent Component Analysis are utilized for feature extraction. Our preliminary results show that our proposed algorithm produces improved recognition rates in comparison to previous works.
PDF

Search Result 196, Processing Time 0.022 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)