Search | Korea Science

Performance Improvement of Web Document Classification through Incorporation of Feature Selection and Weighting (특징선택과 특징가중의 융합을 통한 웹문서분류 성능의 개선)

Lee, Ah-Ram;Kim, Han-Joon;Man, Xuan
- The Journal of the Institute of Internet, Broadcasting and Communication
- /
- v.13 no.4
- /
- pp.141-148
- /
- 2013
Automated classification systems which utilize machine learning develops classification models through learning process, and then classify unknown data into predefined set of categories according to the model. The performance of machine learning-based classification systems relies greatly upon the quality of features composing classification models. For textual data, we can use their word terms and structure information in order to generate the set of features. Particularly, in order to extract feature from Web documents, we need to analyze tag and hyperlink information. Recent studies on Web document classification focus on feature engineering technology other than machine learning algorithms themselves. Thus this paper proposes a novel method of incorporating feature selection and weighting which can improves classification models effectively. Through extensive experiments using Web-KB document collections, the proposed method outperforms conventional ones.
https://doi.org/10.7236/JIIBC.2013.13.4.141 인용 PDF KSCI

IoT-based Feature Selection Technique Research Trend (IoT 기반의 특징 선택 기법 연구 동향)

Lim, Hwan-Hee;Lee, Tae-Ho;Lee, Byung-Jun;Kim, Kyung-Tae;Youn, Hee-Yong
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2018.07a
- /
- pp.41-42
- /
- 2018
특징 선택이란, 기계학습에서 분류 정확도를 향상시키기 위해서 많은 특징들을 분석해 가장 좋은 성능을 나타낼 수 있게끔 특징의 부분집합을 찾아내는 방법이다. 특징 선택 연구는 수십만개의 변수가 있는 데이터 세트를 이용하는 응용분야에서 주로 연구된다. 이러한 응용 분야는 주로 텍스트 처리, 유전자 배열 분석과 같은 고차원 데이터를 분석하는 분야이다. 또한, IoT 환경은 많은 데이터를 처리하기 때문에, 데이터 분류나 데이터의 가공을 위해서는 특징 선택 기법이 필수적이다. 본 논문에서는 특징 선택 기법에 대해 설명하고, IoT 환경에서 특징 선택 기법을 제안한다.
PDF

Sign Language Recognition using a Modified Fuzzy Min-Max Neural Network Model (수정된 퍼지 최대-최소 신경망 모델을 이용한 수화 인식 기법)

Park, So-Jeong;Kim, Ho-Joon
- Proceedings of the Korea Information Processing Society Conference
- /
- 2011.11a
- /
- pp.257-260
- /
- 2011
본 논문에서는 수화인식을 위한 신경망에서 특징추출과 분류단계의 방법론과, 특징 선별 기법을 통하여 분류기의 규모를 최적화 하는 방법을 고찰한다. 색상 및 움직임정보로부터 특징영역의 시간에 따른 변화를 3 차원 볼륨형태의 데이터로 표현하며, 이로부터 특징지도를 생성하는 과정에서 특징영역의 위치에 대한 변이를 보완하는 방법을 고려한다. 특징추출과정과 패턴 분류과정에서 점진적 학습이 가능한 모델과 특징 수를 효과적으로 줄일 수 있는 방법론을 제시하였으며, 학습된 신경망으로부터 특징과 패턴 클래스간의 상대적 연관성 척도를 정의하여 특징을 선별하도록 하였다. 제안된 내용에 대하여 여섯 가지 수화패턴에 대상으로 한 실험을 통하여 그 유용성을 평가하였다.
https://doi.org/10.3745/PKIPS.y2011m11a.257 인용 PDF

The Comparison of features for Speech/Music Discrimination (음성/음악 분류를 위한 특징 비교)

Lee Kyong Rok;Seo Bong Su;Kim Jin Young
- Proceedings of the Acoustical Society of Korea Conference
- /
- spring
- /
- pp.157-160
- /
- 2000
본 논문에서는 멀티미디어 정보에서 원하는 정보를 추출하는 멀티미디어 인덱싱 중 오디오 인덱싱의 전처리 부격인 음성/음악 분류실험을 하였다. 오디오 인덱싱에 있어서 음성/음악 분류기는 원 오디오 신호에서 정보를 가진 음성 부분을 분리하는 역할을 한다. 실험에서는 음성/음악 분류에서 널리 쓰이는 멜캡스트럼(Mel Cepstrum), 정규화 로그 에너지(normalized log energy), 영교차(Zero-Crossings)를 특징 파라미터로 사용하였다[l, 2, 3]. 특징공간은 GMM(Gaussian Mixture Model)에 의해 모델링 되었고, 오디오 신호의 분류는 각각 3가지 분류항목(음성, 음악, 음성+음악)과 2가지 분류항목(음성, 음악)을 적용하였다. 실험결과 3가지 분류항목 적용시와 2가지 분류항목 적용시 모두 멜캡스트럼을 사용하였을 때 가장 좋은 결과를 보였다.
PDF

특징형상 테이터를 이용한 선행관계 추출과 작업순서 결정

이충수;노형민;김성식
- Proceedings of the Korean Operations and Management Science Society Conference
- /
- 1996.04a
- /
- pp.352-357
- /
- 1996
특징형상 데이터는 공정설계의 입력 정보로 사용되며, 부품 서술 데이터, 기하학적 데이터, 가공 기술적 데이터로 분류할 수 있다. 또한 공정순서및 작업순서 결정에서 선행관계는 반드시 고려하여 위배되지 않도록 해야하는 중요한 요소이다. 본 연구에서는 작업순서 결정시 만족해야하는 선행관계를 기하형상에 의한 선행관계, 단위 특징형상의 작업내용들간의 선행관계, 가공 경험에 의한 선행관계 등으로 분류/정의하였고, 특징형상 데이터와 가공지식을 이용하여 분류된 선행관계를 자동으로 추출하는 방법을 제안하였다. 그리고 추출한 선행관계를, 공구 교환횟수를 최소로 하는 작업순서 결정 알고리즘에 적용한 사례를 정리하였다.
PDF

A Modified Fuzzy Min-Max Neural Network for Pattern Classification (수정된 퍼지 최대최소 신경망을 이용한 패턴분류)

최형수;정경훈;김호준
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.04b
- /
- pp.565-567
- /
- 2004
본 연구에서는 효과적인 패턴 분류를 위한 방법론으로서 수정된 퍼지 최대최소 신경망 모델을 제안하고 그 유용성을 고찰한다 제안된 모델에서 각 하이퍼박스는 다차원의 특징공간상에서 한 영역으로 정의되며 각 특징에 대하여 가중치 개념이 추가된 소속함수를 갖는다. 이는 기존의 FMM 신경망에서 모든 특징에 대하여 균일하게 고려되었던 특징의 상대적 중요도를 서로 다른 값으로 반영할 수 있게 한다. 본 연구에서는 제안된 모델의 동작특성 및 학습방법을 소개하며, 실제 패턴 분류문제에 적용한 실험결과를 통하여 제안된 이론의 타당성을 평가한다.
PDF

Video Segmentation Using Image signal and Human characteristic (영상신호 특성 및 Human 특징을 이용한 실시간 영상 분류)

Kim, Min-Joon;Kim, Won-Ha
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2016.06a
- /
- pp.284-287
- /
- 2016
영상에서 배경으로부터 객체를 분류하는 영상 분류 알고리즘은 물체 인식 및 추적 등 다양한 응용분야에서 중요하다. 본 논문에서는 고정된 카메라에서 다수의 초기 프레임을 참조하여 실시간 영상 분류 방법을 제안한다. 먼저 전경과 배경을 구분하는 확률모델을 제안하였으며 초기 프레임 동안에 카메라의 특성을 추출하여 카메라에 적응적으로 영상을 분류한다. 또한 분류된 영상에서 human의 특징을 이용하여 분류된 결과를 보정하는 방법을 제안한다. 마지막으로 제안한 알고리즘의 실시간 분류 처리를 위하여 복잡도를 최소화 하였다.
PDF

Feature Ranking for Detection of Neuro-degeneration and Vascular Dementia in micro-Raman spectra of Platelet (특징 순위 방법을 이용한 혈소판 라만 스펙트럼에서 퇴행성 뇌신경질환과 혈관성 인지증 분류)

Park, Aa-Ron;Baek, Sung-June
- Journal of the Institute of Electronics Engineers of Korea CI
- /
- v.48 no.4
- /
- pp.21-26
- /
- 2011
Feature ranking is useful to gain knowledge of data and identify relevant features. In this study, we proposed a use of feature ranking for classification of neuro-degeneration and vascular dementia in micro-Raman spectra of platelet. The entire region of the spectrum is divided into local region including several peaks, followed by Gaussian curve fitting method in the region to be modeled. Local minima select from the subregion and then remove the background based on the position by using interpolation method. After preprocessing steps, significant features were selected by feature ranking method to improve the classification accuracy and the computational complexity of classification system. PCA (principal component analysis) transform the selected features and the overall features that is used classification with the number of principal components. These were classified as MAP (maximum a posteriori) and it compared with classification result using overall features. In all experiments, the computational complexity of the classification system was remarkably reduced and the classification accuracy was partially increased. Particularly, the proposed method increased the classification accuracy in the experiment classifying the Parkinson's disease and normal with the average 1.7 %. From the result, it confirmed that proposed method could be efficiently used in the classification system of the neuro-degenerative disease and vascular dementia of platelet.
PDF KSCI

Hierarchical Gabor Feature and Bayesian Network for Handwritten Digit Recognition (계층적인 가버 특징들과 베이지안 망을 이용한 필기체 숫자인식)

성재모;방승양
- Journal of KIISE:Software and Applications
- /
- v.31 no.1
- /
- pp.1-7
- /
- 2004
For the handwritten digit recognition, this paper Proposes a hierarchical Gator features extraction method and a Bayesian network for them. Proposed Gator features are able to represent hierarchically different level information and Bayesian network is constructed to represent hierarchically structured dependencies among these Gator features. In order to extract such features, we define Gabor filters level by level and choose optimal Gabor filters by using Fisher's Linear Discriminant measure. Hierarchical Gator features are extracted by optimal Gabor filters and represent more localized information in the lower level. Proposed methods were successfully applied to handwritten digit recognition with well-known naive Bayesian classifier, k-nearest neighbor classifier. and backpropagation neural network and showed good performance.
PDF KSCI

Two-Stage Neural Networks for Sign Language Pattern Recognition (수화 패턴 인식을 위한 2단계 신경망 모델)

Kim, Ho-Joon
- Journal of the Korean Institute of Intelligent Systems
- /
- v.22 no.3
- /
- pp.319-327
- /
- 2012
In this paper, we present a sign language recognition model which does not use any wearable devices for object tracking. The system design issues and implementation issues such as data representation, feature extraction and pattern classification methods are discussed. The proposed data representation method for sign language patterns is robust for spatio-temporal variances of feature points. We present a feature extraction technique which can improve the computation speed by reducing the amount of feature data. A neural network model which is capable of incremental learning is described and the behaviors and learning algorithm of the model are introduced. We have defined a measure which reflects the relevance between the feature values and the pattern classes. The measure makes it possible to select more effective features without any degradation of performance. Through the experiments using six types of sign language patterns, the proposed model is evaluated empirically.
https://doi.org/10.5391/JKIIS.2012.22.3.319 인용 PDF KSCI

Search Result 4,453, Processing Time 0.031 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)