• Title/Summary/Keyword: Weighting Selection

Search Result 164, Processing Time 0.034 seconds

Optimal k-Nearest Neighborhood Classifier Using Genetic Algorithm (유전알고리즘을 이용한 최적 k-최근접이웃 분류기)

  • Park, Chong-Sun;Huh, Kyun
    • Communications for Statistical Applications and Methods
    • /
    • v.17 no.1
    • /
    • pp.17-27
    • /
    • 2010
  • Feature selection and feature weighting are useful techniques for improving the classification accuracy of k-Nearest Neighbor (k-NN) classifier. The main propose of feature selection and feature weighting is to reduce the number of features, by eliminating irrelevant and redundant features, while simultaneously maintaining or enhancing classification accuracy. In this paper, a novel hybrid approach is proposed for simultaneous feature selection, feature weighting and choice of k in k-NN classifier based on Genetic Algorithm. The results have indicated that the proposed algorithm is quite comparable with and superior to existing classifiers with or without feature selection and feature weighting capability.

An Empirical Study on Improving the Performance of Text Categorization Considering the Relationships between Feature Selection Criteria and Weighting Methods (자질 선정 기준과 가중치 할당 방식간의 관계를 고려한 문서 자동분류의 개선에 대한 연구)

  • Lee Jae-Yun
    • Journal of the Korean Society for Library and Information Science
    • /
    • v.39 no.2
    • /
    • pp.123-146
    • /
    • 2005
  • This study aims to find consistent strategies for feature selection and feature weighting methods, which can improve the effectiveness and efficiency of kNN text classifier. Feature selection criteria and feature weighting methods are as important factor as classification algorithms to achieve good performance of text categorization systems. Most of the former studies chose conflicting strategies for feature selection criteria and weighting methods. In this study, the performance of several feature selection criteria are measured considering the storage space for inverted index records and the classification time. The classification experiments in this study are conducted to examine the performance of IDF as feature selection criteria and the performance of conventional feature selection criteria, e.g. mutual information, as feature weighting methods. The results of these experiments suggest that using those measures which prefer low-frequency features as feature selection criterion and also as feature weighting method. we can increase the classification speed up to three or five times without loosing classification accuracy.

Comparing Korean Spam Document Classification Using Document Classification Algorithms (문서 분류 알고리즘을 이용한 한국어 스팸 문서 분류 성능 비교)

  • Song, Chull-Hwan;Yoo, Seong-Joon
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2006.10c
    • /
    • pp.222-225
    • /
    • 2006
  • 한국은 다른 나라에 비해 많은 인터넷 사용자를 가지고 있다. 이에 비례해서 한국의 인터넷 유저들은 Spam Mail에 대해 많은 불편함을 호소하고 있다. 이러한 문제를 해결하기 위해 본 논문은 다양한 Feature Weighting, Feature Selection 그리고 문서 분류 알고리즘들을 이용한 한국어 스팸 문서 Filtering연구에 대해 기술한다. 그리고 한국어 문서(Spam/Non-Spam 문서)로부터 영사를 추출하고 이를 각 분류 알고리즘의 Input Feature로써 이용한다. 그리고 우리는 Feature weighting 에 대해 기존의 전통적인 방법이 아니라 각 Feature에 대해 Variance 값을 구하고 Global Feature를 선택하기 위해 Max Value Selection 방법에 적용 후에 전통적인 Feature Selection 방법인 MI, IG, CHI 들을 적용하여 Feature들을 추출한다. 이렇게 추출된 Feature들을 Naive Bayes, Support Vector Machine과 같은 분류 알고리즘에 적용한다. Vector Space Model의 경우에는 전통적인 방법 그대로 사용한다. 그 결과 우리는 Support Vector Machine Classifier, TF-IDF Variance Weighting(Combined Max Value Selection), CHI Feature Selection 방법을 사용할 경우 Recall(99.4%), Precision(97.4%), F-Measure(98.39%)의 성능을 보였다.

  • PDF

Optimization of Weighting Matrix selection (상태 비중 행렬의 선택에 대한 최적화)

  • 권봉환;윤명중
    • Journal of the Korean Institute of Telematics and Electronics
    • /
    • v.22 no.3
    • /
    • pp.91-94
    • /
    • 1985
  • A method optimizing selection of a state weighting matrix is presented. The state weight-ing matrix is chosen so that the closed-loop system responses closely match to the ideal model responses. An algorithm is presented which determines a positive semidefinite state weighting matrix in the linear quadratic optimal control design problem and an numerical example is given to show the effect of the present algorithm.

  • PDF

An Application of Case-Based Reasoning in Forecasting a Successful Implementation of Enterprise Resource Planning Systems : Focus on Small and Medium sized Enterprises Implementing ERP (성공적인 ERP 시스템 구축 예측을 위한 사례기반추론 응용 : ERP 시스템을 구현한 중소기업을 중심으로)

  • Lim Se-Hun
    • Journal of Information Technology Applications and Management
    • /
    • v.13 no.1
    • /
    • pp.77-94
    • /
    • 2006
  • Case-based Reasoning (CBR) is widely used in business and industry prediction. It is suitable to solve complex and unstructured business problems. Recently, the prediction accuracy of CBR has been enhanced by not only various machine learning algorithms such as genetic algorithms, relative weighting of Artificial Neural Network (ANN) input variable but also data mining technique such as feature selection, feature weighting, feature transformation, and instance selection As a result, CBR is even more widely used today in business area. In this study, we investigated the usefulness of the CBR method in forecasting success in implementing ERP systems. We used a CBR method based on the feature weighting technique to compare the performance of three different models : MDA (Multiple Discriminant Analysis), GECBR (GEneral CBR), FWCBR (CBR with Feature Weighting supported by Analytic Hierarchy Process). The study suggests that the FWCBR approach is a promising method for forecasting of successful ERP implementation in Small and Medium sized Enterprises.

  • PDF

Partly Random Multiple Weighting Matrices Selection for Orthogonal Random Beamforming

  • Tan, Li;Li, Zhongcai;Xu, Chao;Wang, Desheng
    • Journal of Communications and Networks
    • /
    • v.18 no.6
    • /
    • pp.892-901
    • /
    • 2016
  • In the multi-user multiple-input multiple-output (MIMO) system, orthogonal random beamforming (ORBF) scheme is proposed to serve multiple users simultaneously in order to achieve the multi-user diversity gain. The opportunistic space-division multiple access system (OSDMA-S) scheme performs multiple weighting matrices during the training phase and chooses the best weighting matrix to be used to broadcast data during the transmitting phase. The OSDMA-S scheme works better than the original ORBF by decreasing the inter-user interference during the transmitting phase. To save more time in the training phase, a partly random multiple weighting matrices selection scheme is proposed in this paper. In our proposed scheme, the Base Station does not need to use several unitary matrices to broadcast pilot symbol. Actually, only one broadcasting operation is needed. Each subscriber generates several virtual equivalent channels with a set of pre-saved unitary matrices and the channel status information gained from the broadcasting operation. The signal-to-interference and noise ratio (SINR) of each beam in each virtual equivalent channel is calculated and fed back to the base station for the weighting matrix selection and multi-user scheduling. According to the theoretical analysis, the proposed scheme relatively expands the transmitting phase and reduces the interactive complexity between the Base Station and subscribers. The asymptotic analysis and the simulation results show that the proposed scheme improves the throughput performance of the multi-user MIMO system.

Factors of Consumer' s Digital Content Selection : Focusing on Web-toon (소비자들의 디지털컨텐츠 선택 요인 : 웹툰을 중심으로)

  • Oh, Yongmin;Jung, Hunsik;Boo, Jeman
    • Journal of Korean Society of Industrial and Systems Engineering
    • /
    • v.42 no.3
    • /
    • pp.217-231
    • /
    • 2019
  • The purpose of this study is to analyze the factors influencing consumers' selection of web-toon service through AHP (Analytic Hierarchy Process) analysis and to provide the strategy of web-toon service. To accomplish this study, theories, existing research and references related to AHP were sufficiently examined and selected the factors in the selection criteria. Surveys from consumers who used the web-toon service were conducted with selected factors. Through this, the results were analyzed by AHP analysis to find out the weighting values and the differences were examined and analyzed. The highest weighting factor in the first layer that consists of web-toon service was cinematic quality. The cinematic quality was the most important factor in the selection criteria of customers who use the web-toon service regardless of their preferred genre. Furthermore, it was confirmed that the weighting value or ranking changed in the second layer by genre. In this study, the effective basis of strategy were suggested by ranking the quantitative selection factors according to the preferred genre of consumers using web-toon services. In addition, This research provides some practical implications. That is, the web-toon service provider can easily recognize and respond to the customer's requirements, which factors are important when the customer selects a specific genre from the web-toon genre.

GA Based Control Parameter Selection Method for Optimal TCSC Control (GA를 이용한 TCSG 제어기의 파라메터 선정)

  • Kim, Hak-Man;Oh, Tae-Kyoo;Shin, Myong-Chul;Son, Kwang-Myoung
    • Proceedings of the KIEE Conference
    • /
    • 1997.07c
    • /
    • pp.841-843
    • /
    • 1997
  • In this paper we present a Genetic approach to select weighting matrices of LQ(Linear Quadratic) controller for optimal TCSC(Thyristor Controlled Series Capacitor) control. A design of LQ controller depends on choosing weighting matrices. The selection of weighting matrices is usually carried out by trial and error, which is not a trivial problem. We proposed a efficient method using GA of finding weighting matrices for optimal control law. The proposed GA method was applied to design LQ controller of TCSC in one machine infinite bus system and showed good results.

  • PDF

Frame Selection, Hybrid, Modified Weighting Model Rank Method for Robust Text-independent Speaker Identification (강건한 문맥독립 화자식별을 위한 프레임 선택방법, 복합방법, 수정된 가중모델순위 방법)

  • 김민정;오세진;정호열;정현열
    • The Journal of the Acoustical Society of Korea
    • /
    • v.21 no.8
    • /
    • pp.735-743
    • /
    • 2002
  • In this paper, we propose three new text-independent speaker identification methods. At first, to exclude the frames not having enough features of speaker's vocal from calculation of the maximum likelihood, we propose the FS(Frame Selection) method. This approach selects the important frames by evaluating the difference between the biggest likelihood and the second in each frame, and uses only the frames in calculating the score of likelihood. Our secondly proposed, called the Hybrid, is a combined version of the FS and WMR(Weighting Model Rank). This method determines the claimed speaker using exponential function weights, instead of likelihood itself, only on the selected frames obtained from the FS method. The last proposed, called MWMR (Modified WMR), considers both original likelihood itself and its relative position, when the claimed speaker is determined. It is different from the WMR that take into account only the relative position of likelihood. Through the experiments of the speaker identification, we show that the all the proposed have higher identification rates than the ML. In addition, the Hybrid and MWMR have higher identification rate about 2% and about 3% than WMR, respectively.

Robust Design of Reactor Power Control System with Genetic Algorithm-Applied Weighting Functions

  • Lee, Yoon-Joon;Cho, Kyung-Ho;Kim, Sin
    • Nuclear Engineering and Technology
    • /
    • v.30 no.4
    • /
    • pp.353-363
    • /
    • 1998
  • The H$_{\infty}$ algorithms of the mixed weight sensitivity is used for the robust design of the reactor power control system. The mixed weight sensitivity method requires the selection of the proper weighting functions for the loop shaping in frequency domain. The complexity of the system equation and the non-convexity of the problem make it very difficult to determine the weighting functions. The genetic algorithm which is improved and hybridized with the simulated annealing is applied to determine the weighting functions. This approach permits an automatic calculation and the resultant system shows good robustness and performance.

  • PDF