• Title/Summary/Keyword: generalization-process

Search Result 298, Processing Time 0.028 seconds

Estimation of Storage Capacity using Topographical Shape of Sand-bar and High Resolution Image in Urban Stream (도시하천의 지형태 자료와 영상정보를 이용한 수체적 시험평가)

  • Lee, Hyun Seok;Lee, Geun Sang
    • KSCE Journal of Civil and Environmental Engineering Research
    • /
    • v.28 no.3D
    • /
    • pp.445-450
    • /
    • 2008
  • Recently, environmental and ecological approaches is in progress in urban stream, especially the guarantee of instream flow becomes very important. In this paper, it is suggested that water volume estimation method utilizing the topographical shape data obtained by field investigation and satellite image to manage the urban stream efficiently. The data obtained at Gap River is the study area are analysed and those results are as belows. First, surveying to investigate topographic shape characteristics of urban stream is carried out. In details, the gradient characteristics from water surface to bottom in case of sand area and in case of grass area are 0.013 and 0.065 respectively. In conclusion, the gradient characteristic of grass area is five times bigger than that of sand area. Besides, IKONOS image is classified by spectrum analysis and Minimum Distance Method and the sand area extraction method by the generalization method as Median filter is suggested to calculate water volume. Finally, mapping process on the sand area extracted from the topographical shape field data in river and satellite images is carried out by the GIS spatial analysis. And on the assumption that the water level was 1m at that time when satellite image was taken, the water volume was $225,258m^3$. It is clarified that the effect of water volume improvement was about 10.5% in comparison with water volume that had no consideration on the gradient characteristics of sand-bar.

Generalization of an Evaluation Formula for Bearing Pressures on the Rubble Mound of Gravity-Based Harbor Structures (중력식 항만구조물의 사석마운드 지반반력 평가식의 일반화)

  • Woo-Sun Park
    • Journal of Korean Society of Coastal and Ocean Engineers
    • /
    • v.35 no.6
    • /
    • pp.128-137
    • /
    • 2023
  • In this study, the bearing pressure on the rubble mound of a gravity-based harbor structure with an arbitrarily shaped bottom was targeted. Assuming that the bottom of the structure is a rigid body, the rubble mound was modeled as a linear spring uniformly distributed on the bottom that resists compression only, and the bearing pressure evaluation formula was derived. It was confirmed that there were no errors in the derivation process by showing that when the bottom was square, the derived equation was converted to the equation used in the design. In addition, the validity of the derived equation was proven by examining the behavior and convergence value of the bearing pressure when an arbitrarily shaped bottom converges into a square one. In order to examine the adequacy of the method used in the current design, the end bearing pressure for the pre-designed breakwater cross-section was calculated and compared with the values in the design document. As a result, it was shown that the method used for design was not appropriate as it gave unsafe results. In particular, the difference was larger when the eccentricity of the vertical load was large, such as in the case of extreme design conditions.

Development of Bond Strength Model for FRP Plates Using Back-Propagation Algorithm (역전파 학습 알고리즘을 이용한 콘크리트와 부착된 FRP 판의 부착강도 모델 개발)

  • Park, Do-Kyong
    • Journal of the Korea institute for structural maintenance and inspection
    • /
    • v.10 no.2
    • /
    • pp.133-144
    • /
    • 2006
  • In order to catch out such Bond Strength, the preceding researchers had ever examined the Bond Strength of FRP Plate through their experimentations by setting up of various fluent. However, since the experiment for research on such Bond Strength takes much of expenditure for equipment structure and time-consuming, also difficult to carry out, it is conducting limitedly. This Study purposes to develop the most suitable Artificial Neural Network Model by application of various Neural Network Model and Algorithm to the adhering experiment data of the preceding researchers. Output Layer of Artificial Neural Network Model, and Input Layer of Bond Strength were performed the learning by selection as the variable of the thickness, width, adhered length, the modulus of elasticity, tensile strength, and the compressive strength of concrete, tensile strength, width, respectively. The developed Artificial Neural Network Model has applied Back-Propagation, and its error was learnt to be converged within the range of 0.001. Besides, the process for generalization has dissolved the problem of Over-Fitting in the way of more generalized method by introduction of Bayesian Technique. The verification on the developed Model was executed by comparison with the resulted value of Bond Strength made by the other preceding researchers which was never been utilized to the learning as yet.

Improving the Accuracy of Document Classification by Learning Heterogeneity (이질성 학습을 통한 문서 분류의 정확성 향상 기법)

  • Wong, William Xiu Shun;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.21-44
    • /
    • 2018
  • In recent years, the rapid development of internet technology and the popularization of smart devices have resulted in massive amounts of text data. Those text data were produced and distributed through various media platforms such as World Wide Web, Internet news feeds, microblog, and social media. However, this enormous amount of easily obtained information is lack of organization. Therefore, this problem has raised the interest of many researchers in order to manage this huge amount of information. Further, this problem also required professionals that are capable of classifying relevant information and hence text classification is introduced. Text classification is a challenging task in modern data analysis, which it needs to assign a text document into one or more predefined categories or classes. In text classification field, there are different kinds of techniques available such as K-Nearest Neighbor, Naïve Bayes Algorithm, Support Vector Machine, Decision Tree, and Artificial Neural Network. However, while dealing with huge amount of text data, model performance and accuracy becomes a challenge. According to the type of words used in the corpus and type of features created for classification, the performance of a text classification model can be varied. Most of the attempts are been made based on proposing a new algorithm or modifying an existing algorithm. This kind of research can be said already reached their certain limitations for further improvements. In this study, aside from proposing a new algorithm or modifying the algorithm, we focus on searching a way to modify the use of data. It is widely known that classifier performance is influenced by the quality of training data upon which this classifier is built. The real world datasets in most of the time contain noise, or in other words noisy data, these can actually affect the decision made by the classifiers built from these data. In this study, we consider that the data from different domains, which is heterogeneous data might have the characteristics of noise which can be utilized in the classification process. In order to build the classifier, machine learning algorithm is performed based on the assumption that the characteristics of training data and target data are the same or very similar to each other. However, in the case of unstructured data such as text, the features are determined according to the vocabularies included in the document. If the viewpoints of the learning data and target data are different, the features may be appearing different between these two data. In this study, we attempt to improve the classification accuracy by strengthening the robustness of the document classifier through artificially injecting the noise into the process of constructing the document classifier. With data coming from various kind of sources, these data are likely formatted differently. These cause difficulties for traditional machine learning algorithms because they are not developed to recognize different type of data representation at one time and to put them together in same generalization. Therefore, in order to utilize heterogeneous data in the learning process of document classifier, we apply semi-supervised learning in our study. However, unlabeled data might have the possibility to degrade the performance of the document classifier. Therefore, we further proposed a method called Rule Selection-Based Ensemble Semi-Supervised Learning Algorithm (RSESLA) to select only the documents that contributing to the accuracy improvement of the classifier. RSESLA creates multiple views by manipulating the features using different types of classification models and different types of heterogeneous data. The most confident classification rules will be selected and applied for the final decision making. In this paper, three different types of real-world data sources were used, which are news, twitter and blogs.

Optimal Selection of Classifier Ensemble Using Genetic Algorithms (유전자 알고리즘을 이용한 분류자 앙상블의 최적 선택)

  • Kim, Myung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.4
    • /
    • pp.99-112
    • /
    • 2010
  • Ensemble learning is a method for improving the performance of classification and prediction algorithms. It is a method for finding a highly accurateclassifier on the training set by constructing and combining an ensemble of weak classifiers, each of which needs only to be moderately accurate on the training set. Ensemble learning has received considerable attention from machine learning and artificial intelligence fields because of its remarkable performance improvement and flexible integration with the traditional learning algorithms such as decision tree (DT), neural networks (NN), and SVM, etc. In those researches, all of DT ensemble studies have demonstrated impressive improvements in the generalization behavior of DT, while NN and SVM ensemble studies have not shown remarkable performance as shown in DT ensembles. Recently, several works have reported that the performance of ensemble can be degraded where multiple classifiers of an ensemble are highly correlated with, and thereby result in multicollinearity problem, which leads to performance degradation of the ensemble. They have also proposed the differentiated learning strategies to cope with performance degradation problem. Hansen and Salamon (1990) insisted that it is necessary and sufficient for the performance enhancement of an ensemble that the ensemble should contain diverse classifiers. Breiman (1996) explored that ensemble learning can increase the performance of unstable learning algorithms, but does not show remarkable performance improvement on stable learning algorithms. Unstable learning algorithms such as decision tree learners are sensitive to the change of the training data, and thus small changes in the training data can yield large changes in the generated classifiers. Therefore, ensemble with unstable learning algorithms can guarantee some diversity among the classifiers. To the contrary, stable learning algorithms such as NN and SVM generate similar classifiers in spite of small changes of the training data, and thus the correlation among the resulting classifiers is very high. This high correlation results in multicollinearity problem, which leads to performance degradation of the ensemble. Kim,s work (2009) showedthe performance comparison in bankruptcy prediction on Korea firms using tradition prediction algorithms such as NN, DT, and SVM. It reports that stable learning algorithms such as NN and SVM have higher predictability than the unstable DT. Meanwhile, with respect to their ensemble learning, DT ensemble shows the more improved performance than NN and SVM ensemble. Further analysis with variance inflation factor (VIF) analysis empirically proves that performance degradation of ensemble is due to multicollinearity problem. It also proposes that optimization of ensemble is needed to cope with such a problem. This paper proposes a hybrid system for coverage optimization of NN ensemble (CO-NN) in order to improve the performance of NN ensemble. Coverage optimization is a technique of choosing a sub-ensemble from an original ensemble to guarantee the diversity of classifiers in coverage optimization process. CO-NN uses GA which has been widely used for various optimization problems to deal with the coverage optimization problem. The GA chromosomes for the coverage optimization are encoded into binary strings, each bit of which indicates individual classifier. The fitness function is defined as maximization of error reduction and a constraint of variance inflation factor (VIF), which is one of the generally used methods to measure multicollinearity, is added to insure the diversity of classifiers by removing high correlation among the classifiers. We use Microsoft Excel and the GAs software package called Evolver. Experiments on company failure prediction have shown that CO-NN is effectively applied in the stable performance enhancement of NNensembles through the choice of classifiers by considering the correlations of the ensemble. The classifiers which have the potential multicollinearity problem are removed by the coverage optimization process of CO-NN and thereby CO-NN has shown higher performance than a single NN classifier and NN ensemble at 1% significance level, and DT ensemble at 5% significance level. However, there remain further research issues. First, decision optimization process to find optimal combination function should be considered in further research. Secondly, various learning strategies to deal with data noise should be introduced in more advanced further researches in the future.

A Study on the Born Global Venture Corporation's Characteristics and Performance ('본글로벌(born global)전략'을 추구하는 벤처기업의 특성과 성과에 관한 연구)

  • Kim, Hyung-Jun;Jung, Duk-Hwa
    • Journal of Global Scholars of Marketing Science
    • /
    • v.17 no.3
    • /
    • pp.39-59
    • /
    • 2007
  • The international involvement of a firm has been described as a gradual development process "a process in which the enterprise gradually increases its international involvement in many studies. This process evolves in the interplay between the development of knowledge about foreign markets and operations on one hand and increasing commitment of resources to foreign markets on the other." On the basis of Uppsala internationalization model, many studies strengthen strong theoretical and empirical support. According to the predictions of the classic stages theory, the internationalization process of firms have been recognized and characterized gradual evolution to foreign markets, so called stage theory: indirect & direct export, strategic alliance and foreign direct investment. However, termed "international new ventures" (McDougall, Shane, and Oviatt 1994), "born globals" (Knight 1997; Knight and Cavusgil 1996; Madsen and Servais 1997), "instant internationals" (Preece, Miles, and Baetz 1999), or "global startups" (Oviatt and McDougall 1994) have been used and come into spotlight in internationalization study of technology intensity venture companies. Recent researches focused on venture company have suggested the phenomenons of 'born global' firms as a contradiction to the stages theory. Especially the article by Oviatt and McDougall threw the spotlight on international entrepreneurs, on international new ventures, and on their importance in the globalising world economy. Since venture companies have, by definition. lack of economies of scale, lack of resources (financial and knowledge), and aversion to risk taking, they have a difficulty in expanding their market to abroad and pursue internalization gradually and step by step. However many venture companies have pursued 'Born Global Strategy', which is different from process strategy, because corporate's environment has been rapidly changing to globalization. The existing studies investigate that (1) why the ventures enter into overseas market in those early stage, even in infancy, (2) what make the different international strategy among ventures and the born global strategy is better to the infant ventures. However, as for venture's performance(growth and profitability), the existing results do not correspond each other. They also, don't include marketing strategy (differentiation, low price, market breadth and market pioneer) that is important factors in studying of BGV's performance. In this paper I aim to delineate the appearance of international new ventures and the phenomenons of venture companies' internationalization strategy. In order to verify research problems, I develop a resource-based model and marketing strategies for analyzing the effects of the born global venture firms. In this paper, I suggested 3 research problems. First, do the korean venture companies take some advantages in the aspects of corporate's performances (growth, profitability and overall market performances) when they pursue internationalization from inception? Second, do the korean BGV have firm specific assets (foreign experiences, foreign orientation, organizational absorptive capacity)? Third, What are the marketing strategies of korean BGV and is it different from others? Under these problems, I test then (1) whether the BGV that a firm started its internationalization activity almost from inception, has more intangible resources(foreign experience of corporate members, foreign orientation, technological competences and absorptive capacity) than any other venture firms(Non_BGV) and (2) also whether the BGV's marketing strategies-differentiation, low price, market diversification and preemption strategy are different from Non_BGV. Above all, the main purpose of this research is that results achieved by BGV are indeed better than those obtained by Non_BGV firms with respect to firm's growth rate and efficiency. To do this research, I surveyed venture companies located in Seoul and Deajeon in Korea during November to December, 2005. I gather the data from 200 venture companies and then selected 84 samples, which have been founded during 1999${\sim}$2000. To compare BGV's characteristics with those of Non_BGV, I also had to classify BGV by export intensity over 50% among five or six aged venture firms. Many other researches tried to classify BGV and Non_BGV, but there were various criterion as many as researchers studied on this topic. Some of them use time gap, which is time difference of establishment and it's first internationalization experience and others use export intensity, ration of export sales amount divided by total sales amount. Although using a mixed criterion of prior research in my case, I do think this kinds of criterion is subjective and arbitrary rather than objective, so I do mention my research has some critical limitation in the classification of BGV and Non_BGV. The first purpose of research is the test of difference of performance between BGV and Non_BGV. As a result of t-test, the research show that there are statistically efficient difference not only in the growth rate (sales growth rate compared to competitors and 3 years averaged sales growth rate) but also in general market performance of BGV. But in case of profitability performance, the hypothesis that is BGV is more profit (return on investment(ROI) compared to competitors and 3 years averaged ROI) than Non-BGV was not supported. From these results, this paper concludes that BGV grows rapidly and gets a high market performance (in aspect of market share and customer loyalty) but there is no profitability difference between BGV and Non_BGV. The second result is that BGV have more absorptive capacity especially, knowledge competence, and entrepreneur's international experience than Non_BGV. And this paper also found BGV search for product differentiation, exemption strategy and market diversification strategy while Non_BGV search for low price strategy. These results have never been dealt with other existing studies. This research has some limitations. First limitation is concerned about the definition of BGV, as I mentioned above. Conceptually speaking, BGV is defined as company pursue internationalization from inception, but in empirical study, it's very difficult to classify between BGV and Non_BGV. I tried to classify on the basis of time difference and export intensity, this criterions are so subjective and arbitrary that the results are not robust if the criterion were changed. Second limitation is concerned about sample used in this research. I surveyed venture companies just located in Seoul and Daejeon and also use only 84 samples which more or less provoke sample bias problem and generalization of results. I think the more following studies that focus on ventures located in other region, the better to verify the results of this paper.

  • PDF

The Effects of Emotional Perception on Major Satisfaction among Students at the Department of Dental Hygiene (치위생과 학생의 정서적 인식이 전공만족도에 미치는 영향)

  • Yu, Ji-Su;Choi, Su-Young
    • Journal of dental hygiene science
    • /
    • v.10 no.5
    • /
    • pp.307-314
    • /
    • 2010
  • This study aimed to measure such features of emotional responses perceived by students as learning climate, department living stress, and perceived helplessness to analyze their effects on major satisfaction among students at the department of dental hygiene; to do this, a survey was conducted with 431 students, regardless of college year, who were at the department of dental hygiene in four colleges in Gyeonggi Province, Daejeon, and Chungcheong Province. An existing emotion scale which went through the generalization process was used to draw a multiple model in the combination form in order to collect emotional factors affecting college students' satisfaction with their major, which had existed as a hypothetical proposition, and make overall interpretation of relevance through the explainable, predictable modeling process by measuring emotional factors and phenomenal description of the level of general perception. The results showed that major satisfaction was very significantly affected by emotional features among students at the department of dental hygiene, which needs to be treated as an important factor to enhance expertise related to major learning and improve students' living.

A Study 0n the Improvement of the domestic in producing area organizations According to the change retail environment: Focused on organized, scaled, Specialization. (농산물 소매유통환경 변화에 따른 국내 산지유통조직 개선방안에 관한 연구: 조직화·규모화·전문화를 중심으로)

  • Kim, Dae-Yun
    • The Journal of Industrial Distribution & Business
    • /
    • v.2 no.2
    • /
    • pp.5-14
    • /
    • 2011
  • Opening agricultural market expansion, reduced purchases through wholesale markets, expanding the influence large retailers of consumer's market such as changes in the distribution system to the farmer's market conditions are changing rapidly. Because of this, retailers of the scaled and chain-store operations was centered on distribution environmental changes of the consumer market place. In producing area due to changes in market conditions in the agricultural production of in producing area distribution organization and the size distribution can not be put off no longer challenge is imminent. If it do not raise forces banded together, the producer is bound to remain as the weak. To support the distribution of this production was introduced in 2000 enable the Activation Project of in producing area distribution. Recent in producing area Changes of Agricultural conditions in order to cope with the Small-scale farmers and small individual farmers are becoming Scaled and specialized. Also, is specific to each item and regional is showing aspects. Government support for Activation Project of in producing area distribution is greatly improved, but in terms of competitiveness on the market still is showing the limitations. The most common of these problems, the market response if in producing area producer's organization and scale of the problem. Equipped for the purpose of consumer market place responsiveness unreasonable propelled outward from the Painter-sized weakens the organizational power. also, Difficult to succeed organizational size is a dissolution or anything within a few years, farmers around the best producer organizations, such as deviation occurs is exposed to a variety of issues. In this study, previous studies refer to the recent changes in agricultural retail environment, background and needs of organization·scaled, Determine the status of the domestic in producing area organizations and derived Problems, look into Domestic and overseas of in producing area organization with best practices for enhancing the competitiveness of the proposed improvement are intended to. In the future, in producing area distribution policy would like to provide direction to the development. The results of the study showed the follwing : 1) enhance utilization and orrganized through the diversification of the agricultural Collection systems. 2) Scaled to achieve through Items of specialized a wide area marketing. 3) Management operating units, such as installation and operating that overseas the best practices " Comite Economique Agricole Regional 'Fruits et Legumes' de Bretagne". 4) To establish a support system that in producing area distribution organization model development for appropriate domestic. In particular, in case of domestic in producing area distribution organization, through the analysis of various case study that a successful organization and scaled. The process of the various challenges arising in organizational scaled and generalization, and by the way he goes about trying to overcome is required. At the end of the study's limitations and future research directions suggested.

  • PDF

An Intelligent Intrusion Detection Model Based on Support Vector Machines and the Classification Threshold Optimization for Considering the Asymmetric Error Cost (비대칭 오류비용을 고려한 분류기준값 최적화와 SVM에 기반한 지능형 침입탐지모형)

  • Lee, Hyeon-Uk;Ahn, Hyun-Chul
    • Journal of Intelligence and Information Systems
    • /
    • v.17 no.4
    • /
    • pp.157-173
    • /
    • 2011
  • As the Internet use explodes recently, the malicious attacks and hacking for a system connected to network occur frequently. This means the fatal damage can be caused by these intrusions in the government agency, public office, and company operating various systems. For such reasons, there are growing interests and demand about the intrusion detection systems (IDS)-the security systems for detecting, identifying and responding to unauthorized or abnormal activities appropriately. The intrusion detection models that have been applied in conventional IDS are generally designed by modeling the experts' implicit knowledge on the network intrusions or the hackers' abnormal behaviors. These kinds of intrusion detection models perform well under the normal situations. However, they show poor performance when they meet a new or unknown pattern of the network attacks. For this reason, several recent studies try to adopt various artificial intelligence techniques, which can proactively respond to the unknown threats. Especially, artificial neural networks (ANNs) have popularly been applied in the prior studies because of its superior prediction accuracy. However, ANNs have some intrinsic limitations such as the risk of overfitting, the requirement of the large sample size, and the lack of understanding the prediction process (i.e. black box theory). As a result, the most recent studies on IDS have started to adopt support vector machine (SVM), the classification technique that is more stable and powerful compared to ANNs. SVM is known as a relatively high predictive power and generalization capability. Under this background, this study proposes a novel intelligent intrusion detection model that uses SVM as the classification model in order to improve the predictive ability of IDS. Also, our model is designed to consider the asymmetric error cost by optimizing the classification threshold. Generally, there are two common forms of errors in intrusion detection. The first error type is the False-Positive Error (FPE). In the case of FPE, the wrong judgment on it may result in the unnecessary fixation. The second error type is the False-Negative Error (FNE) that mainly misjudges the malware of the program as normal. Compared to FPE, FNE is more fatal. Thus, when considering total cost of misclassification in IDS, it is more reasonable to assign heavier weights on FNE rather than FPE. Therefore, we designed our proposed intrusion detection model to optimize the classification threshold in order to minimize the total misclassification cost. In this case, conventional SVM cannot be applied because it is designed to generate discrete output (i.e. a class). To resolve this problem, we used the revised SVM technique proposed by Platt(2000), which is able to generate the probability estimate. To validate the practical applicability of our model, we applied it to the real-world dataset for network intrusion detection. The experimental dataset was collected from the IDS sensor of an official institution in Korea from January to June 2010. We collected 15,000 log data in total, and selected 1,000 samples from them by using random sampling method. In addition, the SVM model was compared with the logistic regression (LOGIT), decision trees (DT), and ANN to confirm the superiority of the proposed model. LOGIT and DT was experimented using PASW Statistics v18.0, and ANN was experimented using Neuroshell 4.0. For SVM, LIBSVM v2.90-a freeware for training SVM classifier-was used. Empirical results showed that our proposed model based on SVM outperformed all the other comparative models in detecting network intrusions from the accuracy perspective. They also showed that our model reduced the total misclassification cost compared to the ANN-based intrusion detection model. As a result, it is expected that the intrusion detection model proposed in this paper would not only enhance the performance of IDS, but also lead to better management of FNE.

The Prediction of DEA based Efficiency Rating for Venture Business Using Multi-class SVM (다분류 SVM을 이용한 DEA기반 벤처기업 효율성등급 예측모형)

  • Park, Ji-Young;Hong, Tae-Ho
    • Asia pacific journal of information systems
    • /
    • v.19 no.2
    • /
    • pp.139-155
    • /
    • 2009
  • For the last few decades, many studies have tried to explore and unveil venture companies' success factors and unique features in order to identify the sources of such companies' competitive advantages over their rivals. Such venture companies have shown tendency to give high returns for investors generally making the best use of information technology. For this reason, many venture companies are keen on attracting avid investors' attention. Investors generally make their investment decisions by carefully examining the evaluation criteria of the alternatives. To them, credit rating information provided by international rating agencies, such as Standard and Poor's, Moody's and Fitch is crucial source as to such pivotal concerns as companies stability, growth, and risk status. But these types of information are generated only for the companies issuing corporate bonds, not venture companies. Therefore, this study proposes a method for evaluating venture businesses by presenting our recent empirical results using financial data of Korean venture companies listed on KOSDAQ in Korea exchange. In addition, this paper used multi-class SVM for the prediction of DEA-based efficiency rating for venture businesses, which was derived from our proposed method. Our approach sheds light on ways to locate efficient companies generating high level of profits. Above all, in determining effective ways to evaluate a venture firm's efficiency, it is important to understand the major contributing factors of such efficiency. Therefore, this paper is constructed on the basis of following two ideas to classify which companies are more efficient venture companies: i) making DEA based multi-class rating for sample companies and ii) developing multi-class SVM-based efficiency prediction model for classifying all companies. First, the Data Envelopment Analysis(DEA) is a non-parametric multiple input-output efficiency technique that measures the relative efficiency of decision making units(DMUs) using a linear programming based model. It is non-parametric because it requires no assumption on the shape or parameters of the underlying production function. DEA has been already widely applied for evaluating the relative efficiency of DMUs. Recently, a number of DEA based studies have evaluated the efficiency of various types of companies, such as internet companies and venture companies. It has been also applied to corporate credit ratings. In this study we utilized DEA for sorting venture companies by efficiency based ratings. The Support Vector Machine(SVM), on the other hand, is a popular technique for solving data classification problems. In this paper, we employed SVM to classify the efficiency ratings in IT venture companies according to the results of DEA. The SVM method was first developed by Vapnik (1995). As one of many machine learning techniques, SVM is based on a statistical theory. Thus far, the method has shown good performances especially in generalizing capacity in classification tasks, resulting in numerous applications in many areas of business, SVM is basically the algorithm that finds the maximum margin hyperplane, which is the maximum separation between classes. According to this method, support vectors are the closest to the maximum margin hyperplane. If it is impossible to classify, we can use the kernel function. In the case of nonlinear class boundaries, we can transform the inputs into a high-dimensional feature space, This is the original input space and is mapped into a high-dimensional dot-product space. Many studies applied SVM to the prediction of bankruptcy, the forecast a financial time series, and the problem of estimating credit rating, In this study we employed SVM for developing data mining-based efficiency prediction model. We used the Gaussian radial function as a kernel function of SVM. In multi-class SVM, we adopted one-against-one approach between binary classification method and two all-together methods, proposed by Weston and Watkins(1999) and Crammer and Singer(2000), respectively. In this research, we used corporate information of 154 companies listed on KOSDAQ market in Korea exchange. We obtained companies' financial information of 2005 from the KIS(Korea Information Service, Inc.). Using this data, we made multi-class rating with DEA efficiency and built multi-class prediction model based data mining. Among three manners of multi-classification, the hit ratio of the Weston and Watkins method is the best in the test data set. In multi classification problems as efficiency ratings of venture business, it is very useful for investors to know the class with errors, one class difference, when it is difficult to find out the accurate class in the actual market. So we presented accuracy results within 1-class errors, and the Weston and Watkins method showed 85.7% accuracy in our test samples. We conclude that the DEA based multi-class approach in venture business generates more information than the binary classification problem, notwithstanding its efficiency level. We believe this model can help investors in decision making as it provides a reliably tool to evaluate venture companies in the financial domain. For the future research, we perceive the need to enhance such areas as the variable selection process, the parameter selection of kernel function, the generalization, and the sample size of multi-class.