• Title/Summary/Keyword: Selection process

Search Result 3,396, Processing Time 0.036 seconds

The Persuit of Rationality and the Mathematics Education (합리성의 추구와 수학교육)

  • Kang Wan
    • The Mathematical Education
    • /
    • v.24 no.2
    • /
    • pp.105-116
    • /
    • 1986
  • For any thought and knowledge, its growth and development has close relation with the society where it is developed and grow. As Feuerbach says, the birth of spirit needs an existence of two human beings, i. e. the social background, as well as the birth of body does. But, at the educational viewpoint, the spread and the growth of such a thought or knowledge that influence favorably the development of a society must be also considered. We would discuss the goal and the function of mathematics education in relation with the prosperity of a technological civilization. But, the goal and the function are not unrelated with the spiritual culture which is basis of the technological civilization. Most societies of today can be called open democratic societies or societies which are at least standing such. The concept of rationality in such societies is a methodological principle which completes the democratic society. At the same time, it is asserted as an educational value concept which explains comprehensively the standpoint and the attitude of one who is educated in such a society. Especially, we can considered the cultivation of a mathematical thinking or a logical thinking in the goal of mathematics education as a concept which is included in such an educational value concept. The use of the concept of rationality depends on various viewpoints and criterions. We can analyze the concept of rationality at two aspects, one is the aspect of human behavior and the other is that of human belief or knowledge. Generally speaking, the rationality in human behavior means a problem solving power or a reasoning power as an instrument, i. e. the human economical cast of mind. But, the conceptual condition like this cannot include value concept. On the other hand, the rationality in human knowledge is related with the problem of rationality in human belief. For any statement which represents a certain sort of knowledge, its universal validity cannot be assured. The statements of value judgment which represent the philosophical knowledge cannot but relate to the argument on the rationality in human belief, because their finality do not easily turn out to be true or false. The positive statements in science also relate to the argument on the rationality in human belief, because there are no necessary relations between the proposition which states the all-pervasive rule and the proposition which is induced from the results of observation. Especially, the logical statement in logic or mathematics resolves itself into a question of the rationality in human belief after all, because all the logical proposition have their logical propriety in a certain deductive system which must start from some axioms, and the selection and construction of an axiomatic system cannot but depend on the belief of a man himself. Thus, we can conclude that a question of the rationality in knowledge or belief is a question of the rationality both in the content of belief or knowledge and in the process where one holds his own belief. And the rationality of both the content and the process is namely an deal form of a human ability and attitude in one's rational behavior. Considering the advancement of mathematical knowledge, we can say that mathematics is a good example which reflects such a human rationality, i. e. the human ability and attitude. By this property of mathematics itself, mathematics is deeply rooted as a good. subject which as needed in moulding the ability and attitude of a rational person who contributes to the development of the open democratic society he belongs to. But, it is needed to analyze the practicing and pursuing the rationality especially in mathematics education. Mathematics teacher must aim the rationality of process where the mathematical belief is maintained. In fact, there is no problem in the rationality of content as long the mathematics teacher does not draw mathematical conclusions without bases. But, in the mathematical activities he presents in his class, mathematics teacher must be able to show hem together with what even his own belief on the efficiency and propriety of mathematical activites can be altered and advanced by a new thinking or new experiences.

  • PDF

Improving the Accuracy of Document Classification by Learning Heterogeneity (이질성 학습을 통한 문서 분류의 정확성 향상 기법)

  • Wong, William Xiu Shun;Hyun, Yoonjin;Kim, Namgyu
    • Journal of Intelligence and Information Systems
    • /
    • v.24 no.3
    • /
    • pp.21-44
    • /
    • 2018
  • In recent years, the rapid development of internet technology and the popularization of smart devices have resulted in massive amounts of text data. Those text data were produced and distributed through various media platforms such as World Wide Web, Internet news feeds, microblog, and social media. However, this enormous amount of easily obtained information is lack of organization. Therefore, this problem has raised the interest of many researchers in order to manage this huge amount of information. Further, this problem also required professionals that are capable of classifying relevant information and hence text classification is introduced. Text classification is a challenging task in modern data analysis, which it needs to assign a text document into one or more predefined categories or classes. In text classification field, there are different kinds of techniques available such as K-Nearest Neighbor, Naïve Bayes Algorithm, Support Vector Machine, Decision Tree, and Artificial Neural Network. However, while dealing with huge amount of text data, model performance and accuracy becomes a challenge. According to the type of words used in the corpus and type of features created for classification, the performance of a text classification model can be varied. Most of the attempts are been made based on proposing a new algorithm or modifying an existing algorithm. This kind of research can be said already reached their certain limitations for further improvements. In this study, aside from proposing a new algorithm or modifying the algorithm, we focus on searching a way to modify the use of data. It is widely known that classifier performance is influenced by the quality of training data upon which this classifier is built. The real world datasets in most of the time contain noise, or in other words noisy data, these can actually affect the decision made by the classifiers built from these data. In this study, we consider that the data from different domains, which is heterogeneous data might have the characteristics of noise which can be utilized in the classification process. In order to build the classifier, machine learning algorithm is performed based on the assumption that the characteristics of training data and target data are the same or very similar to each other. However, in the case of unstructured data such as text, the features are determined according to the vocabularies included in the document. If the viewpoints of the learning data and target data are different, the features may be appearing different between these two data. In this study, we attempt to improve the classification accuracy by strengthening the robustness of the document classifier through artificially injecting the noise into the process of constructing the document classifier. With data coming from various kind of sources, these data are likely formatted differently. These cause difficulties for traditional machine learning algorithms because they are not developed to recognize different type of data representation at one time and to put them together in same generalization. Therefore, in order to utilize heterogeneous data in the learning process of document classifier, we apply semi-supervised learning in our study. However, unlabeled data might have the possibility to degrade the performance of the document classifier. Therefore, we further proposed a method called Rule Selection-Based Ensemble Semi-Supervised Learning Algorithm (RSESLA) to select only the documents that contributing to the accuracy improvement of the classifier. RSESLA creates multiple views by manipulating the features using different types of classification models and different types of heterogeneous data. The most confident classification rules will be selected and applied for the final decision making. In this paper, three different types of real-world data sources were used, which are news, twitter and blogs.

Optimal Selection of Classifier Ensemble Using Genetic Algorithms (유전자 알고리즘을 이용한 분류자 앙상블의 최적 선택)

  • Kim, Myung-Jong
    • Journal of Intelligence and Information Systems
    • /
    • v.16 no.4
    • /
    • pp.99-112
    • /
    • 2010
  • Ensemble learning is a method for improving the performance of classification and prediction algorithms. It is a method for finding a highly accurateclassifier on the training set by constructing and combining an ensemble of weak classifiers, each of which needs only to be moderately accurate on the training set. Ensemble learning has received considerable attention from machine learning and artificial intelligence fields because of its remarkable performance improvement and flexible integration with the traditional learning algorithms such as decision tree (DT), neural networks (NN), and SVM, etc. In those researches, all of DT ensemble studies have demonstrated impressive improvements in the generalization behavior of DT, while NN and SVM ensemble studies have not shown remarkable performance as shown in DT ensembles. Recently, several works have reported that the performance of ensemble can be degraded where multiple classifiers of an ensemble are highly correlated with, and thereby result in multicollinearity problem, which leads to performance degradation of the ensemble. They have also proposed the differentiated learning strategies to cope with performance degradation problem. Hansen and Salamon (1990) insisted that it is necessary and sufficient for the performance enhancement of an ensemble that the ensemble should contain diverse classifiers. Breiman (1996) explored that ensemble learning can increase the performance of unstable learning algorithms, but does not show remarkable performance improvement on stable learning algorithms. Unstable learning algorithms such as decision tree learners are sensitive to the change of the training data, and thus small changes in the training data can yield large changes in the generated classifiers. Therefore, ensemble with unstable learning algorithms can guarantee some diversity among the classifiers. To the contrary, stable learning algorithms such as NN and SVM generate similar classifiers in spite of small changes of the training data, and thus the correlation among the resulting classifiers is very high. This high correlation results in multicollinearity problem, which leads to performance degradation of the ensemble. Kim,s work (2009) showedthe performance comparison in bankruptcy prediction on Korea firms using tradition prediction algorithms such as NN, DT, and SVM. It reports that stable learning algorithms such as NN and SVM have higher predictability than the unstable DT. Meanwhile, with respect to their ensemble learning, DT ensemble shows the more improved performance than NN and SVM ensemble. Further analysis with variance inflation factor (VIF) analysis empirically proves that performance degradation of ensemble is due to multicollinearity problem. It also proposes that optimization of ensemble is needed to cope with such a problem. This paper proposes a hybrid system for coverage optimization of NN ensemble (CO-NN) in order to improve the performance of NN ensemble. Coverage optimization is a technique of choosing a sub-ensemble from an original ensemble to guarantee the diversity of classifiers in coverage optimization process. CO-NN uses GA which has been widely used for various optimization problems to deal with the coverage optimization problem. The GA chromosomes for the coverage optimization are encoded into binary strings, each bit of which indicates individual classifier. The fitness function is defined as maximization of error reduction and a constraint of variance inflation factor (VIF), which is one of the generally used methods to measure multicollinearity, is added to insure the diversity of classifiers by removing high correlation among the classifiers. We use Microsoft Excel and the GAs software package called Evolver. Experiments on company failure prediction have shown that CO-NN is effectively applied in the stable performance enhancement of NNensembles through the choice of classifiers by considering the correlations of the ensemble. The classifiers which have the potential multicollinearity problem are removed by the coverage optimization process of CO-NN and thereby CO-NN has shown higher performance than a single NN classifier and NN ensemble at 1% significance level, and DT ensemble at 5% significance level. However, there remain further research issues. First, decision optimization process to find optimal combination function should be considered in further research. Secondly, various learning strategies to deal with data noise should be introduced in more advanced further researches in the future.

A Basic Study on the Characteristics of the Modern Garden in Incheon During the Opening Period - Focused on Rikidake's Villa - (개항기 인천 근대정원의 조영특성에 관한 기초연구 - 리키다케 별장을 중심으로 -)

  • Jin, Hye-Young;Shin, Hyun-Sil
    • Journal of the Korean Institute of Traditional Landscape Architecture
    • /
    • v.38 no.3
    • /
    • pp.83-91
    • /
    • 2020
  • The purpose of this study is to examine the process of formation of modern gardens. Based on the analysis of the process of formation and transformation of the Jemulpo in Incheon and the details of the modern garden construction. The results are as follows; First, the formation of the Incheon Residence Site began in 1876 with the signing of the Joseon-Japan Treaty. Jemulpo used to be a desolate fishing village in the past, but after its opening in 1881, the Japanese settlement, Chinese settlement, and the general foreign settlement were formed. After that, Japan reclaimed the southern mudflats and expanded the theire settlement area, and advanced to the Joseon area(currently Sinheung-dong). In Japanese colonial era, modern Japanese urban landscapes were transplanted into the settlement area, centering on the Japanese modern gardens were distributed in the area around the center of the settlement area. Second, after examining the process of creating the garden for the Rikidake villa, Japanese Rikidake purchased a site for an orchard in Uri-tang, who was a major landowner in Incheon, to create the garden. At the time of Rikidake's residence, the garden was very large, measuring about 3,000 pyeong, and after liberation, it was acquired by Incheon City and used as Yulmok Children's Library. It was known as a rich village at the time of the opening of the port, and a garden was located at the highest point in Yulmok-dong, making it easy to see the Incheon Port area. Also, a spot located about 300 meters away from Rikidake's rice mill may have affected the location selection. Third, today's Rikidake villa has a Japanese-style house on a trapezoidal site, with a garden of about 990 square meters on the south side. Currently, it is possible to enter from the south and from Yulmok Children's Park in the north, but in the past, the main direction of the house was to view the Incheon Port, settlement area, and the Rikidake Rice Mill, so the house was located in front of the garden. The garden is a multi-faceted style with stone lanterns, tombstones, garden stones, and trees placed on each side, and is surrounded by arboreal plants such as attention, strobe pine, and maple trees, as well as royal azaleas. The view from the inside of the house was secured through shrub-oriented vegetation around the house.

The Process of Hillslope Denudation Since the Last Glacial Maximum Near Tangjeong-myeon, Asan-si, Central Korea (아산시(牙山市) 탕정면(湯井面) 일대(一帶) 최종빙기(最終氷期) 최성기(最盛期) 이후(以後) 구사면(丘斜面)의 삭박과정(削剝過程))

  • PARK, Ji-Hoon;JANG, Dong-Ho
    • Journal of The Geomorphological Association of Korea
    • /
    • v.15 no.2
    • /
    • pp.67-83
    • /
    • 2008
  • To find out the process of hillslope denudation since the Last Glacial Maximum in Asan area, we conducted the stratiform interpretation and carbon age measurements with the collected samples through trenching in the valley bottom of 'Agol' located in the lower stream of Magok stream. The results are as follows. 11 inorganic and 8 organic matter layers were confirmed at the point of trench MG1 in the subject area, 7 inorganic and 3 organic at the point of trench MG2, and 5 inorganic and 3 organic at the point of trench MG3, respectively. The frequency of hillslope denudation, hillslope mass movement, which had occurred in the unstable environment of back hillslope at the point of MG 1, was 11 times (8 times before about 2,900yrBP, twice in between about 2,900~1,900yrBP, and once after about 1,900yrBP) as a whole. The frequency of moor which had formed in the comparatively stable environment of back hillslope was 9 times (5 times before about 3,000yrBP, twice in between 3,000~2,800yrBP, and once in between 2,200~1,900yrBP) at minimum. The frequency of back hillslope denudation at the point of MG2 was totally 7 times (4 times before about 1,900yrBP and 3 times after about 1,900yrBP) and the moor formations were 3 times (twice before about 1,900yrBP and once after 1,900yrBP). The frequency of back hillslope denudation at the point of MG3 was totally 5 times (3 times before about 1,900yrBP and twice after about 1,900yrBP) and the moor formations were 3 times (twice before about 1,900yrBP and once after 1,900yrBP). The hillslope surrounded by valley bottom of 'Agol' was confirmed as the pile up of various inorganic matters by the mass movement such as sand or sandy gravel in the valley bottom of the subject area, formed not once but several times of denudation. We could know that the hillslope denudation cycle converged to the time period of $10^2{\sim}10^3$ years. These results will be an important basic data for restoring hillslope denudation process near Asan and changing climate of the Late Quaternary Period.

A Study on the Concept of Records-Archives and on the Definition of Archival Terms (기록물의 개념과 용어의 정의에 관한 연구)

  • Kim, Jung-Ha
    • The Korean Journal of Archival Studies
    • /
    • no.21
    • /
    • pp.3-40
    • /
    • 2009
  • It has passed ten years since modern records and archives management in our country launched. During times, it has dramatically developed in the fields of law, institution and education. However a study on the definition of records and archives was non be studied enough compared to development of various research fields. In fact the reason why study on the definition was non fulfilled is that some aspects such as historical, informational, archival perspective have been coexisting without order in Korea. This situation is the biggest barrier that archival science is to a disciplinary field. Historically, 'archivium' in Latin language had developed in starting of its means place, then whole entity of documents and those organic relations. In this point, archives is rigidly separate to material of Historical science which covers all of recorded. Unlike information which is produced in the process of intended themes and following its outputs like books, documents in archival science is made in the natural process of work. In addition, historical archives which finished the current and semi-current stage and transfer to the institute of permanent conservation after the process of selection so that it is historical and cultural value to satisfy its purpose of making. This changed trend is based on the Second World War and necessity of North American society which needs to effciency and transparency of work. In Korea, records and archives management has been dominantly affected by North American society and become a subject of not arrangement but of classification, not of transferring but of collection. It is also recognized as management of on formation on the all recorded or documents not as an whole documents and all organic relations. But the original type of recognition is the only technology, it cannot have dignity as a field of science.

Awareness of Pre-Service Elementary Teachers' on Science Teaching-Learning Lesson Plan (초등예비교사의 과학과 교수·학습 과정안 작성에 대한 인식)

  • Yong-Seob, Lee;Sun-Sik, Kim
    • Journal of the Korean Society of Earth Science Education
    • /
    • v.15 no.3
    • /
    • pp.335-344
    • /
    • 2022
  • This study was conducted for 4 weeks on the preparation of the science teaching/learning course plan for 109 students in 4 classes of the 2nd year intensive course at B University of Education. Pre-service elementary teachers attended a two-week field training practice after listening to a lecture on how to write a science teaching and learning course plan. Pre-service elementary teachers tried to find out about the selection of materials and the degree of connection between the course plan and the class to prepare the science teaching/learning course plan. The researcher completed the questionnaire by reviewing and deliberation on the questionnaire questions together with 4 pre-service elementary teachers. The questionnaire related to the writing of the science teaching and learning course plan consists of 8 questions. Preferred reference materials when writing the course plan, the level of interest in learning, the success or failure of the science course plan and class, the science preferred model, the evaluation method in unit time, and the science teaching and learning One's own efforts to write the course plan, the contents of this course are the science faculty. It is composed of the preparation of the learning process plan and how helpful it is to the class. The results of this study are as follows. First, it was found that elementary school pre-service elementary teachers preferred teacher guidance the most when drafting science teaching and learning curriculum plans. Second, it is recognized that the development stage is very important in the teaching and learning stage of the science department. Third, Pre-service elementary teachers believe that the science and teaching and learning process plan has a high correlation with the success of the class. Fourth, it was said that the student's level, the teacher's ability, and the appropriate lesson plan had the most influence on the class. Fifth, it was found that pre-service elementary teachers prefer the inquiry learning class model. Sixth, it was found that reports and activity papers were preferred for evaluation in 40-minute classes. Seventh, it was stated that the teaching and learning process plan is highly related to the class, so it will be studied and studied diligently. Eighth, the method of writing a science teaching and learning course plan based on the instructional design principle is interpreted as very beneficial.

Crop Injury (Growth Inhibition) Induced by Herbicides and Remedy to Reduce It (제초제(除草劑) 약해발생(藥害發生) 양상(樣相)과 경감대책(輕減對策))

  • Kim, K.U.
    • Korean Journal of Weed Science
    • /
    • v.12 no.3
    • /
    • pp.261-270
    • /
    • 1992
  • Many herbicides that are applied at the soil before weed emergence inhibit plant growth soon after weed germination occurs. Plant growth has been known as an irreversible increase in size as a result of the processes of cell divison and cell enlargement. Herbicides can influence primary growth in which most new plant tissues emerges from meristmatic region by affecting either or both of these processes. Herbicides which have sites of action during interphase($G_1$, S, $G_2$) of cell cycle and cause a subsequent reduction in the observed frequency of mitotic figures can be classified as an inhibitor of mitotic entry. Those herbicides that affect the mitotic sequence(mitosis) by influencing the development of the spindle apparatus or by influencing new cell plate formation should be classified as causing disruption of the mitotic sequence. Sulfonylureas, imidazolinones, chloroacetamides and some others inhibit plant growth by inhibiting the entry of cell into mitosis. The carbamate herbicides asulam, carbetamide, chlorpropham and propham etc. reported to disrupt the mitotic sequence, especially affecting on spindle function, and the dinitroaniline herbicides trifluralin, nitralin, pendimethalin, dinitramine and oryzalin etc. reported to disrupt the mitotic sequence, particularly causing disappearence of microtubles from treated cells due to inhibition of polymerization process. An inhibition of cell enlargement can be made by membrane demage, metabolic changes within cells, or changes in processes necessary for cell yielding. Several herbicides such as diallate, triallate, alachlor, metolachlor and EPTC etc. reported to inhibit cell enlargement, while 2, 4-D has been known to disrupt cell enlargement. One potential danger inherent in the use of soil acting herbicides is that build-up of residues could occur from year to year. In practice, the sort of build-up that would be disastrous is unikely to occur for substances applied at the correct soil concentration. Crop injury caused by soil applied herbicides can be minimized by (1) following the guidance of safe use of herbicides, particularly correct dose at correct time in right crop, (2) by use of safeners which protect crops against injury without protecting any weed ; interactions between herbicides and safeners(antagonists) at target sites do occur probably from the following mechanisms (1) competition for binding site, (2) circumvention of the target site, and (3) compensation of target site, and another mechanism of safener action can be explained by enhancement of glutathione and glutathione related enzyme activity as shown in the protection of rice from pretilachlor injury by safener fenclorim, (3) development of herbicide resistant crops ; development of herbicide-resistant weed biotypes can be explained by either gene pool theory or selection theory which are two most accepted explanations, and on this basis it is likely to develop herbicide-resistant crops of commercial use. Carry-over problems do occur following repeated use of the same herbicide in an extended period of monocropping, and by errors in initial application which lead to accidental and irregular overdosing, and by climatic influence on rates of loss. These problems are usually related to the marked sensitivity of the particular crops to the specific herbicide residues, e.g. wheat/pronamide, barley/napropamid, sugarbeet/ chlorsulfuron, quinclorac/tomato. Relatively-short-residual product, succeeding culture of insensitive crop to specific herbicide, and greater reliance on postemergence herbicide treatments should be alternatives for farmer practices to prevent these problems.

  • PDF

Construction and characterization of heterozygous diploid Escherichia coli (2배체 대장균의 제조와 그 특성)

  • Jung, Hyeim;Lim, Dongbin
    • Korean Journal of Microbiology
    • /
    • v.52 no.4
    • /
    • pp.406-414
    • /
    • 2016
  • Among 6 leu codons, CUG is the most frequently used codon in E. coli. It is recognized by leu-tRNA(CAG) encoded by four genes scattered on two chromosomal loci (leuT and leuPQV ). In the process of constructing a strain with no functional leu-tRNA (CAG) gene on chromosome, we made two mutant strains separately, one on leuPQV locus (${\Delta}leuPQV$), and the other on leuT locus [$leuT^*$(GAG)], where the anticodon of leuT was changed from CAG to GAG, thereby altering its recognition codon from CUG to CUC. We attempted to combine these two mutations by transduction using $leuT^*$(GAG) strain as a donor and ${\Delta}leuPQV$ strain as a recipient. Large and small colonies appeared from this transduction. From PCR and DNA sequencing, large colony was confirmed to be the reciprocal recombinant as expected, but the small colonies contained both mutant $leuT^*$(GAG) and wild type leuT (CAG) genes in the cell. This heterozygous diploid strain did not show any unusual morphology under microscopic observation, but, interestingly, it showed a linear growth curve in rich medium with much slower growth rate than wild type cell. It always formed homogenous small colonies in the selection medium, but, when there was no selection, it readily segregated into $leuT^*$(GAG) and leuT (CAG). From these observations, we suggested that the strain with both $leuT^*$(GAG) and leuT (CAG) genes was not a partial diploid (merodiploid), but a full diploid cell having two different chromosomes. We proposed a model explaining how such a heterozygous diploid cell was formed and how and why its growth showed a linear growth curve.

The Prediction of DEA based Efficiency Rating for Venture Business Using Multi-class SVM (다분류 SVM을 이용한 DEA기반 벤처기업 효율성등급 예측모형)

  • Park, Ji-Young;Hong, Tae-Ho
    • Asia pacific journal of information systems
    • /
    • v.19 no.2
    • /
    • pp.139-155
    • /
    • 2009
  • For the last few decades, many studies have tried to explore and unveil venture companies' success factors and unique features in order to identify the sources of such companies' competitive advantages over their rivals. Such venture companies have shown tendency to give high returns for investors generally making the best use of information technology. For this reason, many venture companies are keen on attracting avid investors' attention. Investors generally make their investment decisions by carefully examining the evaluation criteria of the alternatives. To them, credit rating information provided by international rating agencies, such as Standard and Poor's, Moody's and Fitch is crucial source as to such pivotal concerns as companies stability, growth, and risk status. But these types of information are generated only for the companies issuing corporate bonds, not venture companies. Therefore, this study proposes a method for evaluating venture businesses by presenting our recent empirical results using financial data of Korean venture companies listed on KOSDAQ in Korea exchange. In addition, this paper used multi-class SVM for the prediction of DEA-based efficiency rating for venture businesses, which was derived from our proposed method. Our approach sheds light on ways to locate efficient companies generating high level of profits. Above all, in determining effective ways to evaluate a venture firm's efficiency, it is important to understand the major contributing factors of such efficiency. Therefore, this paper is constructed on the basis of following two ideas to classify which companies are more efficient venture companies: i) making DEA based multi-class rating for sample companies and ii) developing multi-class SVM-based efficiency prediction model for classifying all companies. First, the Data Envelopment Analysis(DEA) is a non-parametric multiple input-output efficiency technique that measures the relative efficiency of decision making units(DMUs) using a linear programming based model. It is non-parametric because it requires no assumption on the shape or parameters of the underlying production function. DEA has been already widely applied for evaluating the relative efficiency of DMUs. Recently, a number of DEA based studies have evaluated the efficiency of various types of companies, such as internet companies and venture companies. It has been also applied to corporate credit ratings. In this study we utilized DEA for sorting venture companies by efficiency based ratings. The Support Vector Machine(SVM), on the other hand, is a popular technique for solving data classification problems. In this paper, we employed SVM to classify the efficiency ratings in IT venture companies according to the results of DEA. The SVM method was first developed by Vapnik (1995). As one of many machine learning techniques, SVM is based on a statistical theory. Thus far, the method has shown good performances especially in generalizing capacity in classification tasks, resulting in numerous applications in many areas of business, SVM is basically the algorithm that finds the maximum margin hyperplane, which is the maximum separation between classes. According to this method, support vectors are the closest to the maximum margin hyperplane. If it is impossible to classify, we can use the kernel function. In the case of nonlinear class boundaries, we can transform the inputs into a high-dimensional feature space, This is the original input space and is mapped into a high-dimensional dot-product space. Many studies applied SVM to the prediction of bankruptcy, the forecast a financial time series, and the problem of estimating credit rating, In this study we employed SVM for developing data mining-based efficiency prediction model. We used the Gaussian radial function as a kernel function of SVM. In multi-class SVM, we adopted one-against-one approach between binary classification method and two all-together methods, proposed by Weston and Watkins(1999) and Crammer and Singer(2000), respectively. In this research, we used corporate information of 154 companies listed on KOSDAQ market in Korea exchange. We obtained companies' financial information of 2005 from the KIS(Korea Information Service, Inc.). Using this data, we made multi-class rating with DEA efficiency and built multi-class prediction model based data mining. Among three manners of multi-classification, the hit ratio of the Weston and Watkins method is the best in the test data set. In multi classification problems as efficiency ratings of venture business, it is very useful for investors to know the class with errors, one class difference, when it is difficult to find out the accurate class in the actual market. So we presented accuracy results within 1-class errors, and the Weston and Watkins method showed 85.7% accuracy in our test samples. We conclude that the DEA based multi-class approach in venture business generates more information than the binary classification problem, notwithstanding its efficiency level. We believe this model can help investors in decision making as it provides a reliably tool to evaluate venture companies in the financial domain. For the future research, we perceive the need to enhance such areas as the variable selection process, the parameter selection of kernel function, the generalization, and the sample size of multi-class.