• Title/Summary/Keyword: Decision Class Analysis

Search Result 155, Processing Time 0.022 seconds

Performance analysis and comparison of various machine learning algorithms for early stroke prediction

  • Vinay Padimi;Venkata Sravan Telu;Devarani Devi Ningombam
    • ETRI Journal
    • /
    • v.45 no.6
    • /
    • pp.1007-1021
    • /
    • 2023
  • Stroke is the leading cause of permanent disability in adults, and it can cause permanent brain damage. According to the World Health Organization, 795 000 Americans experience a new or recurrent stroke each year. Early detection of medical disorders, for example, strokes, can minimize the disabling effects. Thus, in this paper, we consider various risk factors that contribute to the occurrence of stoke and machine learning algorithms, for example, the decision tree, random forest, and naive Bayes algorithms, on patient characteristics survey data to achieve high prediction accuracy. We also consider the semisupervised self-training technique to predict the risk of stroke. We then consider the near-miss undersampling technique, which can select only instances in larger classes with the smaller class instances. Experimental results demonstrate that the proposed method obtains an accuracy of approximately 98.83% at low cost, which is significantly higher and more reliable compared with the compared techniques.

The Prediction of DEA based Efficiency Rating for Venture Business Using Multi-class SVM (다분류 SVM을 이용한 DEA기반 벤처기업 효율성등급 예측모형)

  • Park, Ji-Young;Hong, Tae-Ho
    • Asia pacific journal of information systems
    • /
    • v.19 no.2
    • /
    • pp.139-155
    • /
    • 2009
  • For the last few decades, many studies have tried to explore and unveil venture companies' success factors and unique features in order to identify the sources of such companies' competitive advantages over their rivals. Such venture companies have shown tendency to give high returns for investors generally making the best use of information technology. For this reason, many venture companies are keen on attracting avid investors' attention. Investors generally make their investment decisions by carefully examining the evaluation criteria of the alternatives. To them, credit rating information provided by international rating agencies, such as Standard and Poor's, Moody's and Fitch is crucial source as to such pivotal concerns as companies stability, growth, and risk status. But these types of information are generated only for the companies issuing corporate bonds, not venture companies. Therefore, this study proposes a method for evaluating venture businesses by presenting our recent empirical results using financial data of Korean venture companies listed on KOSDAQ in Korea exchange. In addition, this paper used multi-class SVM for the prediction of DEA-based efficiency rating for venture businesses, which was derived from our proposed method. Our approach sheds light on ways to locate efficient companies generating high level of profits. Above all, in determining effective ways to evaluate a venture firm's efficiency, it is important to understand the major contributing factors of such efficiency. Therefore, this paper is constructed on the basis of following two ideas to classify which companies are more efficient venture companies: i) making DEA based multi-class rating for sample companies and ii) developing multi-class SVM-based efficiency prediction model for classifying all companies. First, the Data Envelopment Analysis(DEA) is a non-parametric multiple input-output efficiency technique that measures the relative efficiency of decision making units(DMUs) using a linear programming based model. It is non-parametric because it requires no assumption on the shape or parameters of the underlying production function. DEA has been already widely applied for evaluating the relative efficiency of DMUs. Recently, a number of DEA based studies have evaluated the efficiency of various types of companies, such as internet companies and venture companies. It has been also applied to corporate credit ratings. In this study we utilized DEA for sorting venture companies by efficiency based ratings. The Support Vector Machine(SVM), on the other hand, is a popular technique for solving data classification problems. In this paper, we employed SVM to classify the efficiency ratings in IT venture companies according to the results of DEA. The SVM method was first developed by Vapnik (1995). As one of many machine learning techniques, SVM is based on a statistical theory. Thus far, the method has shown good performances especially in generalizing capacity in classification tasks, resulting in numerous applications in many areas of business, SVM is basically the algorithm that finds the maximum margin hyperplane, which is the maximum separation between classes. According to this method, support vectors are the closest to the maximum margin hyperplane. If it is impossible to classify, we can use the kernel function. In the case of nonlinear class boundaries, we can transform the inputs into a high-dimensional feature space, This is the original input space and is mapped into a high-dimensional dot-product space. Many studies applied SVM to the prediction of bankruptcy, the forecast a financial time series, and the problem of estimating credit rating, In this study we employed SVM for developing data mining-based efficiency prediction model. We used the Gaussian radial function as a kernel function of SVM. In multi-class SVM, we adopted one-against-one approach between binary classification method and two all-together methods, proposed by Weston and Watkins(1999) and Crammer and Singer(2000), respectively. In this research, we used corporate information of 154 companies listed on KOSDAQ market in Korea exchange. We obtained companies' financial information of 2005 from the KIS(Korea Information Service, Inc.). Using this data, we made multi-class rating with DEA efficiency and built multi-class prediction model based data mining. Among three manners of multi-classification, the hit ratio of the Weston and Watkins method is the best in the test data set. In multi classification problems as efficiency ratings of venture business, it is very useful for investors to know the class with errors, one class difference, when it is difficult to find out the accurate class in the actual market. So we presented accuracy results within 1-class errors, and the Weston and Watkins method showed 85.7% accuracy in our test samples. We conclude that the DEA based multi-class approach in venture business generates more information than the binary classification problem, notwithstanding its efficiency level. We believe this model can help investors in decision making as it provides a reliably tool to evaluate venture companies in the financial domain. For the future research, we perceive the need to enhance such areas as the variable selection process, the parameter selection of kernel function, the generalization, and the sample size of multi-class.

A Phenomenological Study on Academic Achievement After Experiences of Problem-Based Learning in Students of Physical Therapy (물리치료학과 학생의 PBL수업과 학업성취도에 대한 현상학적 연구)

  • Kim, Janggon
    • Journal of The Korean Society of Integrative Medicine
    • /
    • v.2 no.4
    • /
    • pp.83-90
    • /
    • 2014
  • Purpose : PBL is a teaching method to learn problem-solving process. Present study was to investigate the predictors of academic achievement when PBL is applied to students of physical therapy. Method : We Performed in-depth interviews and analyzed using the qualitative analysis by randomly assigning 5 of twenty four students who attended the class. Result : The results are classified into two categories and six sub-subjects. Based on two system of classification, PBL showed the learning effect through problem-solving methods because students directly participated in these processes. Also, students need to clearly comprehend communication method and decision-making process in order to progress the class smoothly. Conclusion : Therefore, futher studies will be continuously needed on how we apply PBL to various curriculums of physical therapy.

Development of Independent 1 kW-class PEMFC-Battery Hybrid System for a Building (건물용 독립형 1kW급 PEMFC-배터리 하이브리드 시스템 기술 개발)

  • Yang, Seug Ran;Kim, Jung Suk;Choi, Mi Hwa
    • KEPCO Journal on Electric Power and Energy
    • /
    • v.5 no.2
    • /
    • pp.113-120
    • /
    • 2019
  • We have developed 1 kW-class PEMFC-battery hybrid system independently powering to the building, through the process of system design, current load characteristics analysis, power system configuration for demonstration site and performance evaluation. In order to use the fuel cell and battery as the hybrid type, a control technology for the charging/discharging decision and charging speed of the battery is required rather than using fuel cell. Also output power distribution between PEMFC and the battery is a core of energy management technology. It is confirmed that it is possible to supply independently 1kW powering the building to ensure optimal energy management through the power control experiment of the hybrid system.

Identifying Daily and Weekly Charging Profiles of Electric Vehicle Users in Korea : An Application of Sequence Analysis and Latent Class Cluster Analysis (전기차 이용자의 일단위 및 주단위 충전 프로파일 유형화 분석 : 순차패턴분석과 잠재계층분석을 중심으로)

  • Jae Hyun Lee;Seo Youn Yoon
    • The Journal of The Korea Institute of Intelligent Transport Systems
    • /
    • v.21 no.6
    • /
    • pp.194-210
    • /
    • 2022
  • The user-centered EV charging infrastructure construction policy the government is aiming for can increase convenience for electric vehicle users and bring new electric vehicle users into the market. This study was conducted to provide an in-depth understanding of the charging behaviors of actual electric vehicle users, which can be used as basic information for the electric vehicle charging infrastructure. Based on charging diary data collected for a week, the charging of electric vehicles was analyzed on a daily and weekly basis, and sequence analysis and latent class analysis were used. As a result, five daily charging profiles and four weekly charging profiles were identified, which are expected to contribute to revitalizing the electric vehicle market by providing key information for decision-making by potential electric vehicle users as well for establishing user-centered charging infrastructure policies in the future.

An Analysis of the Decision Factors on Mokpo Port by Multinomial Logit Model

  • Seong, Yu-Chang;Youn, Myung-Ou
    • Journal of Navigation and Port Research
    • /
    • v.31 no.2
    • /
    • pp.133-139
    • /
    • 2007
  • Relative importance of maritime transport that takes charge of main current of freight in country' economy is very large. Especially, port and facility carry out important role which treats freight of import and export smoothly and improves international trade as turning point, to achieve key role on connection and association between sea and land. For such reason, enlargement of port facilities or development of port needs to grasp exactly the utilization of port, attributes and selective factors of shipper. On the other hand, the amounts of physical distribution on Mokpo port located in Korean west coast are increasing, with fast economic growth of East Asian including China. This study uses discrete choice model that is measuring to analyze attribute and characteristic of Mokpo port, and analyzes port selection by decision factors of shipper. This paper composed a questionnaire using the result of preceding research, to decide port selection factor among competitive ports. Through factor analysis on a basis of the questionnaire' result, five principal components were extracted. These are resorted out by Logit model, to grasp competitive elements of port. This research fin present direction which raises competitive power of ports in west coast of Korea, especially on alternative and concentration of middle-class port as Mokpo may be useful.

A New Method for Hyperspectral Data Classification

  • Dehghani, Hamid.;Ghassemian, Hassan.
    • Proceedings of the KSRS Conference
    • /
    • 2003.11a
    • /
    • pp.637-639
    • /
    • 2003
  • As the number of spectral bands of high spectral resolution data increases, the capability to detect more detailed classes should also increase, and the classification accuracy should increase as well. Often, it is impossible to access enough training pixels for supervise classification. For this reason, the performance of traditional classification methods isn't useful. In this paper, we propose a new model for classification that operates based on decision fusion. In this classifier, learning is performed at two steps. In first step, only training samples are used and in second step, this classifier utilizes semilabeled samples in addition to original training samples. At the beginning of this method, spectral bands are categorized in several small groups. Information of each group is used as a new source and classified. Each of this primary classifier has special characteristics and discriminates the spectral space particularly. With using of the benefits of all primary classifiers, it is made sure that the results of the fused local decisions are accurate enough. In decision fusion center, some rules are used to determine the final class of pixels. This method is applied to real remote sensing data. Results show classification performance is improved, and this method may solve the limitation of training samples in the high dimensional data and the Hughes phenomenon may be mitigated.

  • PDF

Podiatric Clinical Diagnosis using Decision Tree Data Mining (결정트리 데이터마이닝을 이용한 족부 임상 진단)

  • Kim, Jin-Ho;Park, In-Sik;Kim, Bong-Ok;Yang, Yoon-Seok;Won, Yong-Gwan;Kim, Jung-Ja
    • Journal of the Institute of Electronics Engineers of Korea CI
    • /
    • v.48 no.2
    • /
    • pp.28-37
    • /
    • 2011
  • With growing concerns about healthy life recently, although the podiatry which deals with the whole area for diagnosis, treatment of foot and leg, and prevention has been widely interested, research in our country is not active. Also, because most of the previous researches in data analysis performed the quantitative approaches, the reasonable level of reliability for clinical application could not be guaranteed. Clinical data mining utilizes various data mining analysis methods for clinical data, which provides decision support for expert's diagnosis and treatment for the patients. Because the decision tree can provide good explanation and description for the analysis procedure and is easy to interpret the results, it is simple to apply for clinical problems. This study investigate rules of item of diagnosis in disease types for adapting decision tree after collecting diagnosed data patients who are 2620 feet of 1310(males:633, females:677) in shoes clinic (department of rehabilitation medicine, Chungnam National University Hospital). and we classified 15 foot diseases followed factor of 22 foot diseases, which investigated diagnosis of 64 rules. Also, we analyzed and compared correlation relationship of characteristic of disease and factor in types through made decision tree from 5 class types(infants, child, adolescent, adult, total). Investigated results can be used qualitative and useful knowledge for clinical expert`s, also can be used tool for taking effective and accurate diagnosis.

Wear Debris Analysis using the Color Pattern Recognition (칼라 패턴인식을 이용한 마모입자 분석)

  • ;A.Y.Grigoriev
    • Proceedings of the Korean Society of Tribologists and Lubrication Engineers Conference
    • /
    • 2000.06a
    • /
    • pp.54-61
    • /
    • 2000
  • A method and results of classification of 4 types metallic wear debris were presented by using their color features. The color image of wear debris was used (or the initial data, and the color properties of the debris were specified by HSI color model. Particle was characterized by a set of statistical features derived from the distribution of HSI color model components. The initial feature set was optimized by a principal component analysis, and multidimensional scaling procedure was used for the definition of classification plane. It was found that five features, which include mean values of H and S, median S, skewness of distribution of S and I, allow to distinguish copper based alloys, red and dark iron oxides and steel particles. In this work, a method of probabilistic decision-making of class label assignment was proposed, which was based on the analysis of debris-coordinates distribution in the classification plane. The obtained results demonstrated a good availability for the automated wear particle analysis.

  • PDF

Classification and Analysis of Human Error Accidents of Helicopter Pilots in Korea (국내 헬리콥터 조종사 인적오류 사고 분류 및 분석)

  • Yu, TaeJung;Kwon, YoungGuk;Song, Byeong-Heum
    • Journal of the Korean Society for Aviation and Aeronautics
    • /
    • v.28 no.4
    • /
    • pp.21-31
    • /
    • 2020
  • There are two to three helicopter accidents every year in Korea, representing 5.7 deaths per 100,000 flights. In this study, an analysis was conducted on helicopter accidents that occurred in Korea from 2005 to 2017. The accident analysis was based on the aircraft accident and incident report published by the Aircraft and Railway Accident Investigation Board. This Research analyzed the characteristics of accidents occurring in Korea caused by human error by pilots. Accident analysis was done by classifying the organization, flight mission, aircraft class, flight stage, accident cause, etc. Pilot's huan error was classified as Skill-based error, decision error and perceptual error in accordance with the HFACS taxonomy. The accidents caused by pilot's human error were classified into five categories: powerlines collision, loss of control, fuel exhaustion, unstable approach to reservoir, and elimination of tail rotor.