• 제목/요약/키워드: correct classification rate

검색결과 107건 처리시간 0.027초

공통요인분석자혼합모형의 요인점수를 이용한 일반화가법모형 기반 신용평가 (A credit classification method based on generalized additive models using factor scores of mixtures of common factor analyzers)

  • 임수열;백장선
    • Journal of the Korean Data and Information Science Society
    • /
    • 제23권2호
    • /
    • pp.235-245
    • /
    • 2012
  • 로지스틱판별분석은 금융 분야에서 유용하게 사용되고 있는 통계적 기법으로 신용평가 시 해석이 쉽고 우수한 분별력으로 많이 활용되고 있지만 종속변수에 대한 설명변수들의 비선형적인 관계를 설명하는 부분에는 한계점이 있다. 일반화가법모형은 로지스틱판별모형의 장점과 함께 종속변수와 설명변수 사이의 비선형적인 관계도 설명할 수 있다. 그러나 연속형 설명변수의 수가 대단히 많은 경우이 두 방법은 모형에 유의한 변수를 선택해야하는 문제점이 있다. 따라서 본 연구에서는 다수의 연속형 설명변수들을 공통요인분석자혼합모형에 의한 차원축소를 통해 변환된 소수의 요인점수들을 일반화가법모형의 새로운 연속형 설명변수로 사용하여 신용분류를 하는 방법을 제시한다. 실제 금융자료를 이용하여 로지스틱판별모형과 일반화가법모형, 그리고 본 연구에서 제안한 방법에 의한 정분류율을 비교한 결과 본 연구에서 제안한 방법의 분류 성능이 더 우수하였다.

구강 편평세포암종의 경부 림프절전이에 대한 임상통계학적 연구 (A CLINICO-STATISTICAL STUDY ON CERVICAL LYMPH NODE METASTASIS OF ORAL SQUAMOUS CELL CARCINOMA)

  • 이재욱;김진욱;김진수
    • Journal of the Korean Association of Oral and Maxillofacial Surgeons
    • /
    • 제34권6호
    • /
    • pp.594-601
    • /
    • 2008
  • Cervical lymph node metastasis is one of the most important predicting factors that influence the prognosis of oral squamous cell carcinoma. Correct diagnosis on cervical lymph node metastasis is essential in determining the extent of operation and treatment modality. So we investigated a clinico-statistical evaluation on cervical lymph node metastasis in 183 patients who were diagnosed with oral squamous cell carcinoma at the Department of Oral and Maxillofacial Surgery in Kyungpook National University Hospital, from January 1st, 1999 to December 31st, 2007. The following results were obtained : 1. Among 183 patients who were diagnosed with oral squamous cell carcinoma, 149 were male and 49 were female. The average age of the patients was 61.8 years old. 2. Patients with advanced T classification showed higher incidence of cervical lymph node metastasis than those with lower T classification. 3. Patients with less differentiated tumors had higher tendency of manifesting cervical lymph node metastasis than those with more differentiated tumors. 4. Sensitivity and specificity on PET/CT was 87.5% and 58.3% respectively. PET/CT showed higher sensitivity and lower false-negative values than those of CT or USG. 5. The 5 - year survival rate of all the oral squamous cell carcinoma patients appeared to be 63.2% By N classification, patients in N0 stage showed a higher survival rate than patients in N1 or N2. 5 - year survival rates according to the modality of neck dissection were as follows in increasing order: no neck dissection group, MRND group, SND group, and RND group.

중년 후기 여성의 체형 유형화에 관한 연구 (A Study on Somatotype Classification of the Late Middle-Aged Women)

  • 심정희
    • 한국의류학회지
    • /
    • 제26권1호
    • /
    • pp.15-26
    • /
    • 2002
  • The purpose of this study was to classier the somatotype of late middle-aged women and to analyze the characteristics of each somatotype. The subjects were 337 late middle-aged women and their age range os from 45 to 59 fears old. Data were collected through anthropometry and photometry and analyzed by factor analysis, cluster analysis and discriminant analysis. The results were as follows; 1. The result of factor analysis indicated that 9 factors were extracted through factor analysis and those factors comprised 83.56 percent of total valiance. 2. Using factor scores, cluster analysis was carried out and the subject were classified into 4 cluster. Each cluster was classified as their body front and side view contour. Type 1 is tall, slim, and lower balk is flat on the side. Type 2 is standard and lean-back type on the side. Type 3 is standard height and weight, H type in front, and belly-protruded on the side. Type 4 is short, fat, and the side is hip-protruded. 3. According to the stepwise discriminant analysis, the 9 important items in classifying the somatotype of the late middle-aged women are as follows ; lower back tilt angle, hip depth(back) -back waist depth(back), bust depth(fore) - anterior waist depth(fore), jugular fossa point(fore), upper back tilt angle, burst breadth -waist breadth, right shoulder tilt, height of shoulder - height of anterior waist, abdomen breath. The correct classification rate for these items is as exact as 84.62%.

심전도 신호의 자동분석을 위한 자기회귀모델 변수추정과 패턴분류 (The Auto Regressive Parameter Estimation and Pattern Classification of EKS Signals for Automatic Diagnosis)

  • 이윤선;윤형로
    • 대한의용생체공학회:의공학회지
    • /
    • 제9권1호
    • /
    • pp.93-100
    • /
    • 1988
  • The Auto Regressive Parameter Estimation and Pattern Classification of EKG Signal for Automatic Diagnosis. This paper presents the results from pattern discriminant analysis of an AR (auto regressive) model parameter group, which represents the HRV (heart rate variability) that is being considered as time series data. HRV data was extracted using the correct R-point of the EKG wave that was A/D converted from the I/O port both by hardware and software functions. Data number (N) and optimal (P), which were used for analysis, were determined by using Burg's maximum entropy method and Akaike's Information Criteria test. The representative values were extracted from the distribution of the results. In turn, these values were used as the index for determining the range o( pattern discriminant analysis. By carrying out pattern discriminant analysis, the performance of clustering was checked, creating the text pattern, where the clustering was optimum. The analysis results showed first that the HRV data were considered sufficient to ensure the stationarity of the data; next, that the patern discrimimant analysis was able to discriminate even though the optimal order of each syndrome was dissimilar.

  • PDF

중년 전기 여성의 체형 유형화에 관한 연구 (A study on Somatotype Classification of the Early Middle-Aged Women)

  • 심정희
    • 한국의류학회지
    • /
    • 제25권8호
    • /
    • pp.1386-1397
    • /
    • 2001
  • The purpose of this study was to classify and analyze the somatotype of early middle-aged women and to provide its total data for clothing construction, and to improve clothing culture. The subjects were 277 early middle-aged women between 35 and 44 years old. Data were collected through anthropometry and photometry and analyzed by factor analysis, cluster analysis and discriminant analysis. The results were as follows; 1. The result of factor analysis indicated that 10 factors were extracted through factor analysis and those factors comprised 86.13 percent of total variance. 2. Using factor scores, cluster analysis was carried out and the subject were classified into 4 cluster. Type 1 is tall, slim, and X type in front. Type 2 is standard height and weight, short upper body, and hip-protruded on the side. Type 3 is standard height, thin, H type in front, back and hip are clearly protruded, and lean-back type on the side. Type 4 is standard height, fat, and long upper body. 3. According to the stepwise discriminant analysis, the 8 important iems is classifying the somatotype of early middle-aged women are as follows : bust girth, back length hip breadth-waist breadth, back protruded point depth(back)-back waist depth(back), hip tangent tilt, hip depth(back) waist dapth(back), bust depth-waist depth, and cervical hight, The correct classification rate for these items is as exact as 83.20%.

  • PDF

로지스틱회귀모형의 로버스트 추정을 위한 알고리즘 (Algorithm for the Robust Estimation in Logistic Regression)

  • 김부용;강명욱;최미애
    • 응용통계연구
    • /
    • 제20권3호
    • /
    • pp.551-559
    • /
    • 2007
  • 로지스틱회귀에서 일반적으로 사용되는 최대우도추정법은 이상점에 대해 로버스트 하지 않다. 따라서 본 논문에서는 로지스틱회귀모형의 로버스트 추정을 위한 알고리즘을 제안하고자 한다. 이 알고리즘은 V-마스크 형태의 경계기준에 의해 나쁜 지렛점과 수직이상점을 식별하고, 식별 결과를 바탕으로 이상점의 영향력을 감소시키기 위한 효과적인 방안을 모색한다. 이상점의 영향력 감소는 가중치와 조정치를 적절히 선정함으로 가능하며, 그 결과 붕괴점이 높은 추정치를 얻게 된다. 제안된 알고리즘을 다양한 자료에 적용하여 정분류율을 측정하여 비교하였는데, 새로운 알고리즘이 최대우도추정보다 정확한 분류를 해 주는 것으로 평가되었다.

소아중환자를 대상으로 한 PIM Ⅱ의 타당도 평가 (Evaluating the Validity of the Pediatric Index of Mortality Ⅱ in the Intensive Care Units)

  • 김정순;부선주
    • 대한간호학회지
    • /
    • 제35권1호
    • /
    • pp.47-55
    • /
    • 2005
  • Purpose: This study was to evaluate the validity of the Pediatric Index of Mortality Ⅱ(PIM Ⅱ). Method: The first values on PIM Ⅱ variables following ICU admission were collected from the patient's charts of 548 admissions retrospectively in three ICUs(medical, surgical, and neurosurgical) at P University Hospital and a cardiac ICU at D University Hospital in Busan from January 1, 2002 to December 31, 2003. Data was analyzed with the SPSSWIN 10.0 program for the descriptive statistics, correlation coefficient, standardized mortality ratio(SMR), validity index(sensitivity, specificity, positive predictive value, negative predictive value), and AUC of ROC curve. Result: The mortality rate was 10.9% (60 cases) and the predicted death rate was 9.5%. The correlation coefficient(r) between observed and expected death rates was .929(p<.01) and SMR was 1.15. Se, Sp, pPv, nPv, and the correct classification rate were .80, .96, .70, .98, and 94.0% respectively. In addition, areas under the curve (AUC) of the receiver operating characteristic(ROC) was 0.954 (95% CI=0.919~0.989). According to demographic characteristics, mortality was underestimated in the medical group and overestimated in the surgical group. In addition, the AUCs of ROC curve were generally high in all subgroups. Conclusion: The PIM Ⅱ showed a good, so it can be utilized for the subject hospital. better.

연결요소 분석에 기반한 인쇄체 한글 주소와 필기체 한글 주소의 구분 (Classification of Handwritten and Machine-printed Korean Address Image based on Connected Component Analysis)

  • 장승익;정선화;임길택;남윤석
    • 한국정보과학회논문지:소프트웨어및응용
    • /
    • 제30권10호
    • /
    • pp.904-911
    • /
    • 2003
  • 본 논문에서는 우편봉투 상에 기입된 인쇄체 한글 주소와 필기체 한글 주소를 효과적으로 구분할 수 있는 방법을 제안한다. 문자인식 모듈을 포함하는 각종 응용 시스템에서 입력 영상이 인쇄체인지 필기체인지 구분하는 것은 매우 중요하다. 이는 대부분의 경우 인쇄체 영상과 필기체 영상이 갖는 특징이 상이하여, 각 영상에서의 문자 및 문자열 분리 방법, 문자 인식 방법 둥이 매우 상이하게 개발되기 때문이다. 본 논문에서 제안한 구분 방법은 연결요소 추출 및 병합, 특징 추출, 영상 구분 순으로 수행된다. 연결요소 추출 및 병합 단계에서는 입력영상으로부터 연결요소를 추출한 후 일부 연결요소들에 대하여 병합을 시도하며, 특징 추출 단계에서는 병합결과 얻어진 연결요소들의 그룹들로부터 폭과 위치에 관련된 특징을 추출하고, 영상 구분 단계에서는 추출한 특징을 입력으로 제공받는 다충퍼셉트론을 사용하여 구분을 시도한다. 제안한 방법의 우수성을 증명하기 위해 실제 우편물로부터 추출된 3,147개의 한글 주소 영상을 사용하여 실험한 결과, 98.85%의 구분률을 보여주었다.

일반화된 판별분석 기법을 이용한 능동소나 표적 식별 (Sonar Target Classification using Generalized Discriminant Analysis)

  • 김동욱;김태환;석종원;배건성
    • 한국정보통신학회논문지
    • /
    • 제22권1호
    • /
    • pp.125-130
    • /
    • 2018
  • 선형판별분석(LDA) 기법은 특징벡터의 차원을 줄이거나 클래스 식별에 이용되는 통계적 분석 방법이다. 그러나 선형 분리가 불가능한 데이터 집합의 경우에는 비선형 함수를 이용하여 특징벡터를 고차원의 공간으로 사상(mapping) 시켜줌으로써 선형 분리가 가능하도록 만들 수 있는데, 이러한 기법을 일반화된 판별분석(GDA) 또는 커널판별분석(KDA) 기법이라고 한다. 본 연구에서는 인터넷에 공개되어 있는 능동소나 표적신호에 LDA 및 GDA 기법을 이용하여 표적식별 실험을 수행하고, 그 결과를 비교/분석하였다. 실험 결과 104개의 테스트 데이터에 대해 LDA 기법으로는 73.08% 인식률을 얻었으나 GDA 기법으로는 95.19%로 기존의 MLP 또는 커널 기반 SVM에 비해 나은 성능을 보였다.

TPM, PAC 활동에서 생산성지표와 재무회계 지표의 연계방안 전략 (The Linkage Strategies Between Productivity Metrics and Financial Accounting Metrics in TPM and PAC Activities)

  • 최성운
    • 대한안전경영과학회지
    • /
    • 제15권3호
    • /
    • pp.151-161
    • /
    • 2013
  • This paper proposes a strategic model of linkage between productivity metrics and financial accounting metrics to properly evaluate the financial effect of TPM activities and the business performance. This linkage strategy provides a connection tool for clear communication between factory-level and headquarters that the metrics proposed by this paper ultimately improves a quality of support from the management by receiving the factors required for productivity activities in the practical field. This factor includes such as equipment, raw materials and labors. Here, we propose that chain reaction models using break down structure of productivity metrics and financial metrics enhance the knowledge sharing of KPI (Key Performance Indicator) which generally tend to create oversimplified communication between management in headquarters and employees in the practical fields. The productivity metrics include OEE(Overall Equipment Effectiveness) of TPM (Total Productive Maintenance), OLE (Overall Labor Effectiveness) of PAC(Performance and Analysis and Control) activities, and OYE (Overall Yield Effectiveness) of TMM(Total Material Management) activities. The financial accounting metrics include ROE(Return on Equity), ROA(Return on Asset), and AVR(Added-Value Rate). The suggested chain reaction model selects the financial metrics as initial stage and branch down until final stage of productivity metrics. When demand exceeds supply, an ideal speed rate, the lean OEE strategy can be initially applied to reduce the gap between the demand and supply, then apply variable costing to estimate correct amount of operating profit. In addition, the paper presents a new type of model for linkage between financial accounting metrics including CAPEX(Capital Expenditure), OPEX(Operating Expenditure), EVA(Economic Added Value), DCL(Degree of Combined Leverage), and TPM productivity activities including AM(Autonomous Maintenance), PM(Preventive Maintenance), MP(Maintenance Prevention) and QM(Quality Maintenance). In order to support the evidence of proposed linkage strategy, a case analysis on 52 projects from national TPM contest from 2011 to 2012 is analyzed. The case presents the classification of CAPEX and OPEX activities from TPM, and proposes the correct implementation of financial effect for TPM projects.