Predicting Early Retirees Using Personality Data

  • Kim, Young Park (Department of Big Data Application and Security, Graduate School of Information Security, Korea University)
  • Kim, Hyoung Joong (Department of Big Data Application and Security, Graduate School of Information Security, Korea University)
  • Received : 2017.10.27
  • Accepted : 2018.01.29
  • Published : 2018.01.31


This study analyzed early retirees, defined here as employees who left the company less than three years after joining, using the results of the personality test that one company administers during recruitment. Taking model fit and future usability into account, the prediction model was built separately for two job groups: manufacturing and R&D. Significant independent variables were chosen for each group by stepwise selection. Among supervised learning methods, logistic regression was chosen as the prediction model, and it was trained with cross-validation to guard against over-fitting and under-fitting. The accuracy of each group's model was checked with a confusion matrix. The factor with the greatest influence on early retirement was "immersion" in the manufacturing group and "antisocial" in the R&D group. Earlier studies of retirement concentrated on collecting data by questionnaire and identifying the factors most closely related to leaving. This study is significant in that it proposes an early retirement prediction model that can continue to be used in the future, because it is built on personality results that the recruitment process already produces.
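
The modelling steps named in the abstract (logistic regression, cross-validation, and a confusion matrix used to read off accuracy) can be illustrated with a minimal sketch. It is not the authors' implementation. The data are synthetic, the two feature names are borrowed from the abstract only as labels, the direction and size of their effects are invented for the demonstration, and every function name is hypothetical. Stepwise variable selection is left out to keep the example short, and only the Python standard library is used.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def fit_logistic(X, y, lr=0.1, epochs=300):
    """Fit weights (bias first) by batch gradient descent on the log-loss."""
    n, d = len(X), len(X[0])
    w = [0.0] * (d + 1)
    for _ in range(epochs):
        grad = [0.0] * (d + 1)
        for xi, yi in zip(X, y):
            p = sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi)))
            err = p - yi
            grad[0] += err
            for j, xj in enumerate(xi):
                grad[j + 1] += err * xj
        w = [wj - lr * gj / n for wj, gj in zip(w, grad)]
    return w


def predict(w, X, threshold=0.5):
    """Return 1 (predicted early retiree) when the fitted probability >= threshold."""
    return [
        int(sigmoid(w[0] + sum(wj * xj for wj, xj in zip(w[1:], xi))) >= threshold)
        for xi in X
    ]


def confusion_matrix(y_true, y_pred):
    """Return (tn, fp, fn, tp), where class 1 means 'left early'."""
    tn = fp = fn = tp = 0
    for t, p in zip(y_true, y_pred):
        if t == 1 and p == 1:
            tp += 1
        elif t == 1:
            fn += 1
        elif p == 1:
            fp += 1
        else:
            tn += 1
    return tn, fp, fn, tp


def cross_validate(X, y, k=5, seed=0):
    """k-fold cross-validation; returns the pooled confusion matrix and accuracy."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[i::k] for i in range(k)]
    totals = [0, 0, 0, 0]
    for held_out in folds:
        test = set(held_out)
        train = [i for i in idx if i not in test]
        w = fit_logistic([X[i] for i in train], [y[i] for i in train])
        y_pred = predict(w, [X[i] for i in held_out])
        fold_cm = confusion_matrix([y[i] for i in held_out], y_pred)
        totals = [a + b for a, b in zip(totals, fold_cm)]
    tn, fp, fn, tp = totals
    return (tn, fp, fn, tp), (tn + tp) / len(X)


def make_synthetic(n=400, seed=42):
    """Synthetic stand-in for one job group; NOT the paper's data."""
    rng = random.Random(seed)
    X, y = [], []
    for _ in range(n):
        immersion = rng.gauss(0, 1)   # hypothetical standardized score
        antisocial = rng.gauss(0, 1)  # hypothetical standardized score
        # Invented relationship, chosen only so the demo has a signal to find.
        p_leave = sigmoid(-1.5 * immersion + 1.0 * antisocial)
        X.append([immersion, antisocial])
        y.append(int(rng.random() < p_leave))
    return X, y


X, y = make_synthetic()
cm, accuracy = cross_validate(X, y)
weights = fit_logistic(X, y)  # final fit on all rows, used to inspect coefficients
print("confusion matrix (tn, fp, fn, tp):", cm)
print("cross-validated accuracy:", round(accuracy, 3))
print("weights (bias, immersion, antisocial):", [round(v, 3) for v in weights])
```

In a study like the one described, the same procedure would be run once per job group, and the fitted coefficients on standardized inputs would indicate which personality factor carries the most weight in each group.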


