Developing the Predictive Model for the Group at High Risk for Colon Cancer

대장암 발생 고위험군의 예측모형 개발과 활용

  • Lee, Ae-Kyoung (Health Insurance Research Center, National Health Insurance Corporation) ;
  • Park, Il-Soo (Health Insurance Research Center, National Health Insurance Corporation) ;
  • Kim, Su-Young (Department of Health Policy and Management, College of Medicine, Cheju National University) ;
  • Yoon, Tae-Ho (Department of Occupational & Preventive Medicine, College of Medicine, Busan National University) ;
  • Jeong, Baek-Geun (Department of Preventive Medicine, College of Medicine, Gyeongsang National University) ;
  • Lee, Sang-Yi (Health Insurance Research Center, National Health Insurance Corporation)
  • 이애경 (국민건강보험공단 건강보험연구센터) ;
  • 박일수 (국민건강보험공단 건강보험연구센터) ;
  • 김수영 (제주대학교 의과대학 예방의학교실) ;
  • 윤태호 (부산대학교 의과대학 예방의학 및 산업의학교실) ;
  • 정백근 (경상대학교 의과대학 예방의학교실) ;
  • 이상이 (국민건강보험공단 건강보험연구센터)
  • Published : 2006.09.30

Abstract

Objectives: We developed the predictive model for the incidence of colon cancer by utilizing the health screening data of the National Health Insurance in Korea. We also explored the characteristics of the high risk group for colon cancer. Methods: The predictive model was used to determine those people who have a high risk for colon cancer within 2 years of their NHI health screening, and we excluded the people who had already been treated for cancer or who were cancer patient. The study population is the insured of the NHI, aged 40 or over and they had undergone health screening from the year 2000 to 2004, according to NHI health screening formula. We performed logistic regression analysis and used SAS Enterprise Miner 4.1. Results: This study shows that there exists a higher rate of colon cancer in males than females. Also, for the population in their 60s, the incidence rate of colon cancer is much higher by 5.36 times than that for those people in their 40s. Amongst the behavioral factors, heavy drinking is the most important determinant of the colon cancer incidence (7.39 times in males and 21.51 times in females). Conclusions: Our study confirms that the major influencing factors for the incidence of colon cancer are drinking, lack of exercise, a medical history of colon polypus and a family history of colon cancer. As a result, we can choose the group that is at a high risk for colon cancer and provide customized medical information and selective management services according to their characteristics.

Keywords

References

  1. Yi JJ, Yoo WK, Kim SY, Kim KK, Yi, SW. Medical expenses by site of cancer and survival time among cancer patients in the last one tear of life. J Prev Med Public Health 2005; 38(1): 915(Korean)
  2. 통계청. 사망원인통계분석;각 연도
  3. Shin HR, Ahn YO, Bae JM, Shin MH, Lee DH, Lee CW, Ohrr H, Ahn DH, Ferlay J, Parkin, DM, Oh DK, Park JG. Cancer incidence in Korea. Cancer Res Treat 2003; 34(6): 405-408
  4. Baldwin FD. Well-managed care. Predictive modeling helps keep high-risk patients from becoming high-cost patients. Healthc Inform 2004; 21(8): 27-28
  5. http://www.cpmk.com
  6. Dogu Celebi. The power of predictive modeling. Healthc Inform 2003; 8: 56-58
  7. Colditz GA, Atwood KA, Emmons K, Monson RR, Willett WC, Trichopoulos D, Hunter DJ. Harvard report on cancer prevention volume 4: Harvard cancer risk index. Risk index working group, Harvard center for cancer prevention. Cancer Causes Control 2000; 11(6): 477-88 https://doi.org/10.1023/A:1008984432272
  8. Yoo KY, Shin HR. Cancer epidemiology and prevention. Korean J Epidemiol 2003; 25(1): 115(Korean)
  9. http://www.ncc.re.kr
  10. 김정순. 역학원론. 신광출판사;2004
  11. Lee DC, Soe I, Lee HR, Kim DK. Factors affecting on the compliance of he health screening program : A study for the insured in a district medical insurance association. J Korean Acad Fam Med 1997; 18(7): 739-751(Korean)
  12. 이원철 외 8인. 건강검진 . 암검사 수검행태 연구. 국민건강보험공단, 가톨릭대학교 의과대학;2004
  13. 강현철, 한상태, 최종후, 김은석, 김미경. SAS Enterprise Miner를 이용한 데이터마이닝-방법론 및 활용. 자유아카데미;2001
  14. 강현철. SAS Enterprise Miner를 이용한 데이터마이닝. 자유아카데미;1999
  15. Giudici, P. Applied Data Mining - Statistical Methods for Business and Industry. Wiley, 2003
  16. Kang SH, Choi SH. The efficient ways of health promotion program on public health center using data mining. J Korean Soc Med Inform 2001; 7(2): 37-48 (Korean)
  17. Richard L. David S. Coronary artery bypass risk prediction using neural networks. Ann Thorac Surg 1997; 63(6): 1635-1643 https://doi.org/10.1016/S0003-4975(97)00225-7