• Title/Summary/Keyword: Gender Prediction

Search Result 126, Processing Time 0.019 seconds

A customer credit Prediction Researched to Improve Credit Stability based on Artificial Intelligence

  • MUN, Ji-Hui;JUNG, Sang Woo
    • Korean Journal of Artificial Intelligence
    • /
    • v.9 no.1
    • /
    • pp.21-27
    • /
    • 2021
  • In this Paper, Since the 1990s, Korea's credit card industry has steadily developed. As a result, various problems have arisen, such as careless customer information management and loans to low-credit customers. This, in turn, had a high delinquency rate across the card industry and a negative impact on the economy. Therefore, in this paper, based on Azure, we analyze and predict the delinquency and delinquency periods of credit loans according to gender, own car, property, number of children, education level, marital status, and employment status through linear regression analysis and enhanced decision tree algorithm. These predictions can consequently reduce the likelihood of reckless credit lending and issuance of credit cards, reducing the number of bad creditors and reducing the risk of banks. In addition, after classifying and dividing the customer base based on the predicted result, it can be used as a basis for reducing the risk of credit loans by developing a credit product suitable for each customer. The predicted result through Azure showed that when predicting with Linear Regression and Boosted Decision Tree algorithm, the Boosted Decision Tree algorithm made more accurate prediction. In addition, we intend to increase the accuracy of the analysis by assigning a number to each data in the future and predicting again.

Accuracy of predictive equations for resting energy expenditure (REE) in non-obese and obese Korean children and adolescents

  • Kim, Myung-Hee;Kim, Jae-Hee;Kim, Eun-Kyung
    • Nutrition Research and Practice
    • /
    • v.6 no.1
    • /
    • pp.51-60
    • /
    • 2012
  • Weight-controlling can be supported by a proper prescription of energy intake. The individual energy requirement is usually determined through resting energy expenditure (REE) and physical activity. Because REE contributes to 60-70% of daily energy expenditure, the assessment of REE is very important. REE is often predicted using various equations, which are usually based on the body weight, height, age, gender, and so on. The aim of this study is to validate the published predictive equations for resting energy expenditure in 76 normal weight and 52 obese Korean children and adolescents in the 7-18 years old age group. The open-circuit indirect calorimetry using a ventilated hood system was used to measure REE. Sixteen REE predictive equations were included, which were based on weight and/or height of children and adolescents, or which were commonly used in clinical settings despite its use based on adults. The accuracy of the equations was evaluated on bias, RMSPE, and percentage of accurate prediction. The means of age and height were not significantly different among the groups. Weight and BMI were significantly higher in obese group (64.0 kg, $25.9kg/m^2$) than in the non-obese group (44.8 kg, $19.0kg/m^2$). For the obese group, the Molnar, Mifflin, Liu, and Harris-Benedict equations provided the accurate predictions of > 70% (87%, 79% 77%, and 73%, respectively). On the other hand, for non-obese group, only the Molnar equation had a high level of accuracy (bias of 0.6%, RMSPE of 90.4 kcal/d, and accurate prediction of 72%). The accurate prediction of the Schofield (W/WH), WHO (W/WH), and Henry (W/WH) equations was less than 60% for all groups. Our results showed that the Molnar equation appears to be the most accurate and precise for both the non-obese and the obese groups. This equation might be useful for clinical professionals when calculating energy needs in Korean children and adolescents.

Corelationship Study between Hwa-Byung and Coronary Heart Disease, by using Framingham Coronary Risk Score (Framingham Coronary Risk Score를 이용한 화병과 심혈관계 질환과의 관련성 연구)

  • Jeong, Ha-Ryong;Koh, Sang-Baek;Park, Jong-Ku;Yu, Jun-Sang;Lee, Jae-Hyok
    • Journal of Oriental Neuropsychiatry
    • /
    • v.22 no.3
    • /
    • pp.13-22
    • /
    • 2011
  • Objectives : This study was to research the relationship between Hwa-Byung and Framingham coronary risk score(FRS), cardiovascular disease. Methods : 649 people participated in the community based cohort study in Wonju City of South Korea from July 2nd to August 30th in 2006. Educated investigators checked up systolic & diastolic blood pressure and surveyed Hwa-Byung Diagnostic Interview Schedule(HBDIS), cohort questionnaire about gender, age, smoking, diabetes. Blood sample was collected from participants to analyze total cholesterol, HDL-cholesterol. FRS was calculated from collected data. 10-year prediction of coronary heart disease was determined from FRS by using score sheet that is estimated by Wilson et al. Collected data were analyzed by the chi-square test. Results : 1. Low risk number of people was 18(52.9%) in Hwa-Byung group, 263(42.8%) in non Hwa-Byung group. p-value was 0.472. Difference of the two group was invalid. 2. The number of people below or equal to average 10-year prediction of coronary heart disease as gnder & age, Hwa-Byung group was 19(55.9%), non Hwa-Byung group was 412(67.0%). p-value was 0.251. Difference of the two group was invalid. Conclusions : There was no correlationship Between Hwa-Byung and 10-year prediction of coronary heart disease.

A Comparative Study of Predictive Factors for Passing the National Physical Therapy Examination using Logistic Regression Analysis and Decision Tree Analysis

  • Kim, So Hyun;Cho, Sung Hyoun
    • Physical Therapy Rehabilitation Science
    • /
    • v.11 no.3
    • /
    • pp.285-295
    • /
    • 2022
  • Objective: The purpose of this study is to use logistic regression and decision tree analysis to identify the factors that affect the success or failurein the national physical therapy examination; and to build and compare predictive models. Design: Secondary data analysis study Methods: We analyzed 76,727 subjects from the physical therapy national examination data provided by the Korea Health Personnel Licensing Examination Institute. The target variable was pass or fail, and the input variables were gender, age, graduation status, and examination area. Frequency analysis, chi-square test, binary logistic regression, and decision tree analysis were performed on the data. Results: In the logistic regression analysis, subjects in their 20s (Odds ratio, OR=1, reference), expected to graduate (OR=13.616, p<0.001) and from the examination area of Jeju-do (OR=3.135, p<0.001), had a high probability of passing. In the decision tree, the predictive factors for passing result had the greatest influence in the order of graduation status (x2=12366.843, p<0.001) and examination area (x2=312.446, p<0.001). Logistic regression analysis showed a specificity of 39.6% and sensitivity of 95.5%; while decision tree analysis showed a specificity of 45.8% and sensitivity of 94.7%. In classification accuracy, logistic regression and decision tree analysis showed 87.6% and 88.0% prediction, respectively. Conclusions: Both logistic regression and decision tree analysis were adequate to explain the predictive model. Additionally, whether actual test takers passed the national physical therapy examination could be determined, by applying the constructed prediction model and prediction rate.

Machine Learning-Based Prediction Technology for Medical Treatment Period of Automobile Insurance Accident Patients (머신러닝 기반의 자동차보험 사고 환자의 진료 기간 예측 기술)

  • Kyung-Keun Byun;Doeg-Gyu Lee;Hyung-Dong Lee
    • Convergence Security Journal
    • /
    • v.23 no.1
    • /
    • pp.89-95
    • /
    • 2023
  • In order to help reduce the medical expenses of patients with auto insurance accidents, this study predicted the treatment period, which is the most important factor in the medical expenses of patients in their 40s and 50s, and analyzed the factors affecting the treatment period. To this end, a mechine learning model using five algorithms such as Decision Tree was created, and its performance was compared and analyzed between models. There were three algorithms that showed good performance including Decison Tree, Gradient Boost, and XGBoost. In addition, as a result of analyzing the factors affecting the prediction of the treatment period, the type of hospital, the treatment area, age, and gender were found. Through these studies, easy research methods such as the use of AutoML were presented, and we hope that the results of this study will help policies to reduce medical expenses for automobile insurance accidents.

Prediction of speaking fundamental frequency using the voice and speech range profiles in normal adults (정상 성인에서 음성 및 말소리 범위 프로파일을 이용한 발화 기본주파수 예측)

  • Lee, Seung Jin;Kim, Jaeock
    • Phonetics and Speech Sciences
    • /
    • v.11 no.3
    • /
    • pp.49-55
    • /
    • 2019
  • This study sought to investigate whether mean speaking fundamental frequency (SFF) can be predicted by parameters of voice and speech range profile (VRP and SRP) in Korean normal adults. Moreover, it explored whether gender differences exist in the absolute differences between the SFF and estimated SFF (ESFF) predicted by the VRP and SRP. A total of 85 native Korean speakers with normal voice participated in the study. Each participant was asked to perform the VRP task using the vowel /a/ and the SRP task using the first sentence of a Korean standard passage "Ga-eul". In addition, the SFF was measured with electroglottography during a passage reading task. Predictive factors of the SFF were explored and the absolute difference between the SFF and the ESFF (DSFF) was compared between gender groups. Results indicated that predictive factors were age, gender, minimum pitch and pitch range for the VRP (adjusted $R^2=.931$), and pitch range (in semi-tones) and maximum pitch for the SRP (adjusted $R^2=.963$), respectively. The SFF and ESFF predicted by the VRP and SRP showed a strong positive correlation. The DSFF of the VRP and SRP, as well as their sum did not differ by gender. In conclusion, the SFF during a passage reading task could be successfully predicted by the parameters of the VRP and SRP tasks. In further studies, clinical implications need to be explored in patients who may exhibit deviations in SFF.

Speech Emotion Recognition Based on GMM Using FFT and MFB Spectral Entropy (FFT와 MFB Spectral Entropy를 이용한 GMM 기반의 감정인식)

  • Lee, Woo-Seok;Roh, Yong-Wan;Hong, Hwang-Seok
    • Proceedings of the KIEE Conference
    • /
    • 2008.04a
    • /
    • pp.99-100
    • /
    • 2008
  • This paper proposes a Gaussian Mixture Model (GMM) - based speech emotion recognition methods using four feature parameters; 1) Fast Fourier Transform(FFT) spectral entropy, 2) delta FFT spectral entropy, 3) Mel-frequency Filter Bank (MFB) spectral entropy, and 4) delta MFB spectral entropy. In addition, we use four emotions in a speech database including anger, sadness, happiness, and neutrality. We perform speech emotion recognition experiments using each pre-defined emotion and gender. The experimental results show that the proposed emotion recognition using FFT spectral-based entropy and MFB spectral-based entropy performs better than existing emotion recognition based on GMM using energy, Zero Crossing Rate (ZCR), Linear Prediction Coefficient (LPC), and pitch parameters. In experimental Results, we attained a maximum recognition rate of 75.1% when we used MFB spectral entropy and delta MFB spectral entropy.

  • PDF

Age and Gender Prediction from Korean Tweets with Stylometric Analysis (문체 분석을 활용한 한국어 트위터 사용자의 연령대 및 성별 예측)

  • Kim, Sang-Chae;Park, Jong-C.
    • Proceedings of the Korean Information Science Society Conference
    • /
    • 2012.06b
    • /
    • pp.303-305
    • /
    • 2012
  • 사람들은 주변의 영향을 받아 가면서 각자의 독특한 글쓰기 양식을 만들어간다. 따라서 같은 연령대와 성별을 가지는 사람들은 유사한 글쓰기 양식을 나타내는 경향이 있다. 이와 같은 가정을 바탕으로, 본 연구에서는 다양한 연령대와 성별의 사람들이 작성한 트윗의 문체를 분석하여 임의의 트윗을 작성한 저자의 연령대와 성별을 예측하는 실험을 진행하였다. 한국어 웹 언어에서 자주 보이는 표현들을 토대로 구성한 자질들과, 그에 비해 데이터와 관계가 적은 n-gram 단위의 자질들을 함께 사용하여 예측을 진행함으로써, 최대 공산 기준치보다 25%가량 높은 정확도를 보이는 예측 결과를 얻게 되었다. 이와 함께 각 자질 구성이 예측에 얼마나 효율적으로 기여하는지에 대한 이해도를 높일 수 있었다.

Relationships between Children's Social Development and Day Care Quality, Child-care Experience and Family Characteristics (탁아기관의 질, 탁아경험 및 가족특성과 아동의 사회성발달과의 관계)

  • Yang, Yeon Suk;Cho, Bok Hee
    • Korean Journal of Child Studies
    • /
    • v.17 no.2
    • /
    • pp.181-193
    • /
    • 1996
  • The purpose of this study was: (1) to examine relationships between social development and day care quality, child-care experience and family characteristics, and (2) to investigate the explainability of those related variables for social development. Subjects for this study were 252 4-year-old children and their mothers from 32 day care centers in Seoul. Harms & Clifford's Early Childhood Environment Rating Scale was used to measure the quality of day care. The main results were as follows: (1) Day care quality, child-care experience and family characteristics were significantly related to social development. (2) Child's gender, months of age, mother's child rearing attitude, the length of child-care experience, overall quality of day care, and group size significantly predicted social development. 33% of the variance of social development was explained by these variables. The relative influence of these variables to the prediction of social development was about the same.

  • PDF

A Study on Change in Perception of Community Service and Demand Prediction based on Big Data

  • Chun-Ok, Jang
    • International Journal of Advanced Culture Technology
    • /
    • v.10 no.4
    • /
    • pp.230-237
    • /
    • 2022
  • The Community Social Service Investment project started as a state subsidy project in 2007 and has grown very rapidly in quantitative terms in a short period of time. It is a bottom-up project that discovers the welfare needs of people and plans and provides services suitable for them. The purpose of this study is to analyze using big data to determine the social response to local community service investment projects. For this, data was collected and analyzed by crawling with a specific keyword of community service investment project on Google and Naver sites. As for the analysis contents, monthly search volume, related keywords, monthly search volume, search rate by age, and gender search rate were conducted. As a result, 10 items were found as related keywords in Google, and 3 items were found in Naver. The overall results of Google and Naver sites were slightly different, but they increased and decreased at almost the same time. Therefore, it can be seen that the community service investment project continues to attract users' interest.