1. Introduction
The biggest concern for lenders in the loan market is whether a borrower will be able to repay his/her loan properly. This is particularly of interest, in the P2P loan market, where ordinary individuals with insufficient credit evaluation experience serve as lenders. (Kim, 2020). P2P loans are rapidly growing worldwide, drawing attention to the possibility of alternative financing for traditional finance. However, the recent rise in loan defaults in major markets such as China and South Korea has been stoking the possibilities of sustainable growth. Finding factors affecting loan repayment results is an important concern as the P2P loan has a less systematic credit evaluation system and lacks experience compared to traditional finance. Thus, despite its relatively recent emergence, borrower’s characteristic elements, and their relationship to repayment performance in the P2P loan market, is the subject of active academic research. Some of these studies focus on the specific characteristic elements of the borrower, including race (Pope & Sydnor, 2011), gender (D. Chen, Li, & Lai, 2017; X. Chen, Huang, & Ye, 2020), perceived trustworthiness (Duarte, Siegel, & Young, 2012), social networks (Everett, 2015; Freedman & Jin, 2017; Lin, Prabhala, & Viswanathan, 2013), university reputation (J. Li & Hu, 2019), and communication with lenders (Xu & Chau, 2018). Pope and Sydnor (2011) reported that black people are less successful than white people in loan repayment, using data from the U.S. Prosper platform. D. Chen et al. (2017) and X. Chen et al. (2020) used data from China’s PPDAI and Renrendai platforms to argue that the default rates of female borrowers are significantly lower than those of male borrowers. Duarte et al. (2012) revealed that among Prosper’s borrowers, those who seem to have high trustworthiness in their appearance actually have a low rate of insolvency. Everett (2015) argued that borrowers acting as members of a group show superior repayment performance than borrowers acting as individuals within the platform, and Freedman and Jin (2017) and Lin et al. (2013) claimed that people with good social relationships are better at paying back their loans. In addition, J. Li and Hu (2019) investigated college students who borrowed money from Renrendai in China and revealed that a borrower attending a university with a good reputation is likely to pay back better. Xu and Chau (2018) argued that the amount of communication between a borrower and lenders within a platform has nothing to do with the borrower’s repayment performance. Furthermore, several studies have presented models to predict repayment performance using the relationship between a borrower’s characteristic elements and his/her repayment performance (Z. Li, Li, Yao, & Wen, 2019; Tao, Dong, & Lin, 2017; Wang, Zhang, Zhao, & Wang, 2019). The main findings of the existing studies are listed in Table 1.
Table 1: Summary of prior related research
This study investigates the effects and importance of borrowers’ past repayment performance records within the platform. More than 10 years have passed since the advent of P2P loans worldwide, and an increasing number of borrowers are accumulating a track record within the platform. Therefore, it is necessary to study whether these borrowers’ past practices on the platform can be an important predictor of their creditworthiness. In this respect, this study contributes to the literature. Furthermore, most of the previous studies used a dichotomous variable with levels of success or failure as a dependent variable. The others adopted more granular levels of the dependent variable, but this also was a categorical variable. However, prior studies have limitations in that even loans classified as failure will inevitably differ in their performance depending on the degree of repayment. To overcome these limitations, this study analyzes not only the above binary variables but also the continuous variable: the repayment rate. It also attempts to increase the generalization of results by analyzing them in various ways, such as linear regression, logit, and survival analysis.
2. Data and Methodology
For this study, loan repayment data from Korea’s P2P lending platform Moneyauction and Popfunding were extracted through a web scaping method as of December 2017 and January 2020, respectively. The loan repayment data includes information about the actual amount of repayment along with the final loan repayment result and the actual payment period of the entire maturity month. In the case of Moneyauction, 4,390 records were analyzed, except for incomplete or missing data, of which 27 were repaid as they did not mature as of the collection date. In Popfunding, 60 out of the 2011 records with completeness were still in progress for repayment as of the collection date. Data showing a borrower’s past track record in the platform include the number of successful or failed repayments in Moneyauction and the indicator of quality of repayment with four levels (i.e., A through D) and the number of normal monthly repayments in Popfunding. For each platform, the above two components were used as explanatory variables to represent the borrower’s past track record, and the remaining information such as the loan amount, interest rate and maturity, gender, and age were used as control variables. The detailed description and brief statistics of the variables are provided in Tables 2 and 3, respectively.
Table 2: Variables
Table 3: Descriptive statistics
The analytical methods included 1) a Cox proportional hazard survival analysis with a dependent variable of the repayment status (i.e., repayment success or failure) over monthly time, 2) a multiple linear regression analysis with a dependent variable of the repayment rate, a continuous variable representing the actual repayment amount relative to the final expected repayment amount; and 3) a logit analysis with a dependent variable of the final dichotomous repayment result (i.e., repayment success or failure). The Cox proportional hazard survival analysis was based on 4,390 and 2011 cases for each platform. The loans that were successfully repaid or in progress were treated as censored cases and the other loans of which the repayment failed were treated as uncensored. The multiple linear regression and logit analyses covered 4,363 and 1951 records for each platform, excluding records in progress of repayment. The following simple analytical models were employed in this study:
W(t|x) = W0(t)exp(xβ), (1)
where W(t|x), as defined in the Cox’s proportional hazard model (Cox, 1972), measures the loan default function at time t given that the loan survives until time t. W0(t) denotes a baseline default function, and x is the vector of predictors. In this study, the exponent of the coefficient β is reported, which is the ratio of the expected default rate for each predictor and the baseline default rate. The default ratio, exp(β), for each predictor xi indicates the multiples of the default probability relative to the baseline case.
\(Y i=\alpha_{0}+\Sigma \beta i X i+\Sigma \gamma i C i+\varepsilon i\) (2)
\(\text { Prob }(Z i)=\alpha_{0}+\Sigma \beta i X i+\Sigma \gamma i C i+\varepsilon i\) (3)
where Yi denotes the ratio of a borrower’s actual amount of repayment to the expected amount on a cumulative basis in the linear regression model, Zi indicates the repayment status of loan i. Zi equals 1 if the borrower has fully repaid the loan and 0 otherwise in the logit model. For each model, Xi represents the explanatory variable and proxies the track records in this study. Ci is a control variable representing loan amount, loan interest rate, loan duration, a borrower’s gender and age, and other factors.
The Cox proportional hazard survival, linear regression, and logit models for the data from Moneyauction are models 1,2, and 3, and each model that analyzes Popfunding data is 4,5, and 6.
Furthermore, in order to investigate the predictive power of the track record variables, the predictive powers of the logit models for which the track record variable is or is not, respectively, were compared through the ROC curves. For each comparison, through random sampling, 80% of the data was used to construct predictive models, and the remaining 20% was tested for prediction. For the Moneyauction data, Model 3 with the track record variables and Model 3A without them were compared, and for the Popfunding data, Model 6 and Model 6A were compared similarly. All analyses were conducted using the statistical package R (version 3.6.3).
3. Results and Discussion
First, on Moneyauction, as shown in model 1, the hazard ratios of the SUCCESS and FAILURE variables are 0.768 and 3.228 at 99% confidence levels, respectively. This means that as the number of prior repayment successes increases, the default probability of the current loan decreases; and, vice versa, the default probability of the current loan increases as the number of prior repayment failures increases. In addition, FAILURE’s hazard ratio is high, indicating that it has a particularly significant impact on loan repayment performance. The FAILURE is shown to have the greatest impact on the time of default among all variables, and SUCCESS also has a major impact following the PURPOSE and INSURANCE variables. Models 2 and 3 also show that the SUCCESS and FAILURE variables have statistically significant relationships with the repayment rate and repayment status, respectively, at a 99% confidence level. In particular, the relative weight analyses reveal that the effects of the SUCCESS and FAILURE variables on the dependent variables are more important than those of other variables, as the sum of the relative importance of the two variables is high at 36.571% in Model 2 and 42.720% in Model 3. In both models, the FAILURE is the most important variable, and the SUCCESS is less important than the DURATION, but is similar or higher to the RATE andGRADE variables depending on the model. Consequently, the results of Models 1, 2, and 3 show that the borrower’s past track record in the platform have greater impacts on loan repayment performance than other factors.
Next, for Popfunding, the hazard ratios for the ORDINARY and QUALITY variables are 0.950 and 0.388, respectively, which are significant at the 99% confidence level. This indicates that the number of monthly ordinary repayments and the quality of past repayments of the borrower adversely affect the time of default for his/her current loan. In particular, the quality level of the borrower’s past repayments would be very influential on the survival status of his/her current loan. Models 5 and 6 show that both variables affect the repayment rate and repayment status in a positive direction at a 99% confidence level. Moreover, the relative weight analyses report that the combined relative importance of the two variables is 91.582% for model 5 and 90.045% for model 6, which has an absolute effect on the dependent variable compared to the other variables. The analysis results are aggregated in Table 4.
Table 4: Analyses of the effect of a borrower’s track records on his/her repayment performance
Note: β and RW represent the regression coefficient and relative weight, respectively.
** and * indicate statistical significance at the 1% and 5% levels, respectively.
Figures 1 and 2 analyze the predictive power of the borrower’s track record for his/her repayment performance in each platform. First, as seen in the ROC curve in Figure 1, for Moneyauction, the AUC (area under the curve) values of Model 3, which contains the SUCCESS and FAILURE variables and Model 3A, which exclude the explanatory variables, are reported as 0.661 and 0.724, respectively. This means that the addition of the explanatory variables improves the predictive power of the model by approximately 10% compared to otherwise. Looking at the ROC curve in Figure 2, more dramatic results are shown in Popfunding. The AUC value (i.e., 0.897) of model 6 with the ORDINARY and QUALITY variables is shown to be significantly higher than that (i.e., 0.567) of model 6A without the explanatory variables, indicating that the borrower’s track repayment performance compared to other factors.
Figure 1: ROC curves for Models 3 and 3A
Figure 2: ROC curves for Models 6 and 6A
The study reveals that the borrower’s past track record in the platform has the most impact on the repayment performance of his/her current loan in the Korean P2P lending market. This is in line with the results that Ding, Huang, and Meng (2019) analyzed using Renrendai platform data from China. The results of this study are rather surprising where it is assumed that the borrower’s credit and loan information such as credit score, income level, loan amount, loan interest rate, and loan maturity are likely to have a significant impact on the borrower’s record is highly predictive of his/her repayment performance.
In P2P loans, the platform operator provides a lot of information to help individual lenders evaluate a borrower’s creditworthiness in making their funding decisions. While some of this information is taken from the borrower and passed on to the lenders, there are also internal borrower credit scores that the platform operator evaluates on its own. As such, the platform operator strives to support the lenders’ judgments, but in fact, lenders have limited access to analytical information about what factors are actually important. Considering that most lenders who participate in P2P loan markets are unskilled and inexperienced individuals (Kim, 2020), there may be too much information that lenders need to refer to for their decision making. Such an excessive provision of information may hamper lenders’ decision-making. Therefore, the platform operator needs to continue to provide lenders with the importance and quality of the borrower’s information.
Table 4: Analyses of the effect of a borrower’s track records on his/her repayment performance
Note: β and RW represent the regression coefficient and relative weight, respectively.
** and * indicate statistical significance at the 1% and 5% levels, respectively.
4. Conclusions
This study investigates the effects and importance of borrowers’ past repayment performance track record within the platform to identify its predictive power. To this end, I analyze the detailed loan repayment data from two leading P2P lending platforms in Korea using a Cox proportional hazard, multiple linear regression, and logit models. Furthermore, the predictive power of the factors proxied by borrowers’ track records are evaluated through the receiver operating characteristic (ROC) curves. As a result, it is found that the borrowers’ past track record within the platform have the most important impact on the repayment performance of their current loans. In addition, this study also reveals that the borrowers’ track record is much more predictive of their repayment performance than any other factor.
The study provides theoretical contributions in that it reveals the importance of the borrower’s past track record in P2P loan markets and, in particular, for the first time in Korea’s multi-platform. Especially, it also contributes to existing studies in that it has not only found the relationship between the borrower’s past track record and the loan repayment performance, but also identifies the relative importance of the past track record and how much it improves the predictive power over the borrower’s repayment performance. Given that existing studies have limitations by using a dichotomous categorical variable indicating whether the repayment is successful or failed as a dependent variable, this study has the distinction of using a continuous variable of repayment rate, as well as the dichotomous category variable, in order to increase the accuracy of the analysis. It is also meaningful in that it first identifies the important role the borrower’s past track record plays in his/her repayment performance and prediction.
From a practical perspective, the findings provide important implications for participants in P2P loan markets. First, the platform operator needs to periodically analyze empirical data and share with the lenders what factors of the borrower’s information significantly impact performance in order to help lenders make decisions. In addition, lenders need to recognize that the factors that will be considered important in assessing a borrower’s creditworthiness in P2P loans are somewhat different from those considered in assessing the borrower’s creditworthiness in traditional finance. Borrowers should understand that they must manage their track record within the platform in order to continue to receive good funding from the lenders.
The study reveals that the borrower’s past track record within the platform has important effects on their repayment performance. However, it is necessary for the borrower to first succeed in funding on the platform to have a past track record. In addition, most of them are first-time borrowers, rather than those who have had multiple experiences borrowing within the platform. Thus, although a borrower’s past performance is important, it does not apply to borrowers with no prior experience. Therefore, the results of this study provide limited implications for assessing the ability of borrowers to repay loans for the first time on the platform. In this regard, further research on the differences in factors in the repayment outcome between the borrower who has no prior experience in P2P loan markets and the borrower who has extensive loan experience is likely to contribute to the existing study. In this study, the number of successful repayments and the number of failed repayments in the past are used as proxies of the borrower’s past track records in each platform. However, even loans classified as successful repayment may have different details of repayment. In addition, loans classified as failure to repay is also likely to differ depending on the degree of repayment. Although this study did not reflect these differences due to the limitations of the data, it would be possible to draw more meaningful implications if these were supplemented in the future.
References
- Chen, D., Li, X., & Lai, F. (2017). Gender discrimination in online peer-to-peer credit lending: Evidence from a lending platform in China. Electronic Commerce Research, 17, 553-583. https://doi.org/10.1007/s10660-016-9247-2
- Chen, X., Huang, B., & Ye, D. (2020). Gender gap in peer-to-peer lending: Evidence from China. Journal of Banking & Finance, 112, 105633. https://doi.org/10.1016/j.jbankfin.2019.105633
- Cox, D. R. (1972). Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological), 34(2), 187-202. https://doi.org/10.1111/j.2517-6161.1972.tb00899.x
- Ding, J., Huang, J., & Meng, M. (2019). Is there an effective reputation mechanism in peer-to-peer lending? Evidence from China. Finance Research Letters, 30, 208-215. https://doi.org/10.1016/j.frl.2018.09.015
- Duarte, J., Siegel, S., & Young, L. (2012). Trust and credit: The role of appearance in peer-to-peer lending. The Review of Financial Studies, 25(8), 2455-2484. https://doi.org/10.1093/rfs/hhs071
- Everett, C. (2015). Group membership, relationship banking and loan default risk: The case of online social lending. Banking and Finance Review, 7(2), 15-54.
- Freedman, S., & Jin, G. Z. (2017). The information value of online social networks: Lessons from peer-to-peer lending. International Journal of Industrial Organization, 51, 185-222. https://doi.org/10.1016/j.ijindorg.2016.09.002
- Han, S.-H., Kang, K., & Shin, y. S. (2016). Bond rating, Corporate governance, and cost of debt: The case of Korea. Journal of Asian Finance, Economics and Business, 5(15), 5-15.
- Kim, D. (2020). The importance of detailed patterns of herding behaviour in a P2P lending market. Applied Economics Letters, 27(2), 127-130. https://doi.org/10.1080/13504851.2019.1610698
- Li, J., & Hu, J. (2019). Does university reputation matter? Evidence from peer-to-peer lending. Finance Research Letters, 31, 66-77. https://doi.org/10.1016/j.frl.2019.04.004
- Li, Z., Li, K., Yao, X., & Wen, Q. (2019). Predicting prepayment and default risks of unsecured consumer loans in online lending. Emerging Markets Finance and Trade, 55(1), 118-132. https://doi.org/10.1080/1540496x.2018.1479251
- Lin, M., Prabhala, N. R., & Viswanathan, S. (2013). Judging borrowers by the company they keep: Friendship networks and information asymmetry in online peer-to-peer lending. Management Science, 59(1), 17-35. https://doi.org/10.1287/mnsc.1120.1560
- Park, H., & Yoo, Y. (2019). Differences among credit rating agencies and the information envirionment. Journal of Asian Finance, Economics and Business, 6(2), 25-32. https://doi.org/10.13106/JAFEB.2019.VOL6.NO2.25
- Pope, D. G., & Sydnor, J. R. (2011). What's in a picture? Evidence of discrimination from Prosper.com. Journal of Human Resources, 46(1), 53-92.
- Tao, Q., Dong, Y., & Lin, Z. (2017). Who can get money? Evidence from the Chinese peer-to-peer lending platform. Information Systems Frontiers, 19(3), 425-441. https://doi.org/10.1007/s10796-017-9751-5
- Wang, C., Zhang, W., Zhao, X., & Wang, J. (2019). Soft information in online peer-to-peer lending: Evidence from a leading platform in China. Electronic Commerce Research and Applications, 36, 100873. https://doi.org/10.1016/j.elerap.2019.100873
- Xu, J. J., & Chau, M. (2018). Cheap talk? The impact of lenderborrower communication on peer-to-peer lending outcomes. Journal of Management Information Systems, 35(1), 53-85. https://doi.org/10.1080/07421222.2018.1440776
Cited by
- Digital Payment System Analysis of Buying Decision in Indonesia vol.7, pp.10, 2020, https://doi.org/10.13106/jafeb.2020.vol7.n10.323