Evaluating Classical and Artificial Intelligence Methods for Credit Risk Analysis

Bruno Reis; António Quintino

doi:10.58567/jea02030006

Authors

Bruno Reis Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal https://orcid.org/0000-0003-1848-8756
António Quintino CEG-IST, Instituto Superior Técnico, Universidade de Lisboa, Lisbon, Portugal https://orcid.org/0000-0003-0058-2488

DOI:

https://doi.org/10.58567/jea02030006

Keywords:

Credit scoring; artificial intelligence; discriminant analysis; logistic regression; artificial neural networks; random forest

Abstract

Credit scoring remains one of the most important subjects in financial risk management. Although the methods in this field have grown in sophistication, further improvements are necessary. These advances could translate in major gains for financial institutions and other companies that extend credit by diminishing the potential for losses in this process. This research seeks to compare statistical and artificial intelligence (AI) predictors in a credit risk analysis setting, namely the discriminant analysis, the logistic regression (LR), the artificial neural networks (ANNs), and the random forests. In order to perform this comparison, these methods are used to predict the default risk for a sample of companies that engage in trade credit. Pre-processing procedures are established, namely in the form of a proper sampling technique to assure the balance of the sample. Additionally, multicollinearity in the dataset is assessed via an analysis of the variance inflation factors (VIFs), and the presence of multivariate outliers is investigated with an algorithm based on robust Mahalanobis distances (MDs). After seeking the most beneficial architectures and/or settings for each predictor category, the final models are then compared in terms of several relevant key performance indicators (KPIs). The benchmarking analysis revealed that the artificial intelligence methods outperformed the statistical approaches.

References

Abdou, H. A., & Pointon, J. (2011). Credit scoring, statistical techniques and evaluation criteria: a review of the literature. Intelligent Systems in Accounting, Finance and Management, 18, 59–88.

Addo, P. M., Guegan, D., & Hassani, B. (2018). Credit Risk Analysis Using Machine and Deep Learning Models. Risks, 6(2):38.

Aguilera, A., Escabias, M., & Valderrama, M. (2006). Using principal components for estimating logistic regression with high-dimensional multicollinear data. Computational Statistics & Data Analysis, 50, 1905-1924.

Altman, E. I. (1968). Financial Ratios, Discriminant Analysis and the Prediction of Corporate Bankruptcy. The Journal of Finance, 23, 589-609.

Angelini, E., di Tollo, G., & Roli, A. (2008). A neural network approach for credit risk evaluation. Quarterly Review of Economics and Finance, 48, 733–755.

Archer, K., & Kimes, R. (2008). Empirical characterization of random forest variable importance measures. Computational Statistics & Data Analysis, 52, 2249-2260.

Ayala, H., & Coelho, L. (2016). Cascaded evolutionary algorithm for nonlinear system identification based on correlation functions and radial basis functions neural networks. Mechanical Systems and Signal Processing, 68, 378–393.

Baesens, B., Setiono, R., Mues, C., & Vanthienen, J. (2003). Using Neural Network Rule Extraction andDecision Tables for Credit-Risk Evaluation. Management Science, 49, 312-329.

Barnett, V. & Lewis, T. (1994). Outliers in Statistical Data (3rd ed.). Chichester, UK: Wiley

Baser, F., Koc, O., & Selcuk-Kestel, A. (2023). Credit risk evaluation using clustering based fuzzy classificationmethod. Expert Systems with Applications, 223.

Batista, A. (2012). Credit Scoring – Uma ferramenta de gestão financeira. Porto, Portugal: Vida Económica.

Beliakov, G., Kelarev, A., & Yearwood, J. (2011). Robust artificial neural networks and outlier detection. Technical report.

Breiman, L. (1996). Bagging Predictors. Machine Learning, 24, 123-140.

Breiman, L. (2001). Random forests. Machine Learning, 45, 5-32.

Brereton, R., & Lloyd (2016). Re-evaluating the role of the Mahalanobis distance measure. Journal of Chemometrics, 30, 134-143.

Bryll, R., Gutierrez-Osuna, R., & Quek, F. (2003). Attribute bagging: improving accuracy of classifier ensembles by using random feature subsets. Pattern Recognition, 36, 1291-1302.

Chen, X., Wang, D., Liu, Z., & Wu, Y. (2018). A Fast Direct Position Determination for Multiple Sources Based on Radial Basis Function Neural Network. 10th International Conference on Communication Software and Networks (ICCSN), 381-385.

Craney, T., & Surles, J. (2002). Model-Dependent Variance Inflation Factor Cutoff Values. Quality Engineering, 14, 391-403.

Crone, S., & Finlay, F. (2012). Instance sampling in credit scoring: An empirical study of sample size and balancing. International Journal of Forecasting, 28, 224-238.

Dawoud, I., Awwad, F., Tageldin, E., & Abonazel, M. (2022). New Robust Estimators for Handling Multicollinearity and Outliers in the Poisson Model: Methods, Simulation and Applications. Axioms, 11

Dietterich, T. (2000). An Experimental Comparison of Three Methods for Constructing Ensembles of Decision Trees: Bagging, Boosting, and Randomization. Machine Learning, 40, 139-157.

Dumitrescu, E., Hué, S., Hurlin, C., & Tokpavi, S. 2022. Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects. European Journal of Operational Research, 297(3), 1178-1192.

Fabbri, D., & Menichini, A. (2010). Trade credit, collateral liquidation and borrowing constraints. Journal of Financial Economics, 96, 413-432.

Filzmoser, P. (2004). A multivariate outlier detection method. Proceedings of the Seventh International Conference on ComputerData Analysis and Modeling, 1, 18-22.

Finlay, S. (2011). Multiple classifier architectures and their application to credit risk assessment. European Journal of Operational Research, 210, 368-378.

Fletcher, P., Venkatasubramanian, S., & Joshi, S. (2008). 2008 IEEE Conference on Computer Vision and Pattern Recognition.

Grubbs, F. (1969). Procedures for Detecting Outlying Observations in Samples. Technometrics ,11(1), 1-21.

Hastie, T., Tibshirani, R., & Friedman, J. H. (2009). The elements of statistical learning: data mining,inference, and prediction (2nd ed.). New York, USA: Springer

Huang, X., Liu, X., & Ren, Y. (2018). Enterprise credit risk evaluation based on neural network algorithm. Cognitive Systems Research, 52, 317–324.

Huang, Z., Chen, H., Hsu, C. J., Chen, W. H., & Wu, S. (2004). Credit rating analysis with support vector machines and neural networks: A market comparative study. Decision Support Systems, 37,543–558.

Jones, S., Johnstone, D., & Wilson, R. (2015). An empirical evaluation of the performance of binary classifiers in the prediction of credit ratings changes. Journal of Banking and Finance, 56, 72–85.

Khashman, A. (2010). Neural networks for credit risk evaluation: Investigation of different neural models and learning schemes. Expert Systems with Applications, 37, 6233–6239.

Kvamme, H., Sellereite, N., Aas, K., & Sjursen, S. (2018). Predicting mortgage default using convolutional neural networks. Expert Systems with Applications, 102, 207–217.

Lai, K., Yu, L., Wang, S., & Zhou, L. (2006). Credit risk analysis using a reliability-based neural network ensemble model. Artificial Neural Networks – ICANN 2006, 682–690.

Lee, T. S., Chiu, C. C., Lu, C. J., & Chen, I. F. (2002). Credit scoring using the hybrid neural discriminant technique. Expert Systems with Applications, 23(3), 245–254.

Leys, C., Klein, O., Dominicy, Y., & Ley, C (2018). Detecting multivariate outliers: Use a robust variant of the Mahalanobis distance. Journal of Experimental Social Psychology, 74, 150-156.

Lessmann, S., Baesens, B., Seow, H., & Thomas, L. (2015). Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. European Journal of Operational Research, 247,124-136.

Khemakhem, S., & Boujelbènea, Y. (2015). Credit risk prediction: A comparative study between discriminant analysis and the neural network approach. Accounting and Management Information Systems, 14(1), 60–78.

Ong, C. S., Huang, J. J., & Tzeng, G. H. (2005). Building credit scoring models using genetic programming. Expert Systems with Applications, 29, 41-47.

Pacelli, V., & Azzollini, M. (2011). An Artificial Neural Network Approach for Credit Risk Management. Journal of Intelligent LearningSystems and Applications, 3, 103–112.

Paleologo, G., Elisseeff, A., & Antonini, G. (2010). Subagging for credit scoring models. European Journal of Operational Research, 201, 490-499.

Press, S., & Wilson, S. (1978). Choosing Between Logistic Regression and Discriminant Analysis. Journal of the American Statistical Association, 73, 699-705.

MathWorks. Detect outliers in multivariate datasets. (2019). https://www.mathworks.com/matlabcentral/fileexchange/65817-detect-outliers-in-multivaraite-datasets Accessed 24 September 2019.

Šušteršič, M., Mramor, D., & Zupan, J. (2009). Consumer credit scoring models with limited data. Expert Systems with Applications, 36, 4736-4744

Swets, J., Dawes, R., & Monahan, J. (2000). Better decisions through science. Scientific American, 283(4), 82–87.

Tang, Y., Ji, J., Gao, S., Dai, H., Yu, Y., & Todo, Y. (2018). A Pruning Neural Network Model in Credit Classification Analysis. Computational Intelligence and Neuroscience, 2018, 1-22.

Thompson, C., Kim, R., Aloe, A., & Becker, B. (2017). Extracting the Variance Inflation Factor and Other Multicollinearity Diagnostics from Typical Regression Results. Basic and Applied Social Psychology, 39(2), 81-90.

Vellido, A., Lisboa, P. J. G. & Vaughan, J. (1999). Neural networks in business: A survey of applications (1992-1998). Expert Systems with Applications, 17, 51-70.

West, D. (2000). Neural network credit scoring models. Computers and Operations Research, 27, 1131–1152.

Wójcicka, A. (2017). Neural Networks in Credit Risk Classification of Companies in the Construction Sector. Econometric Research in Finance, 2(2), 63–77.

Zhao, Z., Xu, S., Kang, B. H., Kabir, M. M., Liu, Y., & Wasinger, R. (2015). Investigation and improvement of multi-layer perception neural networks for credit scoring. Expert Systems with Applications, 42, 3508-3516.

Evaluating Classical and Artificial Intelligence Methods for Credit Risk Analysis

Authors

DOI:

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License