COST-SENSITIVE EXPLAINABLE AI FOR CORPORATE BANKRUPTCY PREDICTION USING FINANCIAL RATIOS

Aditya  Deshpande; Ritu Saxena; Karthik Subramanian

Authors

Aditya Deshpande
Ritu Saxena
Karthik Subramanian

Keywords:

Bankruptcy Prediction, Financial Distress, Explainable AI, Financial Ratios, Cost-Sensitive Learning, Business Analytics

Abstract

The prediction of corporate bankruptcy is at the heart of financial risk management, but its infrequent occurrence and thelack of transparency in the models remain limiting its application in business and accounting practice. In response tothese issues, an explainable machine learning framework is constructed in a cost-sensitive way to predict bankruptcybased on structured financial ratios. The algorithm combines gradient boosting, where LightGBM will be used as themain classifier, class-weighted optimization, and systematic threshold calibration to solve asymmetric misclassificationrisk. Stratified cross-validation and minority-centric measures, especially Precision-Recall Area Under the Curve (PR-AUC), are used to measure model performance, in conjunction with F 1-score, recall and balanced accuracy. SHapleyAdditive exPlanations (SHAP) is used to measure global feature importance and nonlinear financial impact to make surethat the managerial interpretation is possible. Empirical results show strong discriminatory performance in the case ofextreme class imbalance, with balanced precision and recall, and reliably identifying liquidity and leverage ratios as thekey factors in the distress risk. The findings indicatethat the applicability of interpretable boosting models can beconsidered as precursory alert systems in credit screening, investment screening, and regulatory monitoring. Thecombination of explainable artificial intelligence with cost-sensitive learningin a single financial ratio model providesmethodologically sound and decision-relevant information to the modern business analytics and accounting research.

Downloads

Download data is not yet available.

References

Chen, T., & Guestrin, C. (2016, August). Xgboost: A scalable tree boosting system. InProceedings of the 22nd acm

sigkdd international conference on knowledge discovery and data mining(pp. 785-794).

Ribeiro, M. T., Singh, S., & Guestrin, C. (2016, August). " Why should i trust you?" Explaining the predictions of any

classifier. InProceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and datamining(pp. 1135-1144).

Zięba, M., Tomczak, S. K., & Tomczak, J. M. (2016). Ensemble boosted trees with synthetic features generation in

application to bankruptcy prediction.Expert systems with applications,58, 93-101.

Barboza, F., Kimura, H., & Altman, E. (2017).Machine learning models and bankruptcy prediction.Expert systems

with applications,83, 405-417.

Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., ... & Liu, T. Y. (2017). Lightgbm: A highly efficientgradient

boosting decision tree.Advances in neural information processing systems,30.

Lundberg, S. M., & Lee, S. I. (2017).A unified approach to interpreting model predictions.Advances in neural

information processing systems,30.

Lin, T. Y., Goyal,P., Girshick, R., He, K., & Dollár, P. (2017). Focal loss for dense object detection. InProceedings of

the IEEE international conference on computer vision(pp. 2980-2988).

Boughorbel, S., Jarray, F., & El-Anbari, M. (2017). Optimal classifier for imbalanced data using Matthews Correlation

Coefficient metric.PloS one,12(6), e0177678.

Prokhorenkova, L., Gusev, G., Vorobev, A., Dorogush, A. V., & Gulin, A. (2018). CatBoost: unbiased boosting with

categorical features.Advances in neural information processing systems,31.

Hosaka, T. (2019). Bankruptcy prediction using imaged financial ratios and convolutional neural networks.Expert

systems with applications,117, 287-299.

Chicco, D., & Jurman, G. (2020). The advantages of the Matthews correlation coefficient (MCC) over F1 score and

accuracy in binary classification evaluation.BMC genomics,21(1), 6.

Yerashenia, N., Bolotov, A., Chan, D., & Pierantoni, G. (2020, June). Semantic data pre-processing for machine

learning based bankruptcy prediction computational model. In2020 IEEE 22nd Conference on Business Informatics

(CBI)(Vol. 1, pp. 66-75). IEEE.

Ben Jabeur, S., Stef, N., & Carmona, P. (2023).Bankruptcy prediction using the XGBoost algorithm and variable

importance feature engineering.Computational Economics,61(2), 715-741.

Kuizinienė, D., Krilavičius, T., Damaševičius, R., & Maskeliūnas, R. (2022). Systematic review of financial distress

identification using artificial intelligence methods.Applied Artificial Intelligence,36(1), 2138124.

Hacibedel, M. B., & Qu, R. (2022).Understanding and predicting systemic corporate distress: a machine-learning

approach. International Monetary Fund.

Zhong, J., & Wang, Z. (2022). Artificial intelligence techniques for financial distress prediction.AIMS

Mathematics,7(12), 20891-20908.

Chen, T. K., Liao, H. H., Chen, G. D., Kang, W. H., & Lin, Y. C. (2023).Bankruptcy prediction using machine learning

models with the text-based communicative value of annual reports.Expert Systems with Applications,233, 120714.18.Florek, P., & Zagdański, A. (2023). Benchmarking state-of-the-art gradient boosting algorithms for

classification.arXiv preprint arXiv:2305.17094.

Taiwanese Bankruptcy Prediction. (2020).UCI Machine Learning Repository. University of California, Irvine, School

of Information and Computer Sciences.https://archive.ics.uci.edu/dataset/572/taiwanese+bankruptcy+prediction

COST-SENSITIVE EXPLAINABLE AI FOR CORPORATE BANKRUPTCY PREDICTION USING FINANCIAL RATIOS

Authors

Keywords:

Abstract

Downloads

References

Downloads

Published

How to Cite

Issue

Section

License

sidemenu

Quick Links

Submission Info

Manuscript Submission

crossref