主管:中国科学院
主办:中国优选法统筹法与经济数学研究会
   中国科学院科技战略咨询研究院

Chinese Journal of Management Science ›› 2020, Vol. 28 ›› Issue (9): 45-53.doi: 10.16381/j.cnki.issn1003-207x.2018.1491

• Articles • Previous Articles     Next Articles

Interpretable Credit Risk Assessment Modeling Based on Improved Pedagogical Method

DONG Lu-an, YE Xin   

  1. School of Economics and Management, Dalian University of Technology, Dalian 116024, China
  • Received:2018-10-18 Revised:2019-03-20 Online:2020-09-20 Published:2020-09-25

Abstract: Credit risk assessment is one of the important tasks in risk prevention and control of financial institutions. In recent year, the credit risk assessment based on machine learning models have received increasing attention due to their better predictive performance. However, the machine learning models lack the interpretability, which makes it impossible for decision makers to fully trust the models and their predictive results. To solve the above problem, an improved pedagogical method is proposed. The proposed method uses the machine learning models guide to construct the credit risk assessment based on a decision tree, which can assist investors in decision-making. In our approach, to improve the approximation degree between decision tree and the correct function of machine learning model, a pseudo date set sampling method based on Weight Synthetic Minority Over-sampling Technique (Weight-SMOTE) is proposed to improve the proportion of pseudo samples, which is marked by the trusted function of machine learning model, in the pseudo data set. To achieve an effective trade-off between accuracy, interpretability and the consistency of generated decision tree with machine learning model, a new decision tree pruning method is proposed. At the same time, aiming at the limitation of fidelity evaluation indicator, a new evaluation indicator, true-fidelity, is proposed to effectively measure the approximation degree of the decision tree and the correct function of the machine learning model. In addition, three real credit risk assessment data sets are used in the experiment. The experimental results show that the improved pedagogical method can construct an interpretable credit risk assessment, which can meet the different decision-making preferences of decision makers.

Key words: credit risk assessment, machine learning model, interpretability, pedagogical method

CLC Number: