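Supervised by: Chinese Academy of Sciences
Sponsored by: Chinese Society of Optimization, Overall Planning and Economic Mathematics
   Institutes of Science and Development, Chinese Academy of Sciences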

Chinese Journal of Management Science ›› 2024, Vol. 32 ›› Issue (3): 1-8. doi: 10.16381/j.cnki.issn1003-207x.2021.2434

Credit Scoring Based on Semi-supervised Support Vector Machine

Song Chen1, Xiuyun Yu2, Yongqin Qiu2, Kuangnan Fang2

  1. Micro-Finance College, Taizhou University, Taizhou 318000, China
  2. School of Economics, Xiamen University, Xiamen 361005, China
  • Received: 2021-11-23 Revised: 2022-10-17 Online: 2024-03-25 Published: 2024-03-25
  • Contact: Kuangnan Fang E-mail: xmufkn@xmu.edu.cn

Abstract:

Labeled samples in credit scoring are difficult and costly to obtain. To address this problem, a new credit scoring model based on semi-supervised support vector machines is proposed. By introducing new parameters for the unlabeled samples, the model does not require the missing-at-random assumption and is therefore broadly applicable. A semi-supervised term added to the loss function encourages similarity between the coefficients of the labeled and unlabeled samples, which effectively incorporates the information in the unlabeled samples and improves estimation. In addition, Group LASSO is used for variable selection, making full use of group structure information to screen important variables. Numerical simulations and an application to credit card default prediction data demonstrate the feasibility of the proposed method and its strong performance in variable selection, coefficient estimation, and classification prediction.
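
The structure of the objective described in the abstract can be illustrated with a minimal sketch. The abstract does not give the exact formulation, so everything below is an assumption made for illustration: the hinge loss for labeled samples, the "hat" loss `max(0, 1 - |f(x)|)` commonly used in semi-supervised SVMs for unlabeled samples, a squared-difference similarity term, and the names `beta`, `gamma`, `lam_sim`, and `lam_group` are all hypothetical and may differ from the authors' model.

```python
import numpy as np

def hinge_loss(X, y, beta):
    """Mean hinge loss for labels y in {-1, +1}."""
    margins = y * (X @ beta)
    return np.mean(np.maximum(0.0, 1.0 - margins))

def group_lasso_penalty(coef, groups):
    """Sum over groups of sqrt(group size) * L2 norm of that group's coefficients."""
    return sum(np.sqrt(len(idx)) * np.linalg.norm(coef[idx]) for idx in groups)

def objective(beta, gamma, X_lab, y_lab, X_unl, groups, lam_sim, lam_group):
    """Illustrative semi-supervised SVM objective (assumed form, not the paper's).

    beta  : coefficients fitted on the labeled samples
    gamma : separate coefficients for the unlabeled samples
    """
    labeled = hinge_loss(X_lab, y_lab, beta)
    # Assumed unlabeled loss: penalize unlabeled points that fall inside the margin.
    unlabeled = np.mean(np.maximum(0.0, 1.0 - np.abs(X_unl @ gamma)))
    # Assumed similarity term: pull the two coefficient vectors toward each other.
    similarity = lam_sim * np.sum((beta - gamma) ** 2)
    # Group LASSO applied to both coefficient vectors.
    sparsity = lam_group * (
        group_lasso_penalty(beta, groups) + group_lasso_penalty(gamma, groups)
    )
    return labeled + unlabeled + similarity + sparsity
```

In this sketch, `groups` is a list of index lists, for example the dummy columns of one categorical variable forming one group, so that a whole variable is kept or dropped together.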

Key words: semi-supervised classification, support vector machines, variable selection, credit scoring

CLC Number: