主管:中国科学院
主办:中国优选法统筹法与经济数学研究会
   中国科学院科技战略咨询研究院

Chinese Journal of Management Science ›› 2023, Vol. 31 ›› Issue (10): 234-244.doi: 10.16381/j.cnki.issn1003-207x.2020.1628

Previous Articles     Next Articles

Prediction Method for Gastric Cancer Survivability Based on an Improved LightGBM Ensemble Model

Yi FENG1,Du-juan WANG1(),Zhi-neng HU1,Shao-ze CUI2   

  1. 1.Business School, Sichuan University, Chengdu 610064, China
    2.School of Economics and Management, Dalian University of Technology, Dalian 116024, China
  • Received:2020-08-23 Revised:2021-06-03 Online:2023-10-15 Published:2023-10-20
  • Contact: Du-juan WANG E-mail:wangdujuan@dlut.edu.cn

Abstract:

The prediction of gastric cancer survivability is one of the important works in the gastric cancer prognosis, which can provide decision-making support for the doctor by mining important features affecting the survivability of gastric cancer patients and accurately predicting survivability. So far, the accuracy of survivability prediction of gastric cancer is insufficient, and the interpretability of prediction models is lack. Therefore, a gastric survivability prediction method called SGPL-LightGBM based on improved LightGBM is proposed to predict the survivability of patients with gastric cancer and interpret the prediction model. The stability-based feature selection method is carried out to determine the optimal feature subset, thereby reducing the computational overhead and improving the accuracy of the prediction. After that, the intelligent optimization algorithm (Genetic algorithm) is adopted to optimize the important hyperparameters in LightGBM (Light gradient boosting machine) to further improves the performance of gastric cancer survivability prediction model. The important features of gastric survivability prediction model are analyzed by PDP (Partial dependence plot) and LIME (Local interpretable model-agnostic explanations), which explain the effect of influencing features on the predicted response of SGPL-LightGBM. Finally, the numeral experiments are conducted on real gastric dataset, and the experimental results indicate that the proposed ensemble classification prediction method SGPL-LightGBM has better accuracy and interpretability in gastric cancer survivability prediction, which can provide effective decision support for doctors to develop a treatment plan.

Key words: gastric cancer survival prediction, feature selection, ensemble learning, intelligence algorithm, interpretability

CLC Number: