主管:中国科学院
主办:中国优选法统筹法与经济数学研究会
   中国科学院科技战略咨询研究院

中国管理科学 ›› 2017, Vol. 25 ›› Issue (9): 116-124.doi: 10.16381/j.cnki.issn1003-207x.2017.09.013

• 论文 • 上一篇    下一篇

半参数可加beta回归模型及其应用

方匡南1, 姚紫薇2   

  1. 1. 厦门大学经济学院, 福建 厦门 361005;
    2. 上海财经大学国际工商管理学院, 上海 200433
  • 收稿日期:2016-06-16 修回日期:2016-12-07 出版日期:2017-09-20 发布日期:2017-11-24
  • 作者简介:方匡南(1983-),男(汉族),浙江台州人,厦门大学经济学院统计系教授,研究方向:数据挖掘机器学习、应用统计,E-mail:39863401@qq.com.
  • 基金资助:

    国家自然科学基金资助项目(71471152,71303200);全国统计科学研究重点项目(2015629)

Semi-parametric Additive Beta Regression Model with its Application

FANG Kuang-nan1, YAO Zi-wei2   

  1. 1. The School of Economics, Xiamen University, Xiamen 361005, China;
    2. International College of Business Administration, Shanghai University of Finance and Economcics, Shanghai 200433, China
  • Received:2016-06-16 Revised:2016-12-07 Online:2017-09-20 Published:2017-11-24

摘要: 当响应变量为连续比例数据时,即其取值为(0,1)区间时,经典的线性回归或者数据变换方法的结果往往不理想。这种情况下可以使用Ferrari and Cribari-Neto提出的beta回归模型。但是传统的beta回归模型仅局限于参数线性回归,模型的灵活度不高。本文提出了半参数可加beta回归模型以及参数估计方法,通过模拟发现具有良好的效果。另外,将半参数可加beta回归应用于医疗支出占家庭总支出比例的实证分析上,探讨了影响医疗支出占家庭总支出比例的影响因素。

关键词: 半参数, beta回归, 广义可加模型, 医疗支出

Abstract: In regression analysis, classical linear regression or its transformation methods are not satisfied when response variable is restricted to the interval (0, 1), that is, proportional or fractional data, which is common in Economics, education, medical science etc. One of the most promising approaches is the beta regression proposed by Ferrari and Cribari-Neto. However, the traditional beta regression is confined in the linear situation and thus lacks flexibility. Besides, it has specification error if the true model is not linear. Borrow the idea from generalized additive model (GAM) proposed by Hastie and Tibshirani, a semi-parametric additive beta regression model is proposed. It is assumed the model can be decomposed into parametric and nonparametric parts. For the nonparametric part, the local scoring algorithm is used to fit the unknown function and AIC is used to choose the best smoothing (tuning) parameters. Two simulation examples under different scenarios are conducted, the simulation results shows that semi-parametric beta regression model perform well. Comparing to traditional models, the proposed semi-parametric beta regression model is the best and is significantly better than other traditional models. The proposed model is applied on medical expenditure data to explore the factors of the medical expenditure portion in patients' overall expenditure. It is found marital status, age of householder, income, the number of inpatient and outpatient are the significant factor for the proportion of medical expenditure in overall expenditure.

Key words: semi-parametric, beta regression, generalized additive models, medical expenditure

中图分类号: