主管:中国科学院
主办:中国优选法统筹法与经济数学研究会
   中国科学院科技战略咨询研究院

中国管理科学 ›› 2022, Vol. 30 ›› Issue (4): 63-73.doi: 10.16381/j.cnki.issn1003-207x.2019.0291

• 论文 • 上一篇    下一篇

比例数据基于Tobit分位数回归模型的贝叶斯变量选择

赵为华1, 王玲1, 程喆1, 张日权2   

  1. 1. 南通大学理学院,江苏 南通226019; 2.华东师范大学统计学院,上海200241
  • 收稿日期:2019-03-05 修回日期:2019-07-30 出版日期:2022-04-20 发布日期:2022-04-26
  • 通讯作者: 赵为华(1978-), 男(汉族), 江苏海门人, 南通大学理学院,教授, 统计学博士,研究方向:复杂数据统计建模及其应用,Email:zhaowhstat@163.com. E-mail:zhaowhstat@163.com
  • 基金资助:
    国家社会科学基金资助项目(15BTJ027);国家自然科学基金资助项目(11971171)

Variable Selection of Proportional Data Based on Tobit Quantile Regression Model

ZHAO Wei-hua1, WANG Ling1, CHENG Zhe1, ZHANG Ri-quan2   

  1. 1. SchoolofSciences, Nantong University, Jiangsu Nantong 226019, China;2. School of Statistics, East China Normal University, Shanghai 200241, China
  • Received:2019-03-05 Revised:2019-07-30 Online:2022-04-20 Published:2022-04-26
  • Contact: 赵为华 E-mail:zhaowhstat@163.com

摘要: 针对家庭商业健康保险参保比例在[0,1]闭区间上取值的特点, 本文基于Tobit模型给出了比例响应数据的贝叶斯分位数回归建模方法。通过引入回归系数的“Spike-and-slab”先验分布, 应用EM 算法我们提出了基于门限规则的贝叶斯变量选择方法。大量数值模拟研究验证了所提的贝叶斯变量选择方法的有效性, 且具有易操作、计算量小等优点。最后, 将此方法应用到家庭商业健康保险数据的实证分析, 研究不同分位数水平下家庭健康保险参保比例的影响因素, 得到了许多有意义的研究结果。

关键词: 比例数据;Tobit分位数回归;贝叶斯变量选择; “Spike-and-Slab”先验; EM算法

Abstract: According to the features of the proportional data taken values on the closed interval [0,1] for the household coverage rate of commercial health insurance, in this paper, the Bayesian quantile regression modeling method is investigated for proportional response data based on the Tobit model. By introducing the “spike-and-slab” prior distributions of coefficients, the threshold rule-based Bayesian variable selection method is proposed by applying the EM algorithm. Extensive numerical simulation studies are used to verify the effectiveness of the proposed Bayesian variable selection method, and it has many merits such as easy implementation and cheap computation. Finally, the newly proposed method is applied to analyze the household coverage rate of commercial health insurance data, the influence factors of the coverage rate of commercial health insurance are uncovered at different quantile levels and some interesting results are gotten.

Key words: proportional data; tobit quantile regression; Bayesian variable selection; “Spike-and-slab” prior; EM algorithm

中图分类号: