Unified distributed robust regression and variable selection framework for massive data

2021
This paper proposes a unified distributed robust regression framework for distributed massive data, which can include many robust regressions in one setting. Specifically, we first transfer different types of robust regressions into an asymptotically equivalent least-squares problem. Then the resulting estimator can be calculated as a weighted average of robust local estimators, and the communication cost is reduced, since it involves only one round of communication. In addition, since the local data information is incorporated sufficiently, it is adaptive to the heterogeneity. The new estimator is proven to be equivalent with the corresponding global robust regression estimator. Furthermore, we conduct variable selection based on the unified robust regression framework and adaptive LASSO, and the path of solution can also be conveniently obtained by LARS algorithm. It is theoretically shown that the new variable selection method can select true relevant variables consistently by using a new distributed BIC-type tuning parameter selector. The simulation results confirm the effectiveness of the new methods and the correctness of the theoretical results.
EXPERT SYSTEMS WITH APPLICATIONS
卷号:186
ISSN:0957-4174
来源机构
Shandong Technology & Business University
收录类型
SSCI
发表日期
2021
学科领域
循证管理学
国家
中国
语种
英语
DOI
10.1016/j.eswa.2021.115701
其他关键词
NONCONCAVE PENALIZED LIKELIHOOD; COMPRESSION; SHRINKAGE; ALGORITHM
EISSN
1873-6793
资助机构
NNSF project of ChinaNational Natural Science Foundation of China (NSFC) [11901356]; wealth management project of Shandong Technology and Business University [2019ZBKY047]
资助信息
The research was supported by NNSF project of China (11901356) and wealth management project of Shandong Technology and Business University (2019ZBKY047) .
被引频次(WOS)
0
被引更新日期
2022-01
关键词
Distributed massive data Robust regression Communication efficiency Variable selection