A Distributed and Integrated Method of Moments for High-Dimensional Correlated Data Analysis

2021
This article is motivated by a regression analysis of electroencephalography (EEG) neuroimaging data with high-dimensional correlated responses with multilevel nested correlations. We develop a divide-and-conquer procedure implemented in a fully distributed and parallelized computational scheme for statistical estimation and inference of regression parameters. Despite significant efforts in the literature, the computational bottleneck associated with high-dimensional likelihoods prevents the scalability of existing methods. The proposed method addresses this challenge by dividing responses into subvectors to be analyzed separately and in parallel on a distributed platform using pairwise composite likelihood. Theoretical challenges related to combining results from dependent data are overcome in a statistically efficient way using a meta-estimator derived from Hansen's generalized method of moments. We provide a rigorous theoretical framework for efficient estimation, inference, and goodness-of-fit tests. We develop an R package for ease of implementation. We illustrate our method's performance with simulations and the analysis of the EEG data, and find that iron deficiency is significantly associated with two auditory recognition memory related potentials in the left parietal-occipital region of the brain. for this article are available online.
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
页码:805-818|卷号:116|期号:534
ISSN:0162-1459
收录类型
SSCI
发表日期
2021
学科领域
循证社会科学-方法
国家
美国
语种
英语
DOI
10.1080/01621459.2020.1736082
其他关键词
CONFIDENCE DISTRIBUTION; LIKELIHOOD APPROACH; SAMPLE PROPERTIES; LONGITUDINAL DATA; MODELS; INFERENCE; METAANALYSIS; REGRESSION; PARAMETER; MATRIX
EISSN
1537-274X
资助机构
NIHUnited States Department of Health & Human ServicesNational Institutes of Health (NIH) - USA [R01ES024732, P01ES022844]; NSFNational Science Foundation (NSF) [DMS1811734]
资助信息
This research was funded by grants NSF DMS1811734, NIH R01ES024732, and NIH P01ES022844.
被引频次(WOS)
2
被引更新日期
2022-01
来源机构
University of Michigan System University of Michigan
关键词
Composite likelihood Divide-and-conquer Generalized method of moments Parallel computing Scalable computing