Grouped Heterogeneous Mixture Modeling for Clustered Data

2021
Clustered data are ubiquitous in a variety of scientific fields. In this article, we propose a flexible and interpretable modeling approach, called grouped heterogeneous mixture modeling, for clustered data, which models cluster-wise conditional distributions by mixtures of latent conditional distributions common to all the clusters. In the model, we assume that clusters are divided into a finite number of groups and mixing proportions are the same within the same group. We provide a simple generalized EM algorithm for computing the maximum likelihood estimator, and an information criterion to select the numbers of groups and latent distributions. We also propose structured grouping strategies by introducing penalties on grouping parameters in the likelihood function. Under the settings where both the number of clusters and cluster sizes tend to infinity, we present asymptotic properties of the maximum likelihood estimator and the information criterion. We demonstrate the proposed method through simulation studies and an application to crime risk modeling in Tokyo.
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION
页码:999-1010|卷号:116|期号:534
ISSN:0162-1459
来源机构
University of Tokyo
收录类型
SSCI
发表日期
2021
学科领域
循证社会科学-方法
国家
日本
语种
英语
DOI
10.1080/01621459.2020.1777136
其他关键词
MAXIMUM-LIKELIHOOD-ESTIMATION
EISSN
1537-274X
资助机构
JSPS KAKENHIMinistry of Education, Culture, Sports, Science and Technology, Japan (MEXT)Japan Society for the Promotion of ScienceGrants-in-Aid for Scientific Research (KAKENHI) [16H07406, 18K12757]
资助信息
The authors gratefully acknowledge JSPS KAKENHI grant numbers 16H07406 and 18K12757 .
被引频次(WOS)
2
被引更新日期
2022-01
关键词
EM algorithm Finite mixture Maximum likelihood estimation Mixture of experts Unobserved heterogeneity