Bayesian Tree-Structured Two-Level Clustering for Nested Data Analysis
收藏DataCite Commons2024-05-31 更新2024-07-29 收录
下载链接:
https://tandf.figshare.com/articles/dataset/Bayesian_Tree-Structured_Two-Level_Clustering_for_Nested_Data_Analysis/21263192
下载链接
链接失效反馈官方服务:
资源简介:
Data integration plays a crucial role in the era of big data. The <i>nested data</i> are a combined set of observations from multiple sources and exhibit heterogeneity both at the source level and at the observational level. The complex nature makes it challenging to reasonably visualize and jointly analyze the nested data. In this article, we present a nonparametric Bayesian model to implement the <i>tree-structured two-level clustering</i> for nested data analysis. The two-level clustering is used to tease out the heterogeneity existing in the sources and observations, while a tree-structured prior is employed to model the latent hierarchy for clusters at the observational level. The proposed Bayesian model is flexible as it does not require an exact specification of cluster numbers or tree width/depth, and it can automatically learn the underlying tree structures among clusters of observations, thus, offering an insightful visualization of the nested data. We further provide a rigorous posterior sampling scheme via the partially collapsed Gibbs sampler and show the performance of the proposed method using simulation studies. Finally, the applications to two different types of nested data (multi-source image data and multi-subject single-cell expression data) demonstrate the advantages of the proposed Bayesian method. Supplementary materials for this article are available online.
提供机构:
Taylor & Francis
创建时间:
2022-10-03



