Bayesian Tree-Structured Two-Level Clustering for Nested Data Analysis

Source: DataCite Commons. Updated 2024-05-31; indexed 2024-07-29.
Download link:
https://tandf.figshare.com/articles/dataset/Bayesian_Tree-Structured_Two-Level_Clustering_for_Nested_Data_Analysis/21263192/1
Description:
Data integration plays a crucial role in the era of big data. Nested data are a combined set of observations from multiple sources and exhibit heterogeneity at both the source level and the observational level. This complexity makes it challenging to visualize and jointly analyze nested data in a reasonable way. In this paper, we present a nonparametric Bayesian model that implements tree-structured two-level clustering for nested data analysis. The two-level clustering teases out the heterogeneity among sources and among observations, while a tree-structured prior models the latent hierarchy of clusters at the observational level. The proposed model is flexible: it does not require the number of clusters or the tree width and depth to be specified exactly, and it automatically learns the underlying tree structure among clusters of observations, offering an insightful visualization of the nested data. We further provide a rigorous posterior sampling scheme based on the partially collapsed Gibbs sampler and evaluate the performance of the proposed method in simulation studies. Finally, applications to two different types of nested data (multi-source image data and multi-subject single-cell expression data) demonstrate the advantages of the proposed Bayesian method.
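
To make the idea of two-level clustering without a fixed number of clusters concrete, the following is a minimal illustrative sketch and not the model from the paper. It assumes a Chinese restaurant process (CRP) at each level, a common nonparametric Bayesian building block, and it omits the tree-structured prior and the partially collapsed Gibbs sampler entirely. The names `crp_draw`, `two_level_sketch`, `alpha_source`, and `alpha_obs` are hypothetical.

```python
import random


def crp_draw(counts, alpha, rng):
    """Assign one item to a cluster under a Chinese restaurant process.

    counts[k] is the number of items already in cluster k; the list is
    updated in place. A new cluster is opened with weight alpha (> 0).
    """
    weights = counts + [alpha]
    k = rng.choices(range(len(weights)), weights=weights)[0]
    if k == len(counts):
        counts.append(0)  # open a new cluster
    counts[k] += 1
    return k


def two_level_sketch(obs_per_source, alpha_source=1.0, alpha_obs=1.0, seed=0):
    """Draw source-level and observation-level cluster labels (assumed setup).

    obs_per_source: number of observations contributed by each source.
    Sources in the same source-level cluster share one pool of
    observation-level clusters; this sharing rule is an assumption
    made for illustration only.
    """
    rng = random.Random(seed)
    source_counts = []
    obs_pools = {}  # source-level cluster -> observation-level counts
    source_labels, obs_labels = [], []
    for n_obs in obs_per_source:
        s = crp_draw(source_counts, alpha_source, rng)
        source_labels.append(s)
        pool = obs_pools.setdefault(s, [])
        obs_labels.append([crp_draw(pool, alpha_obs, rng) for _ in range(n_obs)])
    return source_labels, obs_labels


if __name__ == "__main__":
    sources, observations = two_level_sketch([5, 3, 4], seed=1)
    print("source-level labels:", sources)
    print("observation-level labels:", observations)
```

Neither level fixes the number of clusters in advance: each new item either joins an existing cluster with probability proportional to its size or opens a new one with probability proportional to the concentration parameter.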
Provider:
Taylor & Francis
Created:
2022-10-03