S1 Fig. The first 10 PCA components capture a high proportion of variance in the ABCD Study®.
收藏Figshare2025-08-13 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/S1_Fig_The_first_10_PCA_components_capture_a_high_proportion_of_variance_in_the_ABCD_Study_/29902449
下载链接
链接失效反馈官方服务:
资源简介:
a) Individual component variance explained. b) Cumulative variance explained across components. S2 Fig. Best CVAE architecture outperforms PCA on reconstruction loss. We compared the mean squared error (MSE) reconstruction loss on the same test split data across 25 different random seed values for PCA solver and CVAE model weight initializations for 6 different CVAE architectures with different hidden layer sizes, all with a 100 dimensional latent space. CVAE architectures 1, 2, and 3 used hidden layer sizes of 200, 400, and 800 respectively. CVAE architectures 4, 5, and 6 first utilized PCA to reduce the data to dimensionality 400, 200, and 150 respectively and had respective hidden layer sizes of 200, 150, and 125. S3 Fig. The top 10 ranked CVAE components account for a high proportion of explained variance. Plotting the CVAE components in order of variance explained lead to the observation that ten components account for a high proportion of variance in the data, indicating that key modes of population stratification exist. S4 Fig. Structure captured by CVAE components predominantly distinct from PCA components. Pearson’s correlation was applied to the top 10 components for both CVAE and PCA ranked by explained variance. The highest correlation is observed between the top PCA component and a range of CVAE components. S5 Fig. Identified SES-centric components predict participant state residency. Confusion matrix reporting the percentage of times state label was correctly predicted across participants (diagonal) versus percentage of incorrect classifications to other states (off-diagonal). S6 Fig. Different SES components are more influential in predicting residency in different US states. Mean logistic regression coefficient values across 10-fold cross validation for each state one-versus-rest classifier. (ZIP)
创建时间:
2025-08-13



