five

Host/microbiome interactions in NIH-Heterogeneous Stock rats (study based on 16S data)

收藏
DataCite Commons2025-10-27 更新2026-04-25 收录
下载链接:
https://figshare.com/articles/dataset/Host_microbiome_interactions_in_NIH-Heterogeneous_Stock_rats_study_based_on_16S_data_/28769039/1
下载链接
链接失效反馈
官方服务:
资源简介:
These files are relevant to the study https://www.biorxiv.org/content/10.1101/2025.03.20.644349v2 (accepted for publication in Nature Communications) and the associated code available at https://github.com/Baud-lab/P50/tree/Master/16S and https://github.com/Baud-lab/CoreQuantGen. The folder source_data_paper includes the source data that can be used to reproduce the figures in the paper.<b>175568_57950_analysis_16S_FilterfeaturesagainstreferencefilterfeaturesPhylogenetictreedatabasesgg202210202210taxonomyasvnwkqza</b>BIOM file from Qiita (https://qiita.ucsd.edu/) artefact 175568 in analysis 57950, created by merging artefacts 175546, 175547, 175548, 175549, 175550, 175551, 175552, 175553, and 175555 (corresponding to the different library preparations) from study 11479.Includes abundance and taxonomy for all ASVs identified in the samples<b>taxonomy_Greengenes2.txt</b>1) downloaded 2022.10.taxonomy.asv.nwk.qza from http://greengenes.microbio.me/greengenes_release/2022.10-rc1/;2) downloaded file 175568_feature-table.qza from Qiita (https://qiita.ucsd.edu/) artefact 175568 in analysis 57950 (same information as BIOM table above)3) filtered the taxonomy file to keep only the ASVs present in the HS samples using ```qiime greengenes2 taxonomy-from-table --i-reference-taxonomy 022.10.taxonomy.asv.nwk.qza --i-table 175568_feature-table.qza --o-classification biom.taxonomy.qza```4) exported taxonomy to TSV file ```qiime tools export --input-path biom.taxonomy.qza --output-path taxonomy.tsv```5) renamed taxonomy as taxonomy_Greengenes2.txtCol1 is ASV sequennceCol2 is full taxonomyCol3 and Col4 not used.<b>full_biomt_clr_counts.RData</b>Contains "full_biomt" and "clr_counts" (each for 3886 rats and 93,090 ASVs)<b>deblur_rarefied_collapsed_full_biomt.RData</b>R object corresponding to Qiita artefact ID 214605. Contains "collapsed_full_biomt" (subset of 2,728 rats with &gt;10,000 sequencing reads and 2,251 taxa from phylum to species)<b>collapsed_full_biomt_collapsed_clr_counts.RData</b>Contains "collapsed_full_biomt" and "collapsed_clr_counts" (created by 4_merge_taxonomic_level.R on the associated github). 3,886 rats and 2,675 taxa (phylum to species).<b>metadata_16Spaper.RData</b>All metadata relevant to the 16S data<b>augmented_VC.RData</b>output of CoreQuantGen code, for microbiome phenotypes and using a model with DGE only (no IGE), augmented using annotate_VCs_pvalues.RColumns trait2, sample_size2, sample_size2_cm None, study2 all None in univariate analysisOther columns:- trait1 phenotype name (ASV or taxon _ cohort)- sample_size1 number of individuals with phenotype, covariate, cage and genetic information- sample_size1_cm (equal or greater than sample_size1): includes cage mates of individuals in sample_size1 for which phenotype and/or covariate is NA but with cage and genetic information- covariates_names for all phenotypes, residuals from regressing out the covariates were used. Hence, only a mean term (showing as m,e,a,n) and a group size term (not showing) were fit as fixed effect in the linear mixed models used for all genetic analyses.- conv indicates if the model converged (True)- LML -log10(Maximum Likelihood)- prop_Ad1 proportion of phenotypic variance (total_var1) explained by host genetic effects/DGE- prop_Ed1 proportion of phenotypic variance (total_var1) explained by individual environmental effects- prop_Dm1 proportion of phenotypic variance (total_var1) explained by maternal effects- prop_C1 proportion of phenotypic variance (total_var1) explained by cage effects- tot_genVar1 proportion of phenotypic variance (total_var1) explained by DGE (and IGE and DGE-IGE covariance- total_var1 total phenotypic variance- id.x not used- time_exec time it took to fit the model (and estimate STE)- STE_Ad1 standard error for prop_Ad1- STE_Ed1 standard error for prop_Ed1- STE_Dm1 standard error for prop_Dm1- STE_totv1 standard error for total_var1- corParams_Ed1_Dm1 correlation between the parameters prop_Ed1 and prop_Dm1- id.y not used- taxon1 taxon of trait1- study1 cohort of trait1- full_taxon full taxonomy for taxon1- pvalue_DGE p-value for H0: prop_Ad1 = 0- sw_bonf_pvalue_DGE and sw_qvalue_DGE not used- cw_bonf_pvalue_DGE Bonferroni corrected p-value accounting for the number of phenotypes considered in a given cohort- cw_qvalue_DGE Q value accounting for the phenotypes considered in a given cohort<b>phenos_all_estNste.RData</b>similar to augmented_VC.RData but for organismal phenotypes measured in HS rats.<b>all_VCs_corr_Ad1d2_zero_P50_Rn7_pruned_DGE.RData</b>similar to augmented_VC.RData but from a bivariate modeltrait2 is same taxon or ASV as trait1 but a different cohortcorr_Ad1d2 is the (genetic) correlation between DGE on trait1 and DGE on trait2<b>augmented_IGE_VC</b><b>.RData</b>similar to augmented_VC.RData but from a model including DGE and IGE. whole sample data only (not per cohort).<b>microbiome_DGE_QTLs.RData</b>Microbiome-associated loci (for ASVs and taxa, -logP &gt; 4)<b>microbiome_DGE_QTLs_ALL.RData</b>Microbiome-associated loci (for ASVs and taxa, -logP &gt; 4)<b>P50_Rn7_chr10qtl_allSNPS.raw</b>Genotypes for SNPs at the chromosome 10 replicated locus.<b>permutations_cagemates folder</b>For Fig 6c and SFig 13. Contains output of variance decomposition after permuting the cage mates and analysing using three different models. to be explored with analyse_scrambled.R in that same folder.<b>MI and NY folders</b>Simulations for Fig. 6D and Supplementary Fig. 14. Each folder includes simulated values and estimated values from the simulations presented in the paper and more. 0.9 0 and neg0.9 refer to the (genetic) correlation between DGE and IGE simulated. 21 and 22 are the seeds used for the simulations<b>IGE_null_simulations_SFig12</b>For SFig. 12. Contains output of variance decomposition for null (no IGE) simulations and analysing using three different models. QQ plots from 2_pvalues_VCs_bootstrap.R in that same folder.
提供机构:
figshare
创建时间:
2025-10-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作