
Simulation data based on Angus cattle genotypes

Source: DataONE. Updated 2017-05-20; indexed 2024-06-26.
Download link:
https://search.dataone.org/view/sha256:600387074ca3365321c063d4a5c578025b4abacc55513b2be50601b55b7f5fcb
Description:
Simulated genotypes and phenotypes for 5,000 cattle, generated as the progeny of 948 Aberdeen Angus beef cattle that were genotyped with the Illumina 777K BovineHD BeadChip. Each zip file contains the data for one replicate of the simulation under two scenarios (common or rare QTL). For each scenario there are six data files: three PLINK binary-format SNP files (.bed, .bim, .fam), a QTL info file (columns: QTL ID, QTL effect, QTL allele frequency), a phenotype file for the 4,000 training individuals, and a file of true genetic values for the 1,000 testing individuals. The .fam files are identical across simulation replicates. Because duplicated files are not allowed in this database, only one .fam file is present; it can be used to read the genotype data for other replicates with PLINK by changing the rep ID in the filename.
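
Because only one .fam file is stored, loading another replicate means pairing that replicate's .bed and .bim files with the shared .fam. The sketch below shows one way to build such a PLINK 1.9 call, using PLINK's separate `--bed`, `--bim`, and `--fam` options. The file naming pattern (`<scenario>_rep<N>`), the helper name `plink_args`, and the use of `--cow` for the chromosome set are assumptions made for illustration; the actual filenames and chromosome coding in the archive should be checked before use.

```python
def plink_args(data_dir, scenario, rep, shared_fam, out_prefix, species_flag="--cow"):
    """Build a PLINK 1.9 argument list for one simulation replicate.

    The naming pattern '<scenario>_rep<N>' is hypothetical; adjust it to
    match the filenames found in the downloaded zip archives.
    """
    # Replicate-specific genotype files (.bed and .bim differ per replicate).
    prefix = f"{data_dir}/{scenario}_rep{rep}"
    return [
        "plink",
        species_flag,                 # assumed: cattle chromosome set
        "--bed", f"{prefix}.bed",
        "--bim", f"{prefix}.bim",
        "--fam", str(shared_fam),     # the single .fam shared by all replicates
        "--make-bed",                 # write a self-contained fileset
        "--out", str(out_prefix),
    ]


if __name__ == "__main__":
    # Example: replicate 3 of the common-QTL scenario, reusing the one stored .fam.
    args = plink_args("data", "common", 3, "data/common_rep1.fam", "out/common_rep3")
    print(" ".join(args))
```

The resulting list can be passed to `subprocess.run(args, check=True)` when PLINK is installed and on the `PATH`. Writing a new fileset with `--make-bed` gives each replicate its own .fam copy, so later tools can use a single `--bfile` prefix.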

Created:
2023-11-22