
1000 Genomes Project Cleaned Dataset

Source: NIAID Data Ecosystem (indexed 2026-03-14)
Download link:
https://zenodo.org/record/6590781
Description:
The first four authors performed standard quality control analysis on the 1000 Genomes (1KG) Project genotypes that were generated on the Illumina Omni2.5M chip at the Broad and Sanger Institutes. The datasets were then posted on the website of The Centre for Applied Genomics at Sick Kids Hospital at https://www.tcag.ca/tools/1000genomes.html. The last two authors then looked for overlap between those datasets and the HapMap3 datasets that had gene expression data for Endoplasmic Reticulum Aminopeptidase 2 (ERAP2), and chose the Yoruba in Ibadan, Nigeria (YRI) and Utah residents with Northern and Western European ancestry (CEU) subpopulations. These two subpopulations had the largest overlap between the 1KG and HapMap3 datasets, with 91 YRI and 104 CEU samples. The text files provided in this repository contain the IDs of all individuals and the phenotypes for the labelled populations; for example, ERAP2_CEU_YRI_phenotypes.txt has phenotypes for both populations. The two *_pc_outliers.txt files contain the IDs of the individuals excluded from the analysis as principal-component outliers. In summary, 88 YRI and 102 CEU individuals were included in the analysis.
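The exclusion step described above can be sketched in Python. This is a minimal illustration, not the authors' procedure: the record does not state the layout of the text files, so the sketch assumes one individual per line with the ID in the first whitespace-delimited column. The helper names `read_ids` and `exclude_outliers` and the sample IDs are hypothetical.

```python
def read_ids(lines):
    """Return the first whitespace-delimited token of every non-empty line.

    Assumption: each line describes one individual and starts with its ID.
    """
    ids = []
    for line in lines:
        fields = line.split()
        if fields:  # skip blank lines
            ids.append(fields[0])
    return ids


def exclude_outliers(sample_ids, outlier_ids):
    """Keep sample IDs, in their original order, that are not listed as outliers."""
    outliers = set(outlier_ids)
    return [s for s in sample_ids if s not in outliers]


# Made-up IDs for illustration only; they are not taken from the dataset.
all_samples = read_ids(["S001", "S002", "S003", "", "S004"])
pc_outliers = read_ids(["S003"])
kept = exclude_outliers(all_samples, pc_outliers)
print(kept)
```

With the real files, the lists passed to `read_ids` would come from reading the population ID file and the matching *_pc_outliers.txt file line by line, after first checking that the files really follow the assumed one-ID-per-line layout.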
Created:
2023-03-15