five

Human_Read_Removal_Investigation___Viral_Read_Sets

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/ERP169281
下载链接
链接失效反馈
官方服务:
资源简介:
Teams within the Wellcome Sanger Institute have been collaborating to investigate the performance of multiple Human Read Removal (HRR) methods. To this end, a synthetic titration dataset was produced, comprising read sets derived from viral reads arising from a bait capture experiment artificially mixed with equal proportions of 27 human read sets derived from samples in the 30X Illumina Coverage study under the 1000 Genomes Project (PRJEB31736). Viral and human reads were mixed in varying proportions ranging from 1:9 to 9:1 human:viral, with 3 iterations of each viral read set + human reads at each titre for representative sampling. This study contains the full synthetic titration dataset used to validate the human read removal methodologies, with 351 read sets comprising 13 viral samples, at 9 human titres with 3 iterations of each. Reads arising from the 1000 genomes project samples can be identified in read names by the presence of a relevant ENA run accession number (prefixed ERR), other reads are derived from the viral samples. This dataset was run through multiple human read removal methods and subsequent read composition was interrogated in a read-fate analysis to assess performance of the various methods.
创建时间:
2025-02-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作