five

Reference human genome samples and synthetic mirrored representation of human genome features.

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://www.ncbi.nlm.nih.gov/sra/SRP279382
下载链接
链接失效反馈
官方服务:
资源简介:
Next-generation sequencing (NGS) can identify mutations in the human genome that cause disease, and has been widely adopted in clinical diagnosis. However, the human genome contains many polymorphic, low complexity, and repetitive regions that are difficult to sequence and analyse. Despite their difficulty, these regions include many clinically-important features, and their accurate diagnosis can inform the treatment of a range of human diseases. To evaluate the accuracy by which these difficult regions are analysed using sequencing, we built an in silico chromosome, and corresponding synthetic DNA reference standards that encode difficult and clinically important sequences of the human genome, including repeats, microsatellites, HLA genes and immune-receptors. Unlike natural genome materials, that can be difficult to unambiguously characterise at these difficult regions, the synthetic DNA standards provide a known ground-truth to evaluate the performance of a diverse sequencing technologies, reagents, and bioinformatic tools. Here we provide a comprehensive and detailed evaluation of short- and long-read sequencing instruments, PCR-based and -free library reagents, and a range of leading bioinformatic tools. This evaluation provides analytical validation for using sequence to diagnose a range of clinical important features of the human genome, and highlight the challenges in resolving these difficult regions using genome sequencing.
创建时间:
2021-07-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作