
Assembly-based structural variation and haplotypes from targeted sub-Megabase DNA molecules. Homo sapiens strain:GM12878

Source: NIAID Data Ecosystem (indexed 2026-03-10)
Download link:
https://www.ncbi.nlm.nih.gov/bioproject/PRJNA430705
Resource description:
Sequencing target DNA molecules of 0.1 Mb and larger has many advantages for delineating complex genomic features. These include: higher target coverage, which increases the sensitivity for identifying SV breakpoints; reduced impact of off-target sequences; rapid local assembly of Mb-scale regions, given the reduction in data size and complexity; and cost-effectiveness, which comes from examining only regions of interest rather than whole genomes. With such a method, one can generate large, near-Mb haplotypes, and structural variation can be readily characterized even when it is present at lower allelic fractions. However, current methods do not offer these features and typically involve smaller molecules of less than 0.1 Mb.

As a solution, we have developed a genomic sequencing approach that offers all of the aforementioned improvements. Our approach takes live cells as input, which minimizes any degradation of genomic material before the target enrichment process. By combining in vitro CRISPR-Cas9 segmentation with automated electrophoretic size selection, our approach efficiently enriches intact high molecular weight target fragments from multiple genomic origins with high specificity. We demonstrate that large segments of DNA can be targeted and sequenced efficiently. Moreover, an eluted fraction can be used directly, without any further treatment, for downstream sequencing library preparation (e.g., 10X Chromium whole genome sequencing), and the resulting library is already target-enriched without further target capture steps.

We designed three assays and tested them on the GM12878 cell line, for which abundant genomic information is available. The assays targeted BRCA1, 41 regions with different SV events, and the entire 4-Mb MHC region, respectively. gRNAs were designed to generate 100-kb or 200-kb fragments, and target coverage was more than 100X, while overall coverage including non-target regions was approximately 4X. Our targeted linked-read sequencing provided complete phased haplotypes for the targets of interest with high cost efficiency.
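
The figures quoted in the description can be put in perspective with a little arithmetic. The Python sketch below is illustrative only and is not part of the deposited data or the authors' pipeline. It assumes, hypothetically, that the 4-Mb MHC region is tiled end to end by equal-size fragments, and it treats the reported coverage values (more than 100X on target, approximately 4X overall) as round numbers; the function names are made up for this example.

```python
import math


def fragments_to_tile(region_bp: int, fragment_bp: int) -> int:
    """Count equal-size fragments needed to cover a region end to end.

    Assumption for illustration: contiguous, non-overlapping tiling.
    The description does not state how the gRNA cut sites were spaced.
    """
    return math.ceil(region_bp / fragment_bp)


def fold_enrichment(target_cov: float, overall_cov: float) -> float:
    """Ratio of on-target coverage to overall coverage."""
    return target_cov / overall_cov


MHC_REGION_BP = 4_000_000  # "entire 4-Mb MHC region" from the description

n_100kb = fragments_to_tile(MHC_REGION_BP, 100_000)
n_200kb = fragments_to_tile(MHC_REGION_BP, 200_000)

# Target coverage is reported as "more than 100X", so this is a rough lower bound.
min_enrichment = fold_enrichment(100.0, 4.0)

print(f"100-kb fragments to tile the MHC region: {n_100kb}")
print(f"200-kb fragments to tile the MHC region: {n_200kb}")
print(f"Fold enrichment over overall coverage: at least about {min_enrichment:.0f}x")
```

Under these assumptions, the MHC assay would correspond to a few dozen target fragments, and the reported coverage implies on-target reads are enriched roughly 25-fold or more relative to the overall coverage.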
Created:
2018-01-18