five

Optimizing exome captures in species with large genomes using species-specific repetitive DNA blocker

收藏
DataONE2024-11-15 更新2025-04-26 收录
下载链接:
https://search.dataone.org/view/sha256:8edab16907f2ecd1e75e9c33904f05ab9a4d7b22e31fcdcce521d84b6b89d599
下载链接
链接失效反馈
官方服务:
资源简介:
Large and highly repetitive genomes are common. However, research interests usually lie within the non-repetitive parts of the genome, as they are more likely functional, and can be used to answer questions related to adaptation, selection, and evolutionary history. Exome capture is a cost-effective method for providing sequencing data from protein-coding parts of the genes. C0t-1 DNA blockers consist of repetitive DNA and are used in exome captures to prevent the hybridization of repetitive DNA sequences to capture baits or bait-bound genomic DNA. Universal blockers target repetitive regions shared by many species, while species-specific c0t-1 DNA is prepared from the DNA of the studied species, thus perfectly matching the repetitive DNA contents of the species. So far the use of species-specific c0t-1 DNA has been limited to a few model species. Here, we evaluated the performance of blocker treatments in exome captures of Pinus sylvestris, a widely distributed conifer species with a l..., The original Pinus tabuliformis reference genome (v1.0; Niu et al., 2022) was masked to increase mapping to the genome by correcting problems in the genome polishing, as many identical sequences were found at the ends of different chromosomes. To construct a masked version of the reference genome, chromosomes were first split back into contigs. Contigs were then aligned within chromosomes and between unplaced contigs using Minimap2 (Li, 2018). Alignments were then chained to longer ones using the ChainPaf module of Lep-Anchor (Rastas, 2020). Half of the aligning regions of > 10 kb were masked by masking the region in shorter of the two contigs involved in the alignment., , # Data from: Optimizing Exome Captures in Species with Large Genomes Using Species-specific Repetitive DNA Blocker [https://doi.org/10.5061/dryad.qfttdz0rw](https://doi.org/10.5061/dryad.qfttdz0rw) ## Description of the data and file structure The original *Pinus tabuliformis* reference genome (v1.0; Niu et al., 2022) was masked to increase mapping to the genome by correcting errors in the genome polishing, as many identical sequences were found at the ends of different chromosomes. To construct a masked version of the reference genome, chromosomes were first split back into contigs. Contigs were then aligned within chromosomes and between unplaced contigs using Minimap2 (Li, 2018). Alignments were then chained to longer ones using the ChainPaf module of Lep-Anchor (Rastas, 2020). Half of the aligning regions of > 10 kb were masked by masking the region in shorter of the two contigs involved in the alignment. ### Files and variables #### File: liftover\_gff3.awk **Description:*...
创建时间:
2024-11-16
二维码
社区交流群
二维码
科研交流群
商业服务