five

Variant discovery in full-sibling families of Pinus taeda L

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
http://datadryad.org/dataset/doi%253A10.5061%252Fdryad.hhmgqnkgg
下载链接
链接失效反馈
官方服务:
资源简介:
Fusiform rust disease, caused by the endemic fungus Cronartium quercuum f. sp. fusiforme, is the most damaging disease affecting economically important pine species in the southeast United States. In this report, we detail the genomic localization and sequence-level discovery of candidate race-nonspecific broad-spectrum fusiform rust resistance genes in Pinus taeda L. Two full-sib families, each with ~1000 progeny, were challenged with a complex inoculum consisting of over 150 pathogen isolates. High-density linkage mapping revealed three QTL distributed on two linkage groups. The two QTL on linkage group 2 were additive with respect to their effects on the probability of disease outcome. All three QTL were validated using a population of 2057 cloned pine genotypes in a six-year-old multi-environmental field trial. As a complement to the QTL mapping approach, bulked  segregant RNAseq analysis revealed a small number of candidate nucleotide binding leucine rich repeat genes harboring SNP significantly associated with disease resistance. The results of this study demonstrate that single qualitative resistance genes can confer effective resistance against genetically diverse mixtures of an endemic pathogen. Methods A total of 15 samples had adequate quantity and quality of RNA for library preparation. Each sample was sequenced on two lanes of an S2 flow cell of the NovaSeq6000 Illumina sequencer, resulting in 7.6x109 50bp paired-end reads. For each sample, reads originating from lanes 1 and 2 were combined into a single fastq file for each mate, and aligned to the PacBio reference transcriptome using bwa mem with the default options (Heng Li, 2013). Around 70% of the sequences from each sample had both mate pairs properly mapped to the transcriptome, with an average quality score of 35.8, an average insert size of ~275bp, and an average depth of 163x. Following alignment, variants were called using Freebayes version 0.9.6 (Garrison & Marth, 2012). Each bam file from a single family was combined in a variant discovery run. Since each sample represented a bulk of 100 (for the random) or 50 (for the disease status) individuals, the population model was specified as ‘pooled’ using the ‘-J’ qualifier. Complex alleles of up to 25bp were allowed using the ‘—max-complex-gap’ qualifier.   Biallelic SNP with a minimum of 10 observations of the alternate allele were considered for downstream analysis.
创建时间:
2021-04-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作