File S1 - Towards Clinical Molecular Diagnosis of Inherited Cardiac Conditions: A Comparison of Bench-Top Genome DNA Sequencers

NIAID Data Ecosystem2026-03-07 收录

下载链接：

https://figshare.com/articles/dataset/_Towards_Clinical_Molecular_Diagnosis_of_Inherited_Cardiac_Conditions_A_Comparison_of_Bench_Top_Genome_DNA_Sequencers_/740247

下载链接

链接失效反馈

官方服务：

资源简介：

ComparisonMiSeq_PGM_supplementary.docx includes six figures and two tables. Figure S1. Characteristics of target capture design: GC content and length of Access Array IFC amplicons. a. Amplicon GC content approximates to a normal distribution 50.3±11.4 (%), <7% amplicons have extreme (>70% or <30%) GC-content. b. Amplicon length (range: 65 bp to 403 bp, median 190 bp and mean 185±29); 85% have a length <200 bp; 98% amplicons have sequence length <240 bp. We used optimised Fluidigm capture to prepare library for Illumina and Ion Torrent platforms (see methods). 386 amplicons, with a combined length of 71,915 bp, are tiled over 47,660 bp of target sequence, of which 27,049 bp is protein coding. Figure S2. Base quality distributions. Sequencing base qualities before (left) and after (right) trimming and QC from (a.) MiSeq. (b.) Ion Torrent PGM. The base quality distribution (boxplot at each bar) is plotting against position in the read; the solid-line curve indicates the average base quality. Reads from Ion Torrent PGM have better base quality at 3′end as compared to the raw reads generated by MiSeq. Figure S3.Readlength distribution. The read length from MiSeq (a) vary from 20 to 135 bp, with average 115 bp±26 and median 127 bp; Ion Torrent PGM produced up to 267 bp reads (b), with average 106 bp±57 and median 102 bp. Figure S4. Coverage of target genes. Here we show the percentage of each target gene that is covered at ≥ x sequencing depth, calculated as a mean across all samples.The lower panels show the same data, with a larger scale on the x-axis. On the PGM, two genes (KCNQ1 & KCHN2) show a sharp drop-off in coverage, suggesting that some regions are difficult to robustly sequence. On the MiSeq, KCNE1 and KCNE2 also showed significant drop-off. Figure S5. Sequencing coverage of target genes. Sequencing depth is plotted for each coding base of the six target genes, on a log10 scale. Depth is calculated as a mean across 15 samples. Regions covered by a single read are therefore plotted at the origin, and regions of zero coverage have a negative deflection on the y-axis. GC content (calculated with a 50 bp sliding window on the genomic DNA forward strand) is overlaid in blue. Plus (+) or minus (-) indicates the strand on which each gene is encoded. While some regions are clearly problematic for both platforms (e.g. KCNQ1 exon 2, KCNH2 exons 1 & 12), there are also regions where one platform performs better (e.g. KCNE1, KCNE2, KCNH2 exon 4). Figure S6. The relationship between GC content and coverage. Sequencing depth (log10 scale) for each exon is plotted against its GC content. The coefficient of variation is larger for MiSeq than for Ion Torrent PGM (0.931 vs. 0.407). Loess regression is shown in red. MiSeq performance appears more variable across the GC range, whereas Ion Torrent performance falls off at high GC values, perhaps because of the additional emulsion PCR. Table S1. Barcode indexes and Ion Torrent specific adapters. Primers used for Ion Torrent PGM barcoded library prep, with index sequences highlighted. Each amplicon is inserted into the complex in both orientations: A-adaptor_Barcode_CommonSequence1_Amplicon_CommonSequence2_P1-adaptor; A-adaptor_Barcode_CommonSequence2_Amplicon_CommonSequence1_P1-adaptor. Table S2. Detected variant information. LRG = Locus Reference Genomic; Chr = Chromosome; Ref = reference allele; Alt = Alternative allele; P = Variants revealed by PGM; M = variants revealed by Miseq; Highlighted indicates the SNP was missed by both platforms. Note: All variants appearing in this table were confirmed by Sanger DNA sequencing analysis. (DOCX)

创建时间：

2013-07-04