five

Additional file 6 of MetaPro: a scalable and reproducible data processing and analysis pipeline for metatranscriptomic investigation of microbial communities

收藏
DataCite Commons2024-08-14 更新2024-08-19 收录
下载链接:
https://springernature.figshare.com/articles/dataset/Additional_file_6_of_MetaPro_a_scalable_and_reproducible_data_processing_and_analysis_pipeline_for_metatranscriptomic_investigation_of_microbial_communities/26600305
下载链接
链接失效反馈
官方服务:
资源简介:
Additional file 6: Table S2. Polycistronic read statistics for NOD mouse. This table shows the tally of non-overlapping paired-end reads that were assembled into contigs by MetaPro through rnaSPADes and subsequently annotated into discrete genes by MetaGeneMark. This table also shows the prevalence of polycistronic reads that exist within the data. BWA was used to align the assembled paired-end reads against the genes to identify discordant alignments between the forward and reverse-end read of a pair with the same ID.  This table has seven columns: 1) the sample ID. 2) the sample description. 3) the total number of alignments is the number of alignments of a read to a gene that BWA reported. 4) The total number of pairs is the number of IDs that BWA aligned, be it forward, reverse, or both paired-end reads.  5) The paired-end disagreements column are the number of times a forward-end and reverse-end read had different alignments for each NOD mouse sample.  6) The paired-end agreements column shows the number of times a forward-read and reverse-end read aligned to the same gene.  7)  The percentage of paired-end disagreements, relative to the total number of paired-end reads in the sample. The percentage of disagreements (polycistronic reads) are at-best 0.23%, and at-worst 7.8% of assembled, non-overlapped paired-end reads in the NOD mouse samples.

附加文件6:表S2。非肥胖糖尿病(NOD)小鼠多顺反子读段统计数据。本表格统计了经rnaSPADes组装为重叠群(contigs)、再由MetaGeneMark注释为独立基因的非重叠配对末端读段(paired-end reads)。本表格同时展示了数据集中多顺反子读段的分布情况。本研究使用BWA将组装后的配对末端读段比对至基因序列,以识别具有相同ID的读段对中,正向读段与反向读段的比对结果不一致的情况。本表格包含7列:1)样本ID;2)样本描述;3)总比对数:指BWA报告的读段与基因的比对总次数;4)读段对总数:指经BWA完成比对的ID总数,涵盖正向读段、反向读段或二者均完成比对的情况;5)配对末端读段比对不一致数:指每个NOD小鼠样本中,正向与反向读段比对结果不一致的次数;6)配对末端读段比对一致数:指正向与反向读段比对至同一基因的次数;7)配对末端读段比对不一致率:指样本中配对末端读段比对不一致数占总配对末端读段数的比例。在NOD小鼠样本的组装非重叠配对末端读段中,比对不一致率(即多顺反子读段占比)最低为0.23%,最高为7.8%。
提供机构:
figshare
创建时间:
2024-08-13
二维码
社区交流群
二维码
科研交流群
商业服务