Additional file 1: Table S1. of Draft genome of Brugia pahangi: high similarity between B. pahangi and B. malayi
收藏DataCite Commons2024-12-13 更新2024-07-25 收录
下载链接:
https://springernature.figshare.com/articles/dataset/Additional_file_1_Table_S1_of_Draft_genome_of_Brugia_pahangi_high_similarity_between_B_pahangi_and_B_malayi/4357718
下载链接
链接失效反馈官方服务:
资源简介:
Statistic of pre-filter data. A total of 24 GB of raw data were generated and used for assembly with total sequence depth of 214.29 and total physical depth of 3203.63. Table S2. Statistic post filtering. A total of 18.76 GB data were generated after filtering process with total sequence depth of 167.5 and total physical depth of 2290.07. Table S3. Transposable elements (TEs) content in the Assembled B. pahangi Genome. The highest TEs predicted at the DNA level was found by RepBase (0.75 %). Table S4. General statistics of predicted protein-coding genes. Three de novo based methods used: AUGUSTUS, SNAP and GLIMMERHMM. Table S5. Statistics of function annotation. The method for functional annotation is divided into two types that are automated and manual. The automated method, TrEMBL database shows the highest number of annotation among all methods used with percentage of 96.64 % while InterPro database produces the most number of annotations with percentage of 71.32 % among manually curated databases. Table S6. B. pahangi genes mapped to Swissprot via blast. Table S7. B. pahangi genes with intrepro annotations. Table S8. B. pahangi genes with Gene ontology annotations. Table S9. B. pahangi genes mapped to KEGG via blast. Table S10. List of 569 B. pahangi unique genes. Table S11. The 403 Wolbachia genes in B. pahangi unique genes. Table S12. The 26 B. Pahangi unique proteins with their respective KEGG pathway class. Table S13. The 803 Wolbachia genes in B. pahangi genome. Table S14. General statistics of repeats in genome. From the table, TRF shows the biggest repeat size of 2.9 Mbp, which represent 3.34 % of the genome size. (XLSX 2292 kb)
预过滤数据统计。本次实验共生成24 GB原始数据,用于基因组组装,总序列深度为214.29,总物理深度为3203.63。
表S2:过滤后数据统计。过滤流程完成后共得到18.76 GB有效数据,总序列深度为167.5,总物理深度为2290.07。
表S3:组装的马来丝虫(Brugia pahangi)基因组中转座元件(Transposable Elements, TEs)含量分析。通过RepBase数据库预测的DNA类转座元件占比最高,为0.75%。
表S4:预测的蛋白质编码基因通用统计信息。本次采用3种从头预测方法:AUGUSTUS、SNAP与GLIMMERHMM。
表S5:功能注释统计。功能注释方法分为自动化注释与手动注释两类。其中自动化注释体系中,TrEMBL数据库的注释数量最多,占比达96.64%;而人工整理数据库中,InterPro数据库的注释数量最多,占比为71.32%。
表S6:通过BLAST比对至Swiss-Prot数据库的马来丝虫基因。
表S7:带有InterPro注释的马来丝虫基因。
表S8:带有基因本体(Gene Ontology, GO)注释的马来丝虫基因。
表S9:通过BLAST比对至KEGG数据库的马来丝虫基因。
表S10:569个马来丝虫特异性基因列表。
表S11:马来丝虫特异性基因中的403个沃尔巴克氏体(Wolbachia)基因。
表S12:26个马来丝虫特异性蛋白及其对应的KEGG通路分类。
表S13:马来丝虫基因组中的803个沃尔巴克氏体基因。
表S14:基因组重复序列通用统计信息。由该表可知,TRF检测到的重复序列总长度最大,为2.9 Mbp,占基因组总大小的3.34%。(XLSX 2292 kb)
提供机构:
Figshare
创建时间:
2016-12-14



