five

GenBank Contamination

收藏
DataCite Commons2025-05-01 更新2024-08-18 收录
下载链接:
https://figshare.com/articles/GenBank_Contamination/11994579/1
下载链接
链接失效反馈
官方服务:
资源简介:
Contamination prediction for GenBank (Dec 2018) by conterminator (https://github.com/martin-steinegger/conterminator).<br>The predictions are TSV formated. The following is the column defintion:<br><br>1.) Numeric identifier<br>2.) Contaminated identifier<br>3.) Kingdom (0: Bacteria&amp;Archaea, 1: Fungi, 2: Metazoa, 3: Viridiplantae, 4: Other Eukaryotes)<br>4.) Species name<br>5.) Alignment start<br>6.) Alignment end<br>7.) Corrected contig length<br>8.) Identifier of the longest contaminating sequence<br>9.) Kingdom of the longest contaminating sequence<br>10.) Species name of the longest contaminating sequence<br>11.) Length of the longest contaminating sequence<br>12.) Count how often sequences from contaminating kingdom align<br><br>Contaminated identifiers can occur multiple times if multiple alignments were detected.
提供机构:
figshare
创建时间:
2020-03-17
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作