RemEff: sequences used for clustering
收藏NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://figshare.com/articles/dataset/RemEff_sequences_used_for_clustering/12833678
下载链接
链接失效反馈官方服务:
资源简介:
Data were collected from fungal sequences in the NCBI IPG and the UniParc databases, and from numerous curated published fungal genomes.
fungal_seqs.tsv.gz contains mappings of the non-redundant sequence names to identifiers in original databases.
fungal_seqs_curated_genomes.tsv.gz contains details of which sequences came from the curated genomes.
fungal_seqs_uniparc_signatures.tsv.gz gives details about InterPro terms present for sequences in the UniParc dataset.
fungal_seqs_uniparc_xrefs.tsv.gz maps ids from uniparc to references in other databases.
fungal_seqs_ipg.tsv gives details about sequences taken from the IPG database, mapping to other database identifiers.
本数据集的数据采集自NCBI IPG与UniParc(Universal Protein Resource)数据库中的真菌序列,以及大量经人工整理的已发表真菌基因组。
fungal_seqs.tsv.gz 包含非冗余序列名称与原始数据库标识符的映射关系。
fungal_seqs_curated_genomes.tsv.gz 收录了各序列源自经整理基因组的相关详情。
fungal_seqs_uniparc_signatures.tsv.gz 提供了UniParc数据集中序列所包含的InterPro术语的相关细节。
fungal_seqs_uniparc_xrefs.tsv.gz 实现了UniParc标识符与其他数据库参考条目之间的交叉引用映射。
fungal_seqs_ipg.tsv 详细记录了源自IPG数据库的序列信息,并将其映射至其他数据库标识符。
创建时间:
2020-08-21



