five

RemEff: Clusters, MSAs and HMMs of fungal proteins

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://figshare.com/articles/dataset/RemEff_Clusters_MSAs_and_HMMs_of_fungal_proteins/13289655
下载链接
链接失效反馈
官方服务:
资源简介:
Complete clustering dataset from the remeff paper. "clusters.tsv" gives the IDs of cluster centroids (representative members), and cluster members in the first and second columns, respectively. "remeff_fasta.ff{data,index}" gives the multiple sequence alignments of all clusters presented in "clusters.tsv"The ffdata, ffindex format (https://github.com/soedinglab/ffindex_soedinglab) is formatted to be used by the HHsuite3 software (https://github.com/soedinglab/hh-suite) "remeff_a3m.ff{data,index}" gives the multiple sequence alignments from "remeff_fasta" but in a3m format, and is enriched with partial alignments of sequences from uniprot, and has some redundancy removed to avoid biasing HMM construction. "remeff_hhm.ff{data,index}" gives the HHsuite3 HMM representations of all clusters. This file is build from "remeff_a3m". "remeff_cs219.ff{data,index}" gives the HHsuite3 prefiltering dataset for all clusters. This file is build from "remeff_a3m". To run searches against this database using hhsuite, you need the six files matching the pattern remeff_a3m*, remeff_cs219*, and remeff_hhm*. You need to uncompress them using gzip or pigz.
创建时间:
2020-11-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作