GleghornLab/taxonomy_family_0.4_clusters
收藏Hugging Face2025-09-11 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/GleghornLab/taxonomy_family_0.4_clusters
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含五个字段:条目(Entry)、序列(Sequence)、簇(cluster)、当前排名(current_rank)和标签(labels)。其中标签字段是整数类型,其余为字符串类型。数据集分为训练集、验证集和测试集,分别包含434,293、10,000和10,415个示例。数据集的总下载大小为179,499,349字节,解压后大小为186,301,266字节。
The dataset includes five fields: Entry, Sequence, cluster, current_rank, and labels. The label field is of integer type, while the others are string type. The dataset is divided into training, validation, and test sets, containing 434,293, 10,000, and 10,415 examples respectively. The total download size of the dataset is 179,499,349 bytes, and the size after decompression is 186,301,266 bytes.
提供机构:
GleghornLab



