jzshared/open_genome_131k
收藏Hugging Face2024-07-22 更新2024-07-22 收录
下载链接:
https://hf-mirror.com/datasets/jzshared/open_genome_131k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含三个主要分割:训练集、验证集和测试集。每个分割都包含大量的序列数据,这些序列以字符串形式存储。训练集包含2,424,418个示例,验证集包含304,032个示例,测试集包含306,245个示例。数据集的总下载大小约为184.54 GB,总数据集大小约为397.56 GB。数据文件按照分割类型存储在指定的路径下。
The dataset includes three main splits: training, validation, and test sets. Each split contains a large number of sequence data stored as strings. The training set consists of 2,424,418 examples, the validation set contains 304,032 examples, and the test set includes 306,245 examples. The total download size of the dataset is approximately 184.54 GB, and the total dataset size is about 397.56 GB. Data files are stored in specified paths according to their split types.
提供机构:
jzshared



