shengchao/SNP20k
Source: Hugging Face. Updated 2025-04-07; indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/shengchao/SNP20k
Description:
The dataset consists of input ID sequences and attention mask sequences, split into a training set of 450,000 samples and a validation set of 37,915 samples. The full dataset is 5,752,333,535 bytes; the download is 2,193,703,687 bytes.
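The figures above can be put in perspective with a little arithmetic. The following is a minimal sketch in Python that uses only the numbers stated in the description. The function name `summarize` and the dictionary keys are hypothetical, chosen for illustration. The per-sample average is a rough estimate, since it assumes the stated total size is spread evenly over all samples and ignores any metadata overhead.

```python
# Figures copied from the dataset description.
TRAIN_SAMPLES = 450_000
VALIDATION_SAMPLES = 37_915
DATASET_BYTES = 5_752_333_535   # total dataset size
DOWNLOAD_BYTES = 2_193_703_687  # size of the download


def summarize():
    """Derive a few summary statistics from the stated figures.

    `summarize` and its keys are illustrative names, not part of the dataset.
    """
    total_samples = TRAIN_SAMPLES + VALIDATION_SAMPLES
    return {
        "total_samples": total_samples,
        # Share of all samples held out for validation.
        "validation_fraction": VALIDATION_SAMPLES / total_samples,
        # Sizes in GiB (1 GiB = 2**30 bytes).
        "dataset_gib": DATASET_BYTES / 2**30,
        "download_gib": DOWNLOAD_BYTES / 2**30,
        # Download size relative to the full dataset size.
        "download_ratio": DOWNLOAD_BYTES / DATASET_BYTES,
        # Rough average, assuming size is spread evenly over samples.
        "avg_bytes_per_sample": DATASET_BYTES / total_samples,
    }


if __name__ == "__main__":
    for key, value in summarize().items():
        print(f"{key}: {value}")
```

By this estimate the validation set is a little under 8% of all samples, and the download is a bit more than a third of the full dataset size.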
Provider:
shengchao



