amuvarma/10k-qa-proc
Source: Hugging Face. Updated 2025-02-09. Indexed 2025-02-15.
Download link:
https://hf-mirror.com/datasets/amuvarma/10k-qa-proc
Description:
The dataset contains three integer sequences per sample: input IDs (int32), attention masks (int8), and labels (int64). It provides a training split of 9,895 samples with a total file size of 13,759,926 bytes. The provided configuration specifies the data files for the training split.
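As a rough illustration of the figures in the description, the sketch below computes the per-position storage cost of the three dtypes and the average bytes per sample. It also shows one way the training split might be loaded with the Hugging Face `datasets` library. The helper names are hypothetical, and whether the quoted file size is compressed or uncompressed is not stated on this page, so the average is only an approximate indicator.

```python
# Storage size in bytes of one element for each dtype named in the description.
BYTES_PER_ELEMENT = {"int32": 4, "int8": 1, "int64": 8}

# Figures quoted in the description.
NUM_TRAIN_SAMPLES = 9895
TOTAL_BYTES = 13759926


def bytes_per_position() -> int:
    """Bytes needed for one position across input IDs, attention mask, and labels."""
    return sum(BYTES_PER_ELEMENT.values())


def average_bytes_per_sample() -> float:
    """Quoted total file size divided by the quoted number of training samples."""
    return TOTAL_BYTES / NUM_TRAIN_SAMPLES


def load_train_split():
    """Load the training split (hypothetical helper).

    Requires the `datasets` package and network access. The import is done
    lazily so the helpers above work without it.
    """
    from datasets import load_dataset

    return load_dataset("amuvarma/10k-qa-proc", split="train")


if __name__ == "__main__":
    print(bytes_per_position())
    print(round(average_bytes_per_sample(), 1))
```

Calling `load_train_split()` should return a dataset object whose `features` attribute can be inspected to confirm the column names and dtypes, which this page does not list explicitly.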
Provider:
amuvarma



