arithmetic-circuit-overloading/synthetic-dataset-2d-500K-50K-0.2-reverse-padzero
收藏Hugging Face2026-02-26 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/arithmetic-circuit-overloading/synthetic-dataset-2d-500K-50K-0.2-reverse-padzero
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: '100'
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118106165
num_examples: 500000
- name: validation
num_bytes: 11661000
num_examples: 50000
download_size: 52035783
dataset_size: 129767165
- config_name: mul-sub-50
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118763036
num_examples: 500000
- name: validation
num_bytes: 11726808
num_examples: 50000
download_size: 53762352
dataset_size: 130489844
- config_name: mul-sub-75
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118433111
num_examples: 500000
- name: validation
num_bytes: 11693266
num_examples: 50000
download_size: 53676189
dataset_size: 130126377
- config_name: mul-sub-90
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118238301
num_examples: 500000
- name: validation
num_bytes: 11674463
num_examples: 50000
download_size: 53272842
dataset_size: 129912764
- config_name: mul-sub-95
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118172105
num_examples: 500000
- name: validation
num_bytes: 11667564
num_examples: 50000
download_size: 53008449
dataset_size: 129839669
- config_name: mul-sub-99
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118119024
num_examples: 500000
- name: validation
num_bytes: 11662233
num_examples: 50000
download_size: 52640862
dataset_size: 129781257
- config_name: plus-mul-50
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 119133129
num_examples: 500000
- name: validation
num_bytes: 11763532
num_examples: 50000
download_size: 53724256
dataset_size: 130896661
- config_name: plus-mul-75
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118620512
num_examples: 500000
- name: validation
num_bytes: 11712638
num_examples: 50000
download_size: 53661297
dataset_size: 130333150
- config_name: plus-mul-90
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118313833
num_examples: 500000
- name: validation
num_bytes: 11682100
num_examples: 50000
download_size: 53224373
dataset_size: 129995933
- config_name: plus-mul-95
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118209286
num_examples: 500000
- name: validation
num_bytes: 11671392
num_examples: 50000
download_size: 53064447
dataset_size: 129880678
- config_name: plus-mul-99
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118126545
num_examples: 500000
- name: validation
num_bytes: 11663092
num_examples: 50000
download_size: 52491889
dataset_size: 129789637
- config_name: plus-mul-sub-50
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118104085
num_examples: 500000
- name: validation
num_bytes: 11661644
num_examples: 50000
download_size: 55113468
dataset_size: 129765729
- config_name: plus-mul-sub-75
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118103195
num_examples: 500000
- name: validation
num_bytes: 11660046
num_examples: 50000
download_size: 54668161
dataset_size: 129763241
- config_name: plus-mul-sub-90
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118104286
num_examples: 500000
- name: validation
num_bytes: 11661381
num_examples: 50000
download_size: 54104770
dataset_size: 129765667
- config_name: plus-mul-sub-95
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118104895
num_examples: 500000
- name: validation
num_bytes: 11661274
num_examples: 50000
download_size: 53542392
dataset_size: 129766169
- config_name: plus-mul-sub-99
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118105641
num_examples: 500000
- name: validation
num_bytes: 11661058
num_examples: 50000
download_size: 52630869
dataset_size: 129766699
- config_name: plus-sub-50
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 116460479
num_examples: 500000
- name: validation
num_bytes: 11496676
num_examples: 50000
download_size: 52167042
dataset_size: 127957155
- config_name: plus-sub-75
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 117282507
num_examples: 500000
- name: validation
num_bytes: 11578135
num_examples: 50000
download_size: 52694978
dataset_size: 128860642
- config_name: plus-sub-90
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 117778611
num_examples: 500000
- name: validation
num_bytes: 11628396
num_examples: 50000
download_size: 52892196
dataset_size: 129407007
- config_name: plus-sub-95
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 117942103
num_examples: 500000
- name: validation
num_bytes: 11644573
num_examples: 50000
download_size: 52787763
dataset_size: 129586676
- config_name: plus-sub-99
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 118073054
num_examples: 500000
- name: validation
num_bytes: 11657705
num_examples: 50000
download_size: 52480691
dataset_size: 129730759
configs:
- config_name: '100'
data_files:
- split: train
path: 100/train-*
- split: validation
path: 100/validation-*
- config_name: mul-sub-50
data_files:
- split: train
path: mul-sub-50/train-*
- split: validation
path: mul-sub-50/validation-*
- config_name: mul-sub-75
data_files:
- split: train
path: mul-sub-75/train-*
- split: validation
path: mul-sub-75/validation-*
- config_name: mul-sub-90
data_files:
- split: train
path: mul-sub-90/train-*
- split: validation
path: mul-sub-90/validation-*
- config_name: mul-sub-95
data_files:
- split: train
path: mul-sub-95/train-*
- split: validation
path: mul-sub-95/validation-*
- config_name: mul-sub-99
data_files:
- split: train
path: mul-sub-99/train-*
- split: validation
path: mul-sub-99/validation-*
- config_name: plus-mul-50
data_files:
- split: train
path: plus-mul-50/train-*
- split: validation
path: plus-mul-50/validation-*
- config_name: plus-mul-75
data_files:
- split: train
path: plus-mul-75/train-*
- split: validation
path: plus-mul-75/validation-*
- config_name: plus-mul-90
data_files:
- split: train
path: plus-mul-90/train-*
- split: validation
path: plus-mul-90/validation-*
- config_name: plus-mul-95
data_files:
- split: train
path: plus-mul-95/train-*
- split: validation
path: plus-mul-95/validation-*
- config_name: plus-mul-99
data_files:
- split: train
path: plus-mul-99/train-*
- split: validation
path: plus-mul-99/validation-*
- config_name: plus-mul-sub-50
data_files:
- split: train
path: plus-mul-sub-50/train-*
- split: validation
path: plus-mul-sub-50/validation-*
- config_name: plus-mul-sub-75
data_files:
- split: train
path: plus-mul-sub-75/train-*
- split: validation
path: plus-mul-sub-75/validation-*
- config_name: plus-mul-sub-90
data_files:
- split: train
path: plus-mul-sub-90/train-*
- split: validation
path: plus-mul-sub-90/validation-*
- config_name: plus-mul-sub-95
data_files:
- split: train
path: plus-mul-sub-95/train-*
- split: validation
path: plus-mul-sub-95/validation-*
- config_name: plus-mul-sub-99
data_files:
- split: train
path: plus-mul-sub-99/train-*
- split: validation
path: plus-mul-sub-99/validation-*
- config_name: plus-sub-50
data_files:
- split: train
path: plus-sub-50/train-*
- split: validation
path: plus-sub-50/validation-*
- config_name: plus-sub-75
data_files:
- split: train
path: plus-sub-75/train-*
- split: validation
path: plus-sub-75/validation-*
- config_name: plus-sub-90
data_files:
- split: train
path: plus-sub-90/train-*
- split: validation
path: plus-sub-90/validation-*
- config_name: plus-sub-95
data_files:
- split: train
path: plus-sub-95/train-*
- split: validation
path: plus-sub-95/validation-*
- config_name: plus-sub-99
data_files:
- split: train
path: plus-sub-99/train-*
- split: validation
path: plus-sub-99/validation-*
---
该数据集包含21种配置方案,所有配置的特征字段与数据划分规则共性如下:
1. 特征字段统一包含:`_id`(字符串类型)、`base_operation`(基础操作,字符串类型)、`target_operation`(目标操作,字符串类型)、`fs_examples`(少样本示例,字符串列表)、`question`(问题,字符串类型)、`answer`(答案,字符串类型)、`prompt`(提示词,字符串类型)
2. 数据划分均包含训练集与验证集,其中训练集样本量固定为500000,验证集样本量固定为50000,仅字节规模、下载大小及数据集总规模存在差异。
各配置的具体参数详情如下:
- 配置名称`100`:训练集字节数118106165,验证集字节数11661000;下载大小52035783,数据集总规模129767165
- 配置名称`mul-sub-50`:训练集字节数118763036,验证集字节数11726808;下载大小53762352,数据集总规模130489844
- 配置名称`mul-sub-75`:训练集字节数118433111,验证集字节数11693266;下载大小53676189,数据集总规模130126377
- 配置名称`mul-sub-90`:训练集字节数118238301,验证集字节数11674463;下载大小53272842,数据集总规模129912764
- 配置名称`mul-sub-95`:训练集字节数118172105,验证集字节数11667564;下载大小53008449,数据集总规模129839669
- 配置名称`mul-sub-99`:训练集字节数118119024,验证集字节数11662233;下载大小52640862,数据集总规模129781257
- 配置名称`plus-mul-50`:训练集字节数119133129,验证集字节数11763532;下载大小53724256,数据集总规模130896661
- 配置名称`plus-mul-75`:训练集字节数118620512,验证集字节数11712638;下载大小53661297,数据集总规模130333150
- 配置名称`plus-mul-90`:训练集字节数118313833,验证集字节数11682100;下载大小53224373,数据集总规模129995933
- 配置名称`plus-mul-95`:训练集字节数118209286,验证集字节数11671392;下载大小53064447,数据集总规模129880678
- 配置名称`plus-mul-99`:训练集字节数118126545,验证集字节数11663092;下载大小52491889,数据集总规模129789637
- 配置名称`plus-mul-sub-50`:训练集字节数118104085,验证集字节数11661644;下载大小55113468,数据集总规模129765729
- 配置名称`plus-mul-sub-75`:训练集字节数118103195,验证集字节数11660046;下载大小54668161,数据集总规模129763241
- 配置名称`plus-mul-sub-90`:训练集字节数118104286,验证集字节数11661381;下载大小54104770,数据集总规模129765667
- 配置名称`plus-mul-sub-95`:训练集字节数118104895,验证集字节数11661274;下载大小53542392,数据集总规模129766169
- 配置名称`plus-mul-sub-99`:训练集字节数118105641,验证集字节数11661058;下载大小52630869,数据集总规模129766699
- 配置名称`plus-sub-50`:训练集字节数116460479,验证集字节数11496676;下载大小52167042,数据集总规模127957155
- 配置名称`plus-sub-75`:训练集字节数117282507,验证集字节数11578135;下载大小52694978,数据集总规模128860642
- 配置名称`plus-sub-90`:训练集字节数117778611,验证集字节数11628396;下载大小52892196,数据集总规模129407007
- 配置名称`plus-sub-95`:训练集字节数117942103,验证集字节数11644573;下载大小52787763,数据集总规模129586676
- 配置名称`plus-sub-99`:训练集字节数118073054,验证集字节数11657705;下载大小52480691,数据集总规模129730759
所有配置的数据文件路径规则统一为:训练集文件路径为`{配置名称}/train-*`,验证集文件路径为`{配置名称}/validation-*`。
提供机构:
arithmetic-circuit-overloading



