five

arithmetic-circuit-overloading/synthetic-dataset-2d-500K-50K-0.2-reverse-padzero

收藏
Hugging Face2026-02-26 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/arithmetic-circuit-overloading/synthetic-dataset-2d-500K-50K-0.2-reverse-padzero
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: '100' features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118106165 num_examples: 500000 - name: validation num_bytes: 11661000 num_examples: 50000 download_size: 52035783 dataset_size: 129767165 - config_name: mul-sub-50 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118763036 num_examples: 500000 - name: validation num_bytes: 11726808 num_examples: 50000 download_size: 53762352 dataset_size: 130489844 - config_name: mul-sub-75 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118433111 num_examples: 500000 - name: validation num_bytes: 11693266 num_examples: 50000 download_size: 53676189 dataset_size: 130126377 - config_name: mul-sub-90 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118238301 num_examples: 500000 - name: validation num_bytes: 11674463 num_examples: 50000 download_size: 53272842 dataset_size: 129912764 - config_name: mul-sub-95 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118172105 num_examples: 500000 - name: validation num_bytes: 11667564 num_examples: 50000 download_size: 53008449 dataset_size: 129839669 - config_name: mul-sub-99 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118119024 num_examples: 500000 - name: validation num_bytes: 11662233 num_examples: 50000 download_size: 52640862 dataset_size: 129781257 - config_name: plus-mul-50 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 119133129 num_examples: 500000 - name: validation num_bytes: 11763532 num_examples: 50000 download_size: 53724256 dataset_size: 130896661 - config_name: plus-mul-75 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118620512 num_examples: 500000 - name: validation num_bytes: 11712638 num_examples: 50000 download_size: 53661297 dataset_size: 130333150 - config_name: plus-mul-90 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118313833 num_examples: 500000 - name: validation num_bytes: 11682100 num_examples: 50000 download_size: 53224373 dataset_size: 129995933 - config_name: plus-mul-95 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118209286 num_examples: 500000 - name: validation num_bytes: 11671392 num_examples: 50000 download_size: 53064447 dataset_size: 129880678 - config_name: plus-mul-99 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118126545 num_examples: 500000 - name: validation num_bytes: 11663092 num_examples: 50000 download_size: 52491889 dataset_size: 129789637 - config_name: plus-mul-sub-50 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118104085 num_examples: 500000 - name: validation num_bytes: 11661644 num_examples: 50000 download_size: 55113468 dataset_size: 129765729 - config_name: plus-mul-sub-75 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118103195 num_examples: 500000 - name: validation num_bytes: 11660046 num_examples: 50000 download_size: 54668161 dataset_size: 129763241 - config_name: plus-mul-sub-90 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118104286 num_examples: 500000 - name: validation num_bytes: 11661381 num_examples: 50000 download_size: 54104770 dataset_size: 129765667 - config_name: plus-mul-sub-95 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118104895 num_examples: 500000 - name: validation num_bytes: 11661274 num_examples: 50000 download_size: 53542392 dataset_size: 129766169 - config_name: plus-mul-sub-99 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118105641 num_examples: 500000 - name: validation num_bytes: 11661058 num_examples: 50000 download_size: 52630869 dataset_size: 129766699 - config_name: plus-sub-50 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 116460479 num_examples: 500000 - name: validation num_bytes: 11496676 num_examples: 50000 download_size: 52167042 dataset_size: 127957155 - config_name: plus-sub-75 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 117282507 num_examples: 500000 - name: validation num_bytes: 11578135 num_examples: 50000 download_size: 52694978 dataset_size: 128860642 - config_name: plus-sub-90 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 117778611 num_examples: 500000 - name: validation num_bytes: 11628396 num_examples: 50000 download_size: 52892196 dataset_size: 129407007 - config_name: plus-sub-95 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 117942103 num_examples: 500000 - name: validation num_bytes: 11644573 num_examples: 50000 download_size: 52787763 dataset_size: 129586676 - config_name: plus-sub-99 features: - name: _id dtype: string - name: base_operation dtype: string - name: target_operation dtype: string - name: fs_examples list: string - name: question dtype: string - name: answer dtype: string - name: prompt dtype: string splits: - name: train num_bytes: 118073054 num_examples: 500000 - name: validation num_bytes: 11657705 num_examples: 50000 download_size: 52480691 dataset_size: 129730759 configs: - config_name: '100' data_files: - split: train path: 100/train-* - split: validation path: 100/validation-* - config_name: mul-sub-50 data_files: - split: train path: mul-sub-50/train-* - split: validation path: mul-sub-50/validation-* - config_name: mul-sub-75 data_files: - split: train path: mul-sub-75/train-* - split: validation path: mul-sub-75/validation-* - config_name: mul-sub-90 data_files: - split: train path: mul-sub-90/train-* - split: validation path: mul-sub-90/validation-* - config_name: mul-sub-95 data_files: - split: train path: mul-sub-95/train-* - split: validation path: mul-sub-95/validation-* - config_name: mul-sub-99 data_files: - split: train path: mul-sub-99/train-* - split: validation path: mul-sub-99/validation-* - config_name: plus-mul-50 data_files: - split: train path: plus-mul-50/train-* - split: validation path: plus-mul-50/validation-* - config_name: plus-mul-75 data_files: - split: train path: plus-mul-75/train-* - split: validation path: plus-mul-75/validation-* - config_name: plus-mul-90 data_files: - split: train path: plus-mul-90/train-* - split: validation path: plus-mul-90/validation-* - config_name: plus-mul-95 data_files: - split: train path: plus-mul-95/train-* - split: validation path: plus-mul-95/validation-* - config_name: plus-mul-99 data_files: - split: train path: plus-mul-99/train-* - split: validation path: plus-mul-99/validation-* - config_name: plus-mul-sub-50 data_files: - split: train path: plus-mul-sub-50/train-* - split: validation path: plus-mul-sub-50/validation-* - config_name: plus-mul-sub-75 data_files: - split: train path: plus-mul-sub-75/train-* - split: validation path: plus-mul-sub-75/validation-* - config_name: plus-mul-sub-90 data_files: - split: train path: plus-mul-sub-90/train-* - split: validation path: plus-mul-sub-90/validation-* - config_name: plus-mul-sub-95 data_files: - split: train path: plus-mul-sub-95/train-* - split: validation path: plus-mul-sub-95/validation-* - config_name: plus-mul-sub-99 data_files: - split: train path: plus-mul-sub-99/train-* - split: validation path: plus-mul-sub-99/validation-* - config_name: plus-sub-50 data_files: - split: train path: plus-sub-50/train-* - split: validation path: plus-sub-50/validation-* - config_name: plus-sub-75 data_files: - split: train path: plus-sub-75/train-* - split: validation path: plus-sub-75/validation-* - config_name: plus-sub-90 data_files: - split: train path: plus-sub-90/train-* - split: validation path: plus-sub-90/validation-* - config_name: plus-sub-95 data_files: - split: train path: plus-sub-95/train-* - split: validation path: plus-sub-95/validation-* - config_name: plus-sub-99 data_files: - split: train path: plus-sub-99/train-* - split: validation path: plus-sub-99/validation-* ---

该数据集包含21种配置方案,所有配置的特征字段与数据划分规则共性如下: 1. 特征字段统一包含:`_id`(字符串类型)、`base_operation`(基础操作,字符串类型)、`target_operation`(目标操作,字符串类型)、`fs_examples`(少样本示例,字符串列表)、`question`(问题,字符串类型)、`answer`(答案,字符串类型)、`prompt`(提示词,字符串类型) 2. 数据划分均包含训练集与验证集,其中训练集样本量固定为500000,验证集样本量固定为50000,仅字节规模、下载大小及数据集总规模存在差异。 各配置的具体参数详情如下: - 配置名称`100`:训练集字节数118106165,验证集字节数11661000;下载大小52035783,数据集总规模129767165 - 配置名称`mul-sub-50`:训练集字节数118763036,验证集字节数11726808;下载大小53762352,数据集总规模130489844 - 配置名称`mul-sub-75`:训练集字节数118433111,验证集字节数11693266;下载大小53676189,数据集总规模130126377 - 配置名称`mul-sub-90`:训练集字节数118238301,验证集字节数11674463;下载大小53272842,数据集总规模129912764 - 配置名称`mul-sub-95`:训练集字节数118172105,验证集字节数11667564;下载大小53008449,数据集总规模129839669 - 配置名称`mul-sub-99`:训练集字节数118119024,验证集字节数11662233;下载大小52640862,数据集总规模129781257 - 配置名称`plus-mul-50`:训练集字节数119133129,验证集字节数11763532;下载大小53724256,数据集总规模130896661 - 配置名称`plus-mul-75`:训练集字节数118620512,验证集字节数11712638;下载大小53661297,数据集总规模130333150 - 配置名称`plus-mul-90`:训练集字节数118313833,验证集字节数11682100;下载大小53224373,数据集总规模129995933 - 配置名称`plus-mul-95`:训练集字节数118209286,验证集字节数11671392;下载大小53064447,数据集总规模129880678 - 配置名称`plus-mul-99`:训练集字节数118126545,验证集字节数11663092;下载大小52491889,数据集总规模129789637 - 配置名称`plus-mul-sub-50`:训练集字节数118104085,验证集字节数11661644;下载大小55113468,数据集总规模129765729 - 配置名称`plus-mul-sub-75`:训练集字节数118103195,验证集字节数11660046;下载大小54668161,数据集总规模129763241 - 配置名称`plus-mul-sub-90`:训练集字节数118104286,验证集字节数11661381;下载大小54104770,数据集总规模129765667 - 配置名称`plus-mul-sub-95`:训练集字节数118104895,验证集字节数11661274;下载大小53542392,数据集总规模129766169 - 配置名称`plus-mul-sub-99`:训练集字节数118105641,验证集字节数11661058;下载大小52630869,数据集总规模129766699 - 配置名称`plus-sub-50`:训练集字节数116460479,验证集字节数11496676;下载大小52167042,数据集总规模127957155 - 配置名称`plus-sub-75`:训练集字节数117282507,验证集字节数11578135;下载大小52694978,数据集总规模128860642 - 配置名称`plus-sub-90`:训练集字节数117778611,验证集字节数11628396;下载大小52892196,数据集总规模129407007 - 配置名称`plus-sub-95`:训练集字节数117942103,验证集字节数11644573;下载大小52787763,数据集总规模129586676 - 配置名称`plus-sub-99`:训练集字节数118073054,验证集字节数11657705;下载大小52480691,数据集总规模129730759 所有配置的数据文件路径规则统一为:训练集文件路径为`{配置名称}/train-*`,验证集文件路径为`{配置名称}/validation-*`。
提供机构:
arithmetic-circuit-overloading
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作