arithmetic-circuit-overloading/synthetic-dataset-v2-2d-2M-200K-0.1-reverse
收藏Hugging Face2026-04-03 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/arithmetic-circuit-overloading/synthetic-dataset-v2-2d-2M-200K-0.1-reverse
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: '100'
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 455959788
num_examples: 2000000
- name: validation
num_bytes: 44997787
num_examples: 200000
download_size: 210102376
dataset_size: 500957575
- config_name: '50'
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 456054712
num_examples: 2000000
- name: validation
num_bytes: 45009903
num_examples: 200000
download_size: 325566389
dataset_size: 501064615
- config_name: '75'
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 456001459
num_examples: 2000000
- name: validation
num_bytes: 45002574
num_examples: 200000
download_size: 319716793
dataset_size: 501004033
- config_name: '90'
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 455977186
num_examples: 2000000
- name: validation
num_bytes: 44999561
num_examples: 200000
download_size: 309908063
dataset_size: 500976747
- config_name: '95'
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 455965033
num_examples: 2000000
- name: validation
num_bytes: 44999719
num_examples: 200000
download_size: 295665291
dataset_size: 500964752
- config_name: '99'
features:
- name: _id
dtype: string
- name: base_operation
dtype: string
- name: target_operation
dtype: string
- name: fs_examples
list: string
- name: question
dtype: string
- name: answer
dtype: string
- name: prompt
dtype: string
splits:
- name: train
num_bytes: 455957981
num_examples: 2000000
- name: validation
num_bytes: 44998667
num_examples: 200000
download_size: 214843015
dataset_size: 500956648
configs:
- config_name: '100'
data_files:
- split: train
path: 100/train-*
- split: validation
path: 100/validation-*
- config_name: '50'
data_files:
- split: train
path: 50/train-*
- split: validation
path: 50/validation-*
- config_name: '75'
data_files:
- split: train
path: 75/train-*
- split: validation
path: 75/validation-*
- config_name: '90'
data_files:
- split: train
path: 90/train-*
- split: validation
path: 90/validation-*
- config_name: '95'
data_files:
- split: train
path: 95/train-*
- split: validation
path: 95/validation-*
- config_name: '99'
data_files:
- split: train
path: 99/train-*
- split: validation
path: 99/validation-*
---
该数据集包含6个配置项,各配置详情如下:
1. 配置名:'100'
数据特征包括:
- _id:字符串类型,数据唯一标识符
- base_operation:字符串类型,基础操作
- target_operation:字符串类型,目标操作
- fs_examples:字符串列表,少样本示例(few-shot examples)
- question:字符串类型,问题文本
- answer:字符串类型,答案文本
- prompt:字符串类型,提示词文本
数据集划分:
- 训练集(train):字节占用量455959788,样本总数2000000
- 验证集(validation):字节占用量44997787,样本总数200000
该配置的下载大小为210102376,总数据集大小为500957575
2. 配置名:'50'
数据特征包括:
- _id:字符串类型,数据唯一标识符
- base_operation:字符串类型,基础操作
- target_operation:字符串类型,目标操作
- fs_examples:字符串列表,少样本示例
- question:字符串类型,问题文本
- answer:字符串类型,答案文本
- prompt:字符串类型,提示词文本
数据集划分:
- 训练集(train):字节占用量456054712,样本总数2000000
- 验证集(validation):字节占用量45009903,样本总数200000
该配置的下载大小为325566389,总数据集大小为501064615
3. 配置名:'75'
数据特征包括:
- _id:字符串类型,数据唯一标识符
- base_operation:字符串类型,基础操作
- target_operation:字符串类型,目标操作
- fs_examples:字符串列表,少样本示例
- question:字符串类型,问题文本
- answer:字符串类型,答案文本
- prompt:字符串类型,提示词文本
数据集划分:
- 训练集(train):字节占用量456001459,样本总数2000000
- 验证集(validation):字节占用量45002574,样本总数200000
该配置的下载大小为319716793,总数据集大小为501004033
4. 配置名:'90'
数据特征包括:
- _id:字符串类型,数据唯一标识符
- base_operation:字符串类型,基础操作
- target_operation:字符串类型,目标操作
- fs_examples:字符串列表,少样本示例
- question:字符串类型,问题文本
- answer:字符串类型,答案文本
- prompt:字符串类型,提示词文本
数据集划分:
- 训练集(train):字节占用量455977186,样本总数2000000
- 验证集(validation):字节占用量44999561,样本总数200000
该配置的下载大小为309908063,总数据集大小为500976747
5. 配置名:'95'
数据特征包括:
- _id:字符串类型,数据唯一标识符
- base_operation:字符串类型,基础操作
- target_operation:字符串类型,目标操作
- fs_examples:字符串列表,少样本示例
- question:字符串类型,问题文本
- answer:字符串类型,答案文本
- prompt:字符串类型,提示词文本
数据集划分:
- 训练集(train):字节占用量455965033,样本总数2000000
- 验证集(validation):字节占用量44999719,样本总数200000
该配置的下载大小为295665291,总数据集大小为500964752
6. 配置名:'99'
数据特征包括:
- _id:字符串类型,数据唯一标识符
- base_operation:字符串类型,基础操作
- target_operation:字符串类型,目标操作
- fs_examples:字符串列表,少样本示例
- question:字符串类型,问题文本
- answer:字符串类型,答案文本
- prompt:字符串类型,提示词文本
数据集划分:
- 训练集(train):字节占用量455957981,样本总数2000000
- 验证集(validation):字节占用量44998667,样本总数200000
该配置的下载大小为214843015,总数据集大小为500956648
所有配置的数据文件路径规则如下:
- 训练集(train)数据路径:{配置名}/train-*
- 验证集(validation)数据路径:{配置名}/validation-*
提供机构:
arithmetic-circuit-overloading



