HayatoHongoEveryonesAI/OpenMathInstruct1
收藏Hugging Face2025-11-29 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/HayatoHongoEveryonesAI/OpenMathInstruct1
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: has_llm_code
features:
- name: question
dtype: string
- name: expected_answer
dtype: string
- name: predicted_answer
dtype: string
- name: error_message
dtype: string
- name: is_correct
dtype: bool
- name: generation_type
dtype: string
- name: dataset
dtype: string
- name: generated_solution
dtype: string
splits:
- name: train
num_bytes: 4963612795.988735
num_examples: 5611023
- name: validation
num_bytes: 758165278.7408128
num_examples: 864892
download_size: 2708691269
dataset_size: 5721778074.7295475
- config_name: no_llm_code
features:
- name: question
dtype: string
- name: expected_answer
dtype: string
- name: predicted_answer
dtype: string
- name: error_message
dtype: string
- name: is_correct
dtype: bool
- name: generation_type
dtype: string
- name: dataset
dtype: string
- name: generated_solution
dtype: string
splits:
- name: train
num_bytes: 1512981002.0112643
num_examples: 1710321
- name: validation
num_bytes: 230315543.2591872
num_examples: 262737
download_size: 790671896
dataset_size: 1743296545.2704515
configs:
- config_name: has_llm_code
data_files:
- split: train
path: has_llm_code/train-*
- split: validation
path: has_llm_code/validation-*
- config_name: no_llm_code
data_files:
- split: train
path: no_llm_code/train-*
- split: validation
path: no_llm_code/validation-*
---
数据集信息:
- 配置名称:has_llm_code(含大语言模型(Large Language Model,LLM)代码配置)
特征字段:
- 字段名:question,数据类型:字符串(string)
- 字段名:expected_answer,数据类型:字符串(string)
- 字段名:predicted_answer,数据类型:字符串(string)
- 字段名:error_message,数据类型:字符串(string)
- 字段名:is_correct,数据类型:布尔值(bool)
- 字段名:generation_type,数据类型:字符串(string)
- 字段名:dataset,数据类型:字符串(string)
- 字段名:generated_solution,数据类型:字符串(string)
数据划分:
- 划分名称:train(训练集),字节大小:4963612795.988735,样本数量:5611023
- 划分名称:validation(验证集),字节大小:758165278.7408128,样本数量:864892
- 配置名称:no_llm_code(不含大语言模型(Large Language Model,LLM)代码配置)
特征字段:
- 字段名:question,数据类型:字符串(string)
- 字段名:expected_answer,数据类型:字符串(string)
- 字段名:predicted_answer,数据类型:字符串(string)
- 字段名:error_message,数据类型:字符串(string)
- 字段名:is_correct,数据类型:布尔值(bool)
- 字段名:generation_type,数据类型:字符串(string)
- 字段名:dataset,数据类型:字符串(string)
- 字段名:generated_solution,数据类型:字符串(string)
数据划分:
- 划分名称:train(训练集),字节大小:1512981002.0112643,样本数量:1710321
- 划分名称:validation(验证集),字节大小:230315543.2591872,样本数量:262737
下载总大小:2708691269,数据集总大小:5721778074.7295475
配置列表:
- 配置名称:has_llm_code(含大语言模型(Large Language Model,LLM)代码配置),数据文件:
- 划分:train(训练集),路径:has_llm_code/train-*
- 划分:validation(验证集),路径:has_llm_code/validation-*
- 配置名称:no_llm_code(不含大语言模型(Large Language Model,LLM)代码配置),数据文件:
- 划分:train(训练集),路径:no_llm_code/train-*
- 划分:validation(验证集),路径:no_llm_code/validation-*
提供机构:
HayatoHongoEveryonesAI



