abir-hr196/rlvr-hard-examples
Source: Hugging Face. Updated 2026-03-23; indexed 2026-03-29.
Download link:
https://hf-mirror.com/datasets/abir-hr196/rlvr-hard-examples
Dataset description:
---
dataset_info:
  features:
  - name: problem_id
    dtype: int64
  - name: problem
    dtype: string
  - name: answer
    dtype: string
  - name: rlvr_chain_of_thought
    dtype: string
  - name: rlvr_predicted_answer
    dtype: string
  - name: rlvr_correct
    dtype: bool
  - name: base_chain_of_thought
    dtype: string
  - name: base_predicted_answer
    dtype: string
  - name: base_correct
    dtype: bool
  - name: model
    dtype: string
  - name: source_dataset
    dtype: string
  splits:
  - name: llama8b_gsm8k
    num_bytes: 58270
    num_examples: 30
  - name: llama8b_math500
    num_bytes: 133270
    num_examples: 50
  - name: llama8b_mmlu_pro
    num_bytes: 133000
    num_examples: 49
  - name: llama8b_svamp
    num_bytes: 31280
    num_examples: 18
  - name: qwen1_5b_gsm8k
    num_bytes: 175620
    num_examples: 50
  - name: qwen1_5b_math500
    num_bytes: 204612
    num_examples: 49
  - name: qwen1_5b_mmlu_pro
    num_bytes: 202836
    num_examples: 50
  - name: qwen1_5b_svamp
    num_bytes: 79735
    num_examples: 27
  - name: qwen14b_gsm8k
    num_bytes: 58776
    num_examples: 18
  - name: qwen14b_mmlu_pro
    num_bytes: 213550
    num_examples: 50
  - name: qwen14b_svamp
    num_bytes: 42935
    num_examples: 21
  download_size: 772767
  dataset_size: 1333884
configs:
- config_name: default
  data_files:
  - split: llama8b_gsm8k
    path: data/llama8b_gsm8k-*
  - split: llama8b_math500
    path: data/llama8b_math500-*
  - split: llama8b_mmlu_pro
    path: data/llama8b_mmlu_pro-*
  - split: llama8b_svamp
    path: data/llama8b_svamp-*
  - split: qwen1_5b_gsm8k
    path: data/qwen1_5b_gsm8k-*
  - split: qwen1_5b_math500
    path: data/qwen1_5b_math500-*
  - split: qwen1_5b_mmlu_pro
    path: data/qwen1_5b_mmlu_pro-*
  - split: qwen1_5b_svamp
    path: data/qwen1_5b_svamp-*
  - split: qwen14b_gsm8k
    path: data/qwen14b_gsm8k-*
  - split: qwen14b_mmlu_pro
    path: data/qwen14b_mmlu_pro-*
  - split: qwen14b_svamp
    path: data/qwen14b_svamp-*
---
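Usage sketch (not part of the original card): the metadata above names each split as `<model>_<benchmark>` and gives every row paired `rlvr_*` and `base_*` fields. The Python below is a minimal illustration under those assumptions. The helper names `parse_split`, `examples_per_model`, `outcome_table`, and `load_split` are hypothetical, and the split counts are copied by hand from the metadata. `load_split` relies on the Hugging Face `datasets` package and network access, so it is defined but not called here.

```python
from collections import Counter

# Split name -> num_examples, copied from the dataset_info metadata above.
SPLIT_EXAMPLES = {
    "llama8b_gsm8k": 30,
    "llama8b_math500": 50,
    "llama8b_mmlu_pro": 49,
    "llama8b_svamp": 18,
    "qwen1_5b_gsm8k": 50,
    "qwen1_5b_math500": 49,
    "qwen1_5b_mmlu_pro": 50,
    "qwen1_5b_svamp": 27,
    "qwen14b_gsm8k": 18,
    "qwen14b_mmlu_pro": 50,
    "qwen14b_svamp": 21,
}

# Benchmark suffixes that appear in the split names.
BENCHMARKS = ("gsm8k", "math500", "mmlu_pro", "svamp")


def parse_split(name):
    """Split a name such as 'qwen1_5b_mmlu_pro' into (model, benchmark)."""
    for bench in BENCHMARKS:
        suffix = "_" + bench
        if name.endswith(suffix):
            return name[: -len(suffix)], bench
    raise ValueError(f"unrecognized split name: {name}")


def examples_per_model(split_examples=SPLIT_EXAMPLES):
    """Sum num_examples over all splits that share a model prefix."""
    totals = Counter()
    for name, count in split_examples.items():
        model, _ = parse_split(name)
        totals[model] += count
    return dict(totals)


def outcome_table(rows):
    """Count (rlvr_correct, base_correct) pairs over an iterable of row dicts."""
    return Counter(
        (bool(row["rlvr_correct"]), bool(row["base_correct"])) for row in rows
    )


def load_split(split="llama8b_gsm8k"):
    """Fetch one split from the Hub (needs network and `pip install datasets`)."""
    # Imported lazily so the offline helpers above work without the package.
    from datasets import load_dataset

    return load_dataset("abir-hr196/rlvr-hard-examples", split=split)
```

With the package installed, `outcome_table(load_split("qwen14b_svamp"))` would tally how often the RLVR and base predictions agree on correctness for that split. Note that the metadata lists no `qwen14b_math500` split, so `load_split` should only be given one of the eleven names in `SPLIT_EXAMPLES`.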
Provider:
abir-hr196



