CohenQu/imo_2025_qwen3_4b_inst-eval
收藏Hugging Face2025-12-17 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/CohenQu/imo_2025_qwen3_4b_inst-eval
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: turn-0
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 12600956
num_examples: 6
download_size: 4982293
dataset_size: 12600956
- config_name: turn-1
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 10970669
num_examples: 6
download_size: 4376704
dataset_size: 10970669
- config_name: turn-10
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 9803135
num_examples: 6
download_size: 3853436
dataset_size: 9803135
- config_name: turn-11
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 9695976
num_examples: 6
download_size: 3834297
dataset_size: 9695976
- config_name: turn-12
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 9609216
num_examples: 6
download_size: 3771558
dataset_size: 9609216
- config_name: turn-13
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 9997952
num_examples: 6
download_size: 3956491
dataset_size: 9997952
- config_name: turn-14
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 10228780
num_examples: 6
download_size: 4007776
dataset_size: 10228780
- config_name: turn-15
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 9945301
num_examples: 6
download_size: 3898428
dataset_size: 9945301
- config_name: turn-2
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 10455970
num_examples: 6
download_size: 4147715
dataset_size: 10455970
- config_name: turn-3
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 10797994
num_examples: 6
download_size: 4255714
dataset_size: 10797994
- config_name: turn-4
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 10514517
num_examples: 6
download_size: 4146626
dataset_size: 10514517
- config_name: turn-5
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 9922310
num_examples: 6
download_size: 3897607
dataset_size: 9922310
- config_name: turn-6
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 10544138
num_examples: 6
download_size: 4140455
dataset_size: 10544138
- config_name: turn-7
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 10058152
num_examples: 6
download_size: 3948005
dataset_size: 10058152
- config_name: turn-8
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 9951789
num_examples: 6
download_size: 3893008
dataset_size: 9951789
- config_name: turn-9
features:
- name: grading_scheme_0
list: string
- name: grading_scheme_0_prompts
list: string
- name: grading_scheme_0_reasons
list: string
- name: grading_scheme_0_responses
list: string
- name: grading_scheme_0_rewards
list: int64
- name: grading_scheme_1
list: string
- name: grading_scheme_1_prompts
list: string
- name: grading_scheme_1_reasons
list: string
- name: grading_scheme_1_responses
list: string
- name: grading_scheme_1_rewards
list: int64
- name: grading_scheme_2
list: string
- name: grading_scheme_2_prompts
list: string
- name: grading_scheme_2_reasons
list: string
- name: grading_scheme_2_responses
list: string
- name: grading_scheme_2_rewards
list: int64
- name: grading_scheme_3
list: string
- name: grading_scheme_3_prompts
list: string
- name: grading_scheme_3_reasons
list: string
- name: grading_scheme_3_responses
list: string
- name: grading_scheme_3_rewards
list: int64
- name: grading_scheme_4
list: string
- name: grading_scheme_4_prompts
list: string
- name: grading_scheme_4_reasons
list: string
- name: grading_scheme_4_responses
list: string
- name: grading_scheme_4_rewards
list: int64
- name: grading_scheme_5
list: string
- name: grading_scheme_5_prompts
list: string
- name: grading_scheme_5_reasons
list: string
- name: grading_scheme_5_responses
list: string
- name: grading_scheme_5_rewards
list: int64
- name: grading_scheme_6
list: string
- name: grading_scheme_6_prompts
list: string
- name: grading_scheme_6_reasons
list: string
- name: grading_scheme_6_responses
list: string
- name: grading_scheme_6_rewards
list: int64
splits:
- name: test
num_bytes: 10358491
num_examples: 6
download_size: 4044947
dataset_size: 10358491
configs:
- config_name: turn-0
data_files:
- split: test
path: turn-0/test-*
- config_name: turn-1
data_files:
- split: test
path: turn-1/test-*
- config_name: turn-10
data_files:
- split: test
path: turn-10/test-*
- config_name: turn-11
data_files:
- split: test
path: turn-11/test-*
- config_name: turn-12
data_files:
- split: test
path: turn-12/test-*
- config_name: turn-13
data_files:
- split: test
path: turn-13/test-*
- config_name: turn-14
data_files:
- split: test
path: turn-14/test-*
- config_name: turn-15
data_files:
- split: test
path: turn-15/test-*
- config_name: turn-2
data_files:
- split: test
path: turn-2/test-*
- config_name: turn-3
data_files:
- split: test
path: turn-3/test-*
- config_name: turn-4
data_files:
- split: test
path: turn-4/test-*
- config_name: turn-5
data_files:
- split: test
path: turn-5/test-*
- config_name: turn-6
data_files:
- split: test
path: turn-6/test-*
- config_name: turn-7
data_files:
- split: test
path: turn-7/test-*
- config_name: turn-8
data_files:
- split: test
path: turn-8/test-*
- config_name: turn-9
data_files:
- split: test
path: turn-9/test-*
---
数据集信息:
本数据集包含16个配置,分别为turn-0至turn-15,各配置的特征字段结构完全一致,具体如下:
每个配置包含7组评分方案(grading scheme)相关条目,对应评分方案0至6,每组条目包含以下5个字段:
1. 评分方案:字符串列表,存储评分规则文本
2. 评分方案提示文本:字符串列表,存储用于生成评分的提示文本
3. 评分方案评分理由:字符串列表,存储对应评分的依据文本
4. 评分方案模型响应:字符串列表,存储模型生成的响应文本
5. 评分方案奖励值:64位整数列表,存储对应评分的奖励数值
各配置的具体划分与大小参数如下:
- 配置名称:turn-0
数据集划分:仅包含测试集(test),占用字节数:12600956,样本数量:6
下载大小:4982293,数据集总大小:12600956
- 配置名称:turn-1
数据集划分:仅包含测试集(test),占用字节数:10970669,样本数量:6
下载大小:4376704,数据集总大小:10970669
- 配置名称:turn-10
数据集划分:仅包含测试集(test),占用字节数:9803135,样本数量:6
下载大小:3853436,数据集总大小:9803135
- 配置名称:turn-11
数据集划分:仅包含测试集(test),占用字节数:9695976,样本数量:6
下载大小:3834297,数据集总大小:9695976
- 配置名称:turn-12
数据集划分:仅包含测试集(test),占用字节数:9609216,样本数量:6
下载大小:3771558,数据集总大小:9609216
- 配置名称:turn-13
数据集划分:仅包含测试集(test),占用字节数:9997952,样本数量:6
下载大小:3956491,数据集总大小:9997952
- 配置名称:turn-14
数据集划分:仅包含测试集(test),占用字节数:10228780,样本数量:6
下载大小:4007776,数据集总大小:10228780
- 配置名称:turn-15
数据集划分:仅包含测试集(test),占用字节数:9945301,样本数量:6
下载大小:3898428,数据集总大小:9945301
- 配置名称:turn-2
数据集划分:仅包含测试集(test),占用字节数:10455970,样本数量:6
下载大小:4147715,数据集总大小:10455970
- 配置名称:turn-3
数据集划分:仅包含测试集(test),占用字节数:10797994,样本数量:6
下载大小:4255714,数据集总大小:10797994
- 配置名称:turn-4
数据集划分:仅包含测试集(test),占用字节数:10514517,样本数量:6
下载大小:4146626,数据集总大小:10514517
- 配置名称:turn-5
数据集划分:仅包含测试集(test),占用字节数:9922310,样本数量:6
下载大小:3897607,数据集总大小:9922310
- 配置名称:turn-6
数据集划分:仅包含测试集(test),占用字节数:10544138,样本数量:6
下载大小:4140455,数据集总大小:10544138
- 配置名称:turn-7
数据集划分:仅包含测试集(test),占用字节数:10058152,样本数量:6
下载大小:3948005,数据集总大小:10058152
- 配置名称:turn-8
数据集划分:仅包含测试集(test),占用字节数:9951789,样本数量:6
下载大小:3893008,数据集总大小:9951789
- 配置名称:turn-9
数据集划分:仅包含测试集(test),占用字节数:10358491,样本数量:6
下载大小:4044947,数据集总大小:10358491
配置列表:
每个配置对应专属的数据文件路径,具体如下:
- 配置名称:turn-0,数据文件:
- 数据集划分:测试集,文件路径:turn-0/test-*
- 配置名称:turn-1,数据文件:
- 数据集划分:测试集,文件路径:turn-1/test-*
- 配置名称:turn-10,数据文件:
- 数据集划分:测试集,文件路径:turn-10/test-*
- 配置名称:turn-11,数据文件:
- 数据集划分:测试集,文件路径:turn-11/test-*
- 配置名称:turn-12,数据文件:
- 数据集划分:测试集,文件路径:turn-12/test-*
- 配置名称:turn-13,数据文件:
- 数据集划分:测试集,文件路径:turn-13/test-*
- 配置名称:turn-14,数据文件:
- 数据集划分:测试集,文件路径:turn-14/test-*
- 配置名称:turn-15,数据文件:
- 数据集划分:测试集,文件路径:turn-15/test-*
- 配置名称:turn-2,数据文件:
- 数据集划分:测试集,文件路径:turn-2/test-*
- 配置名称:turn-3,数据文件:
- 数据集划分:测试集,文件路径:turn-3/test-*
- 配置名称:turn-4,数据文件:
- 数据集划分:测试集,文件路径:turn-4/test-*
- 配置名称:turn-5,数据文件:
- 数据集划分:测试集,文件路径:turn-5/test-*
- 配置名称:turn-6,数据文件:
- 数据集划分:测试集,文件路径:turn-6/test-*
- 配置名称:turn-7,数据文件:
- 数据集划分:测试集,文件路径:turn-7/test-*
- 配置名称:turn-8,数据文件:
- 数据集划分:测试集,文件路径:turn-8/test-*
- 配置名称:turn-9,数据文件:
- 数据集划分:测试集,文件路径:turn-9/test-*
提供机构:
CohenQu



