haoranli-ml/genvf-filtered-proof-graded
收藏Hugging Face2026-04-09 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/haoranli-ml/genvf-filtered-proof-graded
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: index
dtype: int64
- name: row_id
dtype: int64
- name: problem
dtype: string
- name: answer
dtype: string
- name: source
list: string
- name: mean_reward
dtype: float64
- name: full_response
dtype: string
- name: full_reasoning
dtype: string
- name: model
dtype: string
- name: prefix
dtype: string
- name: prefix_end_index
dtype: int64
- name: num_thoughts
dtype: int64
- name: prefix_type
dtype: string
- name: prefix_type_description
dtype: string
- name: suffix_num
list: int64
- name: suffix_model
list: string
- name: pending
list: bool
- name: pending_model
list: 'null'
- name: suffix_response
list: string
- name: suffix_summary
list: string
- name: self_summary
list: string
- name: suffix_reasoning
list: string
- name: finish_reason
list: string
- name: budget_used
list: int64
- name: escalation
list: int64
- name: usage
list:
- name: completion_tokens
dtype: int64
- name: prompt_tokens
dtype: int64
- name: total_tokens
dtype: int64
- name: error
list: 'null'
- name: error_type
list: 'null'
- name: prefix_model
dtype: string
- name: gemini_summary_of_future
dtype: string
- name: gemini_summary_list
list: string
- name: prefix_steps
list: string
- name: suffix_variants
list:
- name: detailed_steps
list: string
- name: high_level_steps
list: string
- name: id
dtype: int64
- name: dedup_note
dtype: string
- name: cross_prefix_alignment_scores
list:
- name: avg_alignment
dtype: float64
- name: individual_scores
list:
- name: compared_row_id
dtype: int64
- name: compared_summary_id
dtype: int64
- name: direction
dtype: string
- name: output_text
dtype: string
- name: problem_index
dtype: int64
- name: reasoning
dtype: string
- name: score
dtype: float64
- name: num_comparisons
dtype: int64
- name: summary_id
dtype: int64
- name: filtered_suffix
list:
- name: detailed_steps
list: string
- name: high_level_steps
list: string
- name: id
dtype: int64
- name: rubrics
dtype: string
- name: prefix_summary_steps
dtype: string
- name: filtered_suffix_summary_steps
list: string
- name: input_to_VF
dtype: string
- name: proof_scores
list:
- name: points
dtype: int64
- name: suffix_id
dtype: int64
- name: proof_details
list:
- name: assessment
dtype: string
- name: errors
dtype: string
- name: suffix_id
dtype: int64
splits:
- name: train
num_bytes: 578006709
num_examples: 1785
download_size: 225830949
dataset_size: 578006709
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
---
数据集信息:
特征字段:
- 字段名称:index,数据类型:64位整数(int64)
- 字段名称:row_id,数据类型:64位整数(int64)
- 字段名称:problem,数据类型:字符串(string)
- 字段名称:answer,数据类型:字符串(string)
- 字段名称:source,数据类型:字符串列表(list: string)
- 字段名称:mean_reward,数据类型:64位浮点数(float64)
- 字段名称:full_response,数据类型:字符串(string)
- 字段名称:full_reasoning,数据类型:字符串(string)
- 字段名称:model,数据类型:字符串(string)
- 字段名称:prefix,数据类型:字符串(string)
- 字段名称:prefix_end_index,数据类型:64位整数(int64)
- 字段名称:num_thoughts,数据类型:64位整数(int64)
- 字段名称:prefix_type,数据类型:字符串(string)
- 字段名称:prefix_type_description,数据类型:字符串(string)
- 字段名称:suffix_num,数据类型:64位整数列表(list: int64)
- 字段名称:suffix_model,数据类型:字符串列表(list: string)
- 字段名称:pending,数据类型:布尔值列表(list: bool)
- 字段名称:pending_model,数据类型:空值列表(list: 'null')
- 字段名称:suffix_response,数据类型:字符串列表(list: string)
- 字段名称:suffix_summary,数据类型:字符串列表(list: string)
- 字段名称:self_summary,数据类型:字符串列表(list: string)
- 字段名称:suffix_reasoning,数据类型:字符串列表(list: string)
- 字段名称:finish_reason,数据类型:字符串列表(list: string)
- 字段名称:budget_used,数据类型:64位整数列表(list: int64)
- 字段名称:escalation,数据类型:64位整数列表(list: int64)
- 字段名称:usage,数据类型:列表,包含:
- 补全令牌数(completion_tokens):64位整数(int64)
- 提示令牌数(prompt_tokens):64位整数(int64)
- 总令牌数(total_tokens):64位整数(int64)
- 字段名称:error,数据类型:空值列表(list: 'null')
- 字段名称:error_type,数据类型:空值列表(list: 'null')
- 字段名称:prefix_model,数据类型:字符串(string)
- 字段名称:gemini_summary_of_future,数据类型:字符串(string)
- 字段名称:gemini_summary_list,数据类型:字符串列表(list: string)
- 字段名称:prefix_steps,数据类型:字符串列表(list: string)
- 字段名称:suffix_variants,数据类型:列表,包含:
- 详细步骤:字符串列表(list: string)
- 高阶步骤:字符串列表(list: string)
- id:64位整数(int64)
- 字段名称:dedup_note,数据类型:字符串(string)
- 字段名称:cross_prefix_alignment_scores,数据类型:列表,包含:
- 平均对齐度(avg_alignment):64位浮点数(float64)
- 单个分数(individual_scores):列表,包含:
- 比对行ID(compared_row_id):64位整数(int64)
- 比对摘要ID(compared_summary_id):64位整数(int64)
- 比对方向(direction):字符串(string)
- 输出文本(output_text):字符串(string)
- 问题索引(problem_index):64位整数(int64)
- 推理过程(reasoning):字符串(string)
- 分数(score):64位浮点数(float64)
- 比对次数(num_comparisons):64位整数(int64)
- 摘要ID(summary_id):64位整数(int64)
- 字段名称:filtered_suffix,数据类型:列表,包含:
- 详细步骤:字符串列表(list: string)
- 高阶步骤:字符串列表(list: string)
- id:64位整数(int64)
- 字段名称:rubrics,数据类型:字符串(string)
- 字段名称:prefix_summary_steps,数据类型:字符串(string)
- 字段名称:filtered_suffix_summary_steps,数据类型:字符串列表(list: string)
- 字段名称:input_to_VF,数据类型:字符串(string)
- 字段名称:proof_scores,数据类型:列表,包含:
- 得分点(points):64位整数(int64)
- 后缀ID(suffix_id):64位整数(int64)
- 字段名称:proof_details,数据类型:列表,包含:
- 评估结果(assessment):字符串(string)
- 错误信息(errors):字符串(string)
- 后缀ID(suffix_id):64位整数(int64)
数据划分:
- 划分名称:train(训练集),数据字节数:578006709,样本数量:1785
下载大小:225830949字节
数据集总字节数:578006709
配置项:
- 配置名称:default(默认配置),数据文件:
- 划分:train,路径:data/train-*
提供机构:
haoranli-ml



