qfq/eidata_fidelity_cot_improvement_20241025_234849_iter1

Hugging Face2024-10-26 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/qfq/eidata_fidelity_cot_improvement_20241025_234849_iter1

下载链接

链接失效反馈

官方服务：

资源简介：

--- dataset_info: features: - name: doc_id dtype: int64 - name: doc struct: - name: orig_problem dtype: string - name: orig_solution dtype: string - name: orig_answer dtype: string - name: thinking_trajectory sequence: string - name: golden_thinking_trajectory sequence: string - name: model_solution dtype: string - name: old_trajectory sequence: string - name: labeled_trajectory sequence: string - name: text dtype: string - name: problem dtype: string - name: solution dtype: string - name: answer dtype: string - name: target dtype: string - name: arguments struct: - name: gen_args_0 struct: - name: arg_0 dtype: string - name: arg_1 struct: - name: until sequence: string - name: do_sample dtype: bool - name: temperature dtype: float64 - name: max_gen_toks dtype: int64 - name: resps sequence: sequence: string - name: filtered_resps sequence: string - name: doc_hash dtype: string - name: prompt_hash dtype: string - name: target_hash dtype: string - name: exact_match dtype: int64 - name: orig_problem dtype: string - name: orig_solution dtype: string - name: orig_answer dtype: string - name: thinking_trajectory sequence: string - name: golden_thinking_trajectory sequence: string - name: model_solution dtype: string - name: old_trajectory sequence: string - name: labeled_trajectory sequence: string - name: text dtype: string - name: problem dtype: string - name: solution dtype: string - name: answer dtype: string splits: - name: train num_bytes: 10216297 num_examples: 475 - name: test num_bytes: 525267 num_examples: 25 download_size: 5444363 dataset_size: 10741564 configs: - config_name: default data_files: - split: train path: data/train-* - split: test path: data/test-* ---

数据集信息：特征字段： - 字段名：doc_id，数据类型：64位整数类型 - 字段名：doc，数据类型：结构体，包含以下子字段： - 子字段：orig_problem，数据类型：字符串类型 - 子字段：orig_solution，数据类型：字符串类型 - 子字段：orig_answer，数据类型：字符串类型 - 子字段：thinking_trajectory，数据类型：字符串序列类型 - 子字段：golden_thinking_trajectory，数据类型：字符串序列类型 - 子字段：model_solution，数据类型：字符串类型 - 子字段：old_trajectory，数据类型：字符串序列类型 - 子字段：labeled_trajectory，数据类型：字符串序列类型 - 子字段：text，数据类型：字符串类型 - 子字段：problem，数据类型：字符串类型 - 子字段：solution，数据类型：字符串类型 - 子字段：answer，数据类型：字符串类型 - 字段名：target，数据类型：字符串类型 - 字段名：arguments，数据类型：结构体，包含以下子字段： - 子字段：gen_args_0，数据类型：结构体，包含以下子字段： - 子字段：arg_0，数据类型：字符串类型 - 子字段：arg_1，数据类型：结构体，包含以下子字段： - 子字段：until，数据类型：字符串序列类型 - 子字段：do_sample，数据类型：布尔类型 - 子字段：temperature，数据类型：浮点数类型 - 子字段：max_gen_toks，数据类型：64位整数类型 - 字段名：resps，数据类型：字符串序列的序列（二维字符串数组） - 字段名：filtered_resps，数据类型：字符串序列类型 - 字段名：doc_hash，数据类型：字符串类型 - 字段名：prompt_hash，数据类型：字符串类型 - 字段名：target_hash，数据类型：字符串类型 - 字段名：exact_match，数据类型：64位整数类型 - 字段名：orig_problem，数据类型：字符串类型 - 字段名：orig_solution，数据类型：字符串类型 - 字段名：orig_answer，数据类型：字符串类型 - 字段名：thinking_trajectory，数据类型：字符串序列类型 - 字段名：golden_thinking_trajectory，数据类型：字符串序列类型 - 字段名：model_solution，数据类型：字符串类型 - 字段名：old_trajectory，数据类型：字符串序列类型 - 字段名：labeled_trajectory，数据类型：字符串序列类型 - 字段名：text，数据类型：字符串类型 - 字段名：problem，数据类型：字符串类型 - 字段名：solution，数据类型：字符串类型 - 字段名：answer，数据类型：字符串类型划分集： - 划分名称：train，字节数：10216297，样本数：475 - 划分名称：test，字节数：525267，样本数：25 下载大小：5444363 数据集总大小：10741564 配置项： - 配置名称：default，数据文件：数据文件列表： - 对应划分：train，路径：data/train-* - 对应划分：test，路径：data/test-*

提供机构：

qfq

5,000+

优质数据集

54 个

任务类型

进入经典数据集