1231czx/7b_gemma_kto_iter2

Name: 1231czx/7b_gemma_kto_iter2
Creator: 1231czx
Published: 2024-07-19 04:19:46
License: 暂无描述

Hugging Face2024-07-19 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/1231czx/7b_gemma_kto_iter2

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个字段，包括索引（idx）、问题（question）、真实推理过程（gt_cot）、真实答案（gt）、类型（type）、解决方案（solution）、自定义解决方案（my_solu）、代码（code）、预测（pred）和报告（report）。数据集主要用于训练模型，涉及问题的解答、代码生成和预测等任务。数据集被分割为训练集，包含287,479个样本，总大小为635,445,180字节。

This dataset includes multiple fields such as index (idx), question, ground truth reasoning (gt_cot), ground truth answer (gt), type, solution, custom solution (my_solu), code, prediction (pred), and report. It is primarily used for training models, involving tasks such as question answering, code generation, and prediction. The dataset is divided into a training set containing 287,479 samples, with a total size of 635,445,180 bytes.

提供机构：

1231czx

原始信息汇总

数据集概述

数据集信息

特征

idx: 整数类型
question: 字符串类型
gt_cot: 字符串类型
gt: 字符串类型
type: 字符串类型
solution: 字符串类型
my_solu: 字符串序列类型
code: 字符串序列类型
pred: 字符串序列类型
report: 字符串序列类型，值为null

数据分割

train:
- 样本数量: 287479
- 数据大小: 635445180 字节

数据集大小

下载大小: 272213592 字节
数据集总大小: 635445180 字节

配置

default:
- 数据文件路径: data/train-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集