1231czx/9B_iter3_gemma_N1_random_pair_iter4prompt

Name: 1231czx/9B_iter3_gemma_N1_random_pair_iter4prompt
Creator: 1231czx
Published: 2024-07-16 03:54:03
License: 暂无描述

Hugging Face2024-07-16 更新2024-07-22 收录

下载链接：

https://hf-mirror.com/datasets/1231czx/9B_iter3_gemma_N1_random_pair_iter4prompt

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含四个主要特征：gt（字符串类型）、rej（字符串序列）、chosen（包含content和role两个字段的列表）和rejected（同样包含content和role两个字段的列表）。数据集仅包含一个训练集（train），大小为37557488字节，包含16530个样本。下载大小为14169933字节，数据集总大小为37557488字节。

The dataset contains four main features: gt (string type), rej (sequence of strings), chosen (a list containing content and role fields), and rejected (also a list containing content and role fields). The dataset includes only a training set (train) with a size of 37557488 bytes, containing 16530 examples. The download size is 14169933 bytes, and the total dataset size is 37557488 bytes.

提供机构：

1231czx

原始信息汇总

数据集概述

数据集信息

特征

gt: 数据类型为字符串。
rej: 序列类型为字符串。
chosen: 包含以下子特征：
- content: 数据类型为字符串。
- role: 数据类型为字符串。
rejected: 包含以下子特征：
- content: 数据类型为字符串。
- role: 数据类型为字符串。

数据分割

train: 包含16530个样本，总字节数为37557488。

数据集大小

下载大小: 14169933字节。
数据集大小: 37557488字节。

配置

default: 包含训练数据文件，路径为data/train-*。

5,000+

优质数据集

54 个

任务类型

进入经典数据集