cat-searcher/responses-gemma-1.1-2b-it-split-0-all

Name: cat-searcher/responses-gemma-1.1-2b-it-split-0-all
Creator: cat-searcher
Published: 2024-07-13 21:27:38
License: 暂无描述

Hugging Face2024-07-13 更新2024-07-13 收录

下载链接：

https://hf-mirror.com/datasets/cat-searcher/responses-gemma-1.1-2b-it-split-0-all

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个特征字段，包括prompt_id、prompt、generate_0到generate_4、probability和rm_scores等。数据集分为训练集和测试集，训练集包含6588个样本，测试集包含500个样本。数据集的下载大小为196243790字节，总大小为68390431字节。

The dataset includes multiple feature fields such as prompt_id, prompt, generate_0 to generate_4, probability, and rm_scores. The dataset is divided into a training set and a test set, with the training set containing 6588 samples and the test set containing 500 samples. The download size of the dataset is 196243790 bytes, and the total size is 68390431 bytes.

提供机构：

cat-searcher

原始信息汇总

数据集概述

数据集信息

特征

prompt_id: 字符串类型
prompt: 字符串类型
generate_0: 列表类型
- content: 字符串类型
- role: 字符串类型
generate_1: 列表类型
- content: 字符串类型
- role: 字符串类型
generate_2: 列表类型
- content: 字符串类型
- role: 字符串类型
generate_3: 列表类型
- content: 字符串类型
- role: 字符串类型
generate_4: 列表类型
- content: 字符串类型
- role: 字符串类型
probability: 序列类型，包含浮点数（float64）
rm_scores: 序列类型，包含浮点数（float32）

数据分割

train:
- 字节数: 63602902
- 样本数: 6588
test:
- 字节数: 4787529
- 样本数: 500

数据集大小

下载大小: 196243790 字节
数据集大小: 68390431 字节

配置

config_name: default
- data_files:
  - train: data/train-*
  - test: data/test-*

5,000+

优质数据集

54 个

任务类型

进入经典数据集