cornfieldrm/pair-preference-dataset-700K_subset-2-of-4_gemma-2b_1of4_iter3_conf-0.9_bs128_lr1e-5_conf-0.9
收藏Hugging Face2024-05-31 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/cornfieldrm/pair-preference-dataset-700K_subset-2-of-4_gemma-2b_1of4_iter3_conf-0.9_bs128_lr1e-5_conf-0.9
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如被拒绝的内容及其分数、选择的内容及其分数、消息内容及其角色等。数据集分为一个训练集,包含45241个例子,总大小为307678530.55416816字节。下载大小为202028192字节。
This dataset is primarily used for evaluating dialogue systems, containing rejected and chosen dialogue content along with their scores, as well as message content and roles. The dataset is divided into a training set, which is 307678530.55416816 bytes in size and contains 45241 samples.
提供机构:
cornfieldrm
原始信息汇总
数据集概述
数据集特征
-
rejected
- content: 数据类型为字符串
- role: 数据类型为字符串
-
rejected_score: 数据类型为浮点数
-
chosen_score: 数据类型为浮点数
-
chosen
- content: 数据类型为字符串
- role: 数据类型为字符串
-
length: 数据类型为整数
-
chosen_prob: 数据类型为浮点数
-
messages
- content: 数据类型为字符串
- role: 数据类型为字符串
数据集划分
- train
- num_bytes: 307678530.55416816
- num_examples: 45241
数据集大小
- download_size: 202028192
- dataset_size: 307678530.55416816



