Asap7772/relabeled_alpacafarm_pythiasft_20K_preference_data_maxlength
收藏Hugging Face2024-01-30 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Asap7772/relabeled_alpacafarm_pythiasft_20K_preference_data_maxlength
下载链接
链接失效反馈官方服务:
资源简介:
---
configs:
- config_name: default
data_files:
- split: train
path: data/train-*
- split: test
path: data/test-*
dataset_info:
features:
- name: output
dtype: string
- name: text
dtype: string
- name: alpaca_text
dtype: string
- name: prompt
dtype: string
- name: alpaca_prompt
dtype: string
- name: y_ref
dtype: string
- name: y_1
dtype: string
- name: y_2
dtype: string
- name: y_w
dtype: string
- name: y_w_alpaca
dtype: string
- name: y_l
dtype: string
- name: y_l_alpaca
dtype: string
- name: y_w_score
dtype: float64
- name: y_l_score
dtype: float64
- name: score_diff
dtype: float64
splits:
- name: train
num_bytes: 177945579
num_examples: 19000
- name: test
num_bytes: 9378616
num_examples: 1000
download_size: 86089134
dataset_size: 187324195
---
# Dataset Card for "relabeled_alpacafarm_pythiasft_20K_preference_data_maxlength"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
提供机构:
Asap7772
原始信息汇总
数据集概述
数据集名称
relabeled_alpacafarm_pythiasft_20K_preference_data_maxlength
数据集配置
- 默认配置:default
数据文件
- 训练集(train):路径为
data/train-* - 测试集(test):路径为
data/test-*
数据集特征
output:字符串类型text:字符串类型alpaca_text:字符串类型prompt:字符串类型alpaca_prompt:字符串类型y_ref:字符串类型y_1:字符串类型y_2:字符串类型y_w:字符串类型y_w_alpaca:字符串类型y_l:字符串类型y_l_alpaca:字符串类型y_w_score:浮点数类型(float64)y_l_score:浮点数类型(float64)score_diff:浮点数类型(float64)
数据集分割
- 训练集(train):
- 字节数:177945579
- 样本数:19000
- 测试集(test):
- 字节数:9378616
- 样本数:1000
数据集大小
- 下载大小:86089134 字节
- 数据集大小:187324195 字节



