Asap7772/relabeled_alpacafarm_pythiasft_20K_preference_data
Hugging Face · Updated 2024-01-25 · Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/Asap7772/relabeled_alpacafarm_pythiasft_20K_preference_data
Resource description:
---
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: test
    path: data/test-*
dataset_info:
  features:
  - name: output
    dtype: string
  - name: text
    dtype: string
  - name: alpaca_text
    dtype: string
  - name: prompt
    dtype: string
  - name: alpaca_prompt
    dtype: string
  - name: y_ref
    dtype: string
  - name: y_1
    dtype: string
  - name: y_2
    dtype: string
  - name: y_w
    dtype: string
  - name: y_w_alpaca
    dtype: string
  - name: y_l
    dtype: string
  - name: y_l_alpaca
    dtype: string
  - name: y_w_score
    dtype: float64
  - name: y_l_score
    dtype: float64
  - name: score_diff
    dtype: float64
  splits:
  - name: train
    num_bytes: 177945579
    num_examples: 19000
  - name: test
    num_bytes: 9378616
    num_examples: 1000
  download_size: 86095242
  dataset_size: 187324195
---
# Dataset Card for "relabeled_alpacafarm_pythiasft_20K_preference_data"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
Provider:
Asap7772
Summary of original information
Dataset overview
Dataset configuration
- Default configuration:
  - Training set: path `data/train-*`
  - Test set: path `data/test-*`
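
The configuration above can be turned into a loading call. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is reachable from your network. The helper name `load_preference_splits` is hypothetical and introduced only for illustration; `load_dataset` with a `split` argument is the library's standard entry point.

```python
REPO_ID = "Asap7772/relabeled_alpacafarm_pythiasft_20K_preference_data"
SPLITS = ("train", "test")  # the two splits listed in the default configuration


def load_preference_splits(repo_id: str = REPO_ID) -> dict:
    """Download (or reuse the local cache of) each split and return them by name.

    Hypothetical helper: not part of the dataset card or of `datasets`.
    """
    # Imported lazily so this module can be imported without `datasets` installed.
    from datasets import load_dataset

    return {split: load_dataset(repo_id, split=split) for split in SPLITS}
```

Calling `load_preference_splits()["train"]` should return a `Dataset` object. If the card metadata is accurate, its length would be 19000.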
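Data features
- Feature names and data types:
  - output: string
  - text: string
  - alpaca_text: string
  - prompt: string
  - alpaca_prompt: string
  - y_ref: string
  - y_1: string
  - y_2: string
  - y_w: string
  - y_w_alpaca: string
  - y_l: string
  - y_l_alpaca: string
  - y_w_score: float (float64)
  - y_l_score: float (float64)
  - score_diff: float (float64)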
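
The feature names suggest a pairwise preference layout, although the card does not document it. The sketch below rests on two loudly stated assumptions: that `y_w` is the preferred ("winning") response and `y_l` the dispreferred ("losing") one, and that a score margin can be recomputed as `y_w_score - y_l_score`. The `PreferencePair` type, the `to_preference_pair` helper, and the example row are all hypothetical.

```python
from typing import TypedDict


class PreferencePair(TypedDict):
    prompt: str
    chosen: str
    rejected: str
    margin: float


def to_preference_pair(row: dict) -> PreferencePair:
    """Map one dataset row to a (prompt, chosen, rejected, margin) record.

    ASSUMPTION: y_w is the preferred response and y_l the dispreferred one.
    """
    return PreferencePair(
        prompt=row["prompt"],
        chosen=row["y_w"],
        rejected=row["y_l"],
        # Recomputed from the two scores instead of trusting `score_diff`,
        # whose exact definition the card does not give.
        margin=row["y_w_score"] - row["y_l_score"],
    )


# Made-up row for illustration only; not taken from the dataset.
example_row = {
    "prompt": "Name a primary color.",
    "y_w": "Red.",
    "y_l": "Purple.",
    "y_w_score": 1.5,
    "y_l_score": 0.25,
}
pair = to_preference_pair(example_row)
print(pair["chosen"], pair["margin"])
```

Before relying on this mapping, it would be worth comparing the recomputed margin against the stored `score_diff` column on real rows to confirm the assumed definition.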
Dataset splits
- Training set:
  - Bytes: 177945579
  - Examples: 19000
- Test set:
  - Bytes: 9378616
  - Examples: 1000
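
As a quick sanity check on the split figures above, the arithmetic can be sketched as follows. It uses only the numbers stated in the metadata; the variable names are illustrative.

```python
SPLITS = {
    "train": {"num_bytes": 177945579, "num_examples": 19000},
    "test": {"num_bytes": 9378616, "num_examples": 1000},
}

total_examples = sum(s["num_examples"] for s in SPLITS.values())
total_bytes = sum(s["num_bytes"] for s in SPLITS.values())

for name, s in SPLITS.items():
    share = s["num_examples"] / total_examples   # fraction of all examples
    avg = s["num_bytes"] / s["num_examples"]     # mean bytes per example
    print(f"{name}: {share:.0%} of examples, ~{avg:,.0f} bytes per example")

print(f"total: {total_examples} examples, {total_bytes} bytes")
```

The totals come to 20000 examples (a 95% / 5% train/test split, matching the "20K" in the dataset name) and 187324195 bytes, which agrees with the `dataset_size` value in the metadata.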
Dataset size
- Download size: 86095242 bytes
- Dataset size: 187324195 bytes
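
The two size figures can be put into more readable units with a short sketch. The interpretation in the comments is an assumption: on Hugging Face, the download size usually refers to the stored (typically compressed) files and the dataset size to the data once loaded, but this card does not say so explicitly.

```python
DOWNLOAD_SIZE = 86095242   # bytes, as reported in the metadata
DATASET_SIZE = 187324195   # bytes, as reported in the metadata


def to_mib(num_bytes: int) -> float:
    """Convert a byte count to mebibytes (1 MiB = 1024 * 1024 bytes)."""
    return num_bytes / (1024 ** 2)


# ASSUMPTION: the ratio reflects on-disk compression of the stored files.
ratio = DOWNLOAD_SIZE / DATASET_SIZE

print(f"download: {to_mib(DOWNLOAD_SIZE):.1f} MiB")
print(f"dataset:  {to_mib(DATASET_SIZE):.1f} MiB")
print(f"download/dataset ratio: {ratio:.2f}")
```

This gives roughly 82.1 MiB to download against about 178.6 MiB of data, a ratio of about 0.46.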



