HuggingFaceH4/helpful-anthropic-raw
收藏Hugging Face2023-02-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/HuggingFaceH4/helpful-anthropic-raw
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: instruction
dtype: string
- name: demonstration
dtype: string
splits:
- name: train
num_bytes: 34540085.04363476
num_examples: 65499
download_size: 0
dataset_size: 34540085.04363476
license: mit
pretty_name: Helpful Raw Anthropic
tags:
- human-feedback
---
# Dataset Card for "helpful-raw-anthropic"
This is a dataset derived from Anthropic's [HH-RLHF data](https://huggingface.co/datasets/Anthropic/hh-rlhf) of instructions and model-generated demonstrations. We combined training splits from the following two subsets:
* `helpful-base`
* `helpful-online`
To convert the multi-turn dialogues into `(instruction, demonstration)` pairs, just the first response from the Assistant was included. This heuristic captures the most obvious answers, but overlooks more complex questions where multiple turns were required to get a helpful response. Some additional filtering is likely required (e.g. defining a minimun length or computing ROUGE-L scores with the instruction/demonstration).
提供机构:
HuggingFaceH4
原始信息汇总
数据集概述
数据集名称
- pretty_name: Helpful Raw Anthropic
数据集特征
- features:
- name: instruction
- dtype: string
- name: demonstration
- dtype: string
- name: instruction
数据集划分
- splits:
- name: train
- num_examples: 65499
- num_bytes: 34540085.04363476
- name: train
数据集大小
- dataset_size: 34540085.04363476
许可证
- license: mit
标签
- tags: human-feedback



