HuggingFaceH4/surge-pm-pilot
收藏Hugging Face2023-03-20 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/HuggingFaceH4/surge-pm-pilot
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
---
Pilot annotations for PM dataset that will be used for RLHF. The dataset used outputs from opensource models (https://huggingface.co/spaces/HuggingFaceH4/instruction-models-outputs) on a mix on Anthropic hh-rlhf (https://huggingface.co/datasets/HuggingFaceH4/hh-rlhf) dataset and Self-Instruct's seed (https://huggingface.co/datasets/HuggingFaceH4/self-instruct-seed) dataset.
提供机构:
HuggingFaceH4
原始信息汇总
数据集概述
数据集用途
- 用于RLHF(基于人类反馈的强化学习)的试点标注。
数据来源
- 数据集由开源模型(https://huggingface.co/spaces/HuggingFaceH4/instruction-models-outputs)的输出组成。
数据组成
- 结合了Anthropic的hh-rlhf数据集(https://huggingface.co/datasets/HuggingFaceH4/hh-rlhf)和Self-Instruct的种子数据集(https://huggingface.co/datasets/HuggingFaceH4/self-instruct-seed)。
许可证
- Apache-2.0许可证。



