ministack-preferences
Source: ModelScope community (魔搭社区) · Updated 2025-10-09 · Indexed 2025-03-22
Download link:
https://modelscope.cn/datasets/mlabonne/ministack-preferences
Description:
# Ministack-preferences
A subset of the [`lvwerra/stack-exchange-paired`](https://huggingface.co/datasets/lvwerra/stack-exchange-paired) dataset, with 1,000 training samples and 1,000 test samples. The original dataset is large and slow to process, so this subset is meant to let you try reinforcement learning from human feedback (RLHF) more quickly.
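For a quick RLHF experiment, paired records are usually reshaped into prompt/chosen/rejected triples. The sketch below is a minimal illustration, not part of the dataset itself. It assumes each record carries the `question`, `response_j` (preferred) and `response_k` (less preferred) fields of the parent `lvwerra/stack-exchange-paired` dataset; this subset's column names are not stated here, so check them before relying on it. The helper name `to_preference_pair`, the prompt template, and the sample record are all hypothetical.

```python
from typing import Dict


def to_preference_pair(record: Dict[str, str]) -> Dict[str, str]:
    """Map one paired record to prompt/chosen/rejected fields.

    Assumes the parent dataset's column names: `question`,
    `response_j` (preferred) and `response_k` (less preferred).
    """
    return {
        # The prompt template is an arbitrary choice for illustration.
        "prompt": "Question: " + record["question"] + "\n\nAnswer: ",
        "chosen": record["response_j"],
        "rejected": record["response_k"],
    }


# Hypothetical record, written for this example and not taken from the dataset.
example = {
    "question": "How do I reverse a list in Python?",
    "response_j": "Use reversed(xs) or xs[::-1].",
    "response_k": "You cannot reverse a list.",
}

pair = to_preference_pair(example)
print(pair["prompt"] + pair["chosen"])
```

The same function can be applied to every row of the training and test splits once they are loaded, for example with a `map` call in whichever dataset library you use.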
Provider:
maas
Created:
2025-03-18



