PJMixers/Capx_Agentic-DPO-V0.1-PreferenceShareGPT
收藏Hugging Face2024-05-30 更新2024-06-15 收录
下载链接:
https://hf-mirror.com/datasets/PJMixers/Capx_Agentic-DPO-V0.1-PreferenceShareGPT
下载链接
链接失效反馈官方服务:
资源简介:
---
tags:
- preference
- preferences
size_categories:
- 1K<n<10K
task_categories:
- reinforcement-learning
---
This dataset contains between 1K and 10K samples, focusing on user preferences and suitable for reinforcement learning tasks.
提供机构:
PJMixers
原始信息汇总
数据集标签
- 偏好
- 偏爱
数据集规模
- 1K<n<10K
任务类别
- 强化学习



