FrancoisFormation/project14-dpo
收藏Hugging Face2026-03-20 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/FrancoisFormation/project14-dpo
下载链接
链接失效反馈官方服务:
资源简介:
---
{}
---
---
language:
- en
license: mit
task_categories:
- text-generation
pretty_name: project14-dpo
tags:
- medical
- triage
- project14
- openclassrooms
- dpo
- preference-learning
- rlhf---
# project14-dpo
Direct Preference Optimisation dataset for a medical triage agent. ~1,000 prompt/chosen/rejected pairs sourced from UltraMedical-Preference. Human-annotated pairs prioritised during undersampling. PII anonymised with Presidio (RGPD compliant).
提供机构:
FrancoisFormation



