OpenLLM-Ro/ro_dpo_helpsteer2
收藏Hugging Face2025-04-22 更新2025-05-31 收录
下载链接:
https://hf-mirror.com/datasets/OpenLLM-Ro/ro_dpo_helpsteer2
下载链接
链接失效反馈官方服务:
资源简介:
HelpSteer2数据集包含10k条人类注释的偏好条目。我们提供了使用GPT-4o mini翻译的HelpSteer2数据集的罗马尼亚语版本。这个数据集是针对罗马尼亚语言模型提出的对齐协议的下一步,该协议在论文《" Vorbești Românește?" A Recipe to Train Powerful Romanian LLMs with English Instructions》中有详细介绍。
The HelpSteer2 dataset contains 10k human-annotated preference entries. Here we provide the Romanian translation of the HelpSteer2 dataset, translated with GPT-4o mini. This dataset is a next step of the alignment protocol for Romanian LLMs proposed in the paper Vorbești Românește? A Recipe to Train Powerful Romanian LLMs with English Instructions.
提供机构:
OpenLLM-Ro



