allenai/tulu-3-wildchat-if-on-policy-70b
收藏Hugging Face2024-11-21 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/allenai/tulu-3-wildchat-if-on-policy-70b
下载链接
链接失效反馈官方服务:
资源简介:
Llama 3.1 Tulu 3 Wildchat IF数据集是一个偏好数据集,包含来自WildChat的提示和10,792对生成对。这些生成对是通过多个模型生成的,包括Mistral、Tulu、Yi、MPT、Google Gemma、InternLM、Falcon、Qwen、Llama、GPT-4和Claude等。数据集采用合成管道生成完成和偏好,并使用Ultrafeedback模板和LLM判断进行偏好注释。数据集遵循ODC-BY许可,适用于研究和教育用途。
This preference dataset is part of our Tulu 3 preference mixture: it contains prompts from WildChat, which include constraints, and it contains 10,792 generation pairs (some of which on-policy from allenai/Llama-3.1-Tulu-3-70B) obtained using various models including Mistral, Tulu, Yi, MPT, Google Gemma, InternLM, Falcon, Qwen, Llama, and GPT-4. The generation process combines on-policy and off-policy data, and preference annotations are obtained using the Ultrafeedback template and an LLM judge. The dataset is licensed under ODC-BY, intended for research and educational use.
提供机构:
allenai



