allenai/tulu-3-IF-augmented-on-policy-8b

Name: allenai/tulu-3-IF-augmented-on-policy-8b
Creator: allenai
Published: 2024-11-21 16:51:08
License: 暂无描述

Hugging Face2024-11-21 更新2024-12-14 收录

下载链接：

https://hf-mirror.com/datasets/allenai/tulu-3-IF-augmented-on-policy-8b

下载链接

链接失效反馈

官方服务：

资源简介：

Llama 3.1 Tulu 3 IF-Augmented数据集是一个偏好数据集，包含了65,530个生成对。这些生成对来自于多个不同的模型，包括Mistral、Tulu、Yi、MPT、Google Gemma、InternLM、Falcon、Qwen、Llama、GPT-4和Claude等。生成方法结合了on-policy和off-policy数据，并使用Ultrafeedback模板和LLM judge进行偏好标注。数据集主要用于研究和教育用途，遵循Ai2的负责任使用指南。

This preference dataset is part of our Tulu 3 preference mixture. It contains 65,530 generation pairs obtained using various models including Mistral, Tulu, Yi, MPT, Google Gemma, InternLM, Falcon, Qwen, Llama, GPT-4, and Claude. The dataset was generated using a synthetic pipeline combining both on-policy and off-policy data, and preference annotations were obtained on four different aspects using the Ultrafeedback template and an LLM judge. The dataset is licensed under ODC-BY and is intended for research and educational use.

提供机构：

allenai

5,000+

优质数据集

54 个

任务类型

进入经典数据集