gupta-tanish/Ultrafeedback-llama3-8b-instruct-1vs3-optimal-selection

Name: gupta-tanish/Ultrafeedback-llama3-8b-instruct-1vs3-optimal-selection
Creator: gupta-tanish
Published: 2025-03-30 14:09:35
License: 暂无描述

Hugging Face2025-03-30 更新2025-04-12 收录

下载链接：

https://hf-mirror.com/datasets/gupta-tanish/Ultrafeedback-llama3-8b-instruct-1vs3-optimal-selection

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含多个字段，如提示ID、提示文本、所有生成的响应及其奖励分数，以及不同角色的内容。数据集分为训练集和测试集两部分，提供了各自的大小和示例数量。

The dataset includes multiple fields such as prompt ID, prompt text, all generated responses and their reward scores, and content for different roles. The dataset is split into a training set and a test set, with each having its size and number of examples provided.

提供机构：

gupta-tanish

5,000+

优质数据集

54 个

任务类型

进入经典数据集