gupta-tanish/Ultrafeedback-llama3-8b-instruct-1vs3-optimal-selection
Source: Hugging Face | Updated: 2025-03-30 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/gupta-tanish/Ultrafeedback-llama3-8b-instruct-1vs3-optimal-selection
Description:
Each record contains a prompt ID, the prompt text, all generated responses with their reward scores, and the content for each role. The dataset is split into a training set and a test set; the size and number of examples are listed for each split.
Provider:
gupta-tanish
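
The fields and splits mentioned in the description can be checked directly after download. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is publicly reachable. The helper names `summarize_record` and `load_splits` are hypothetical and introduced only for illustration. Because this listing does not give the exact column or split names, the code prints whatever the dataset reports rather than assuming them.

```python
from typing import Any, Dict, Mapping

# Repository ID taken from the listing title above.
REPO_ID = "gupta-tanish/Ultrafeedback-llama3-8b-instruct-1vs3-optimal-selection"


def summarize_record(record: Mapping[str, Any]) -> Dict[str, str]:
    """Map each column name in a record to the Python type name of its value."""
    return {name: type(value).__name__ for name, value in record.items()}


def load_splits(repo_id: str = REPO_ID):
    """Download every split of the dataset from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the helper above works without the library installed.
    from datasets import load_dataset

    return load_dataset(repo_id)


if __name__ == "__main__":
    splits = load_splits()
    for split_name, split in splits.items():
        # Report the split name, its number of examples, and its columns.
        print(split_name, split.num_rows, split.column_names)
        # Show the value type of each column in the first record.
        print(summarize_record(split[0]))
```

Running the script lists each split with its example count and columns, which is a quick way to confirm which fields hold the prompt, the generated responses, and the reward scores.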



