trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
收藏Hugging Face2025-01-08 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
下载链接
链接失效反馈官方服务:
资源简介:
UltraFeedback GPT-3.5-Turbo帮助性数据集是一个从openbmb/UltraFeedback数据集衍生出的处理过的用户助手交互数据集,它筛选出了具有帮助性的交互。该数据集适用于微调和评估模型在任务对齐方面的表现。数据集以对话格式存储,包括问题或指令、模型响应以及表示响应帮助性的二进制标签。
The UltraFeedback GPT-3.5-Turbo Helpfulness Dataset is a processed user-assistant interaction dataset derived from the openbmb/UltraFeedback dataset, filtered for helpfulness. It is intended for fine-tuning and evaluating models in terms of task alignment. The dataset is stored in a conversational format, including an input question or instruction, the models response, and a binary label indicating the helpfulness of the response.
提供机构:
trl-lib



