gupta-tanish/Ultrafeedback-llama3-8b-Instruct-and-Gemma-9b-it-optimal-selection-kvsk
收藏Hugging Face2025-02-10 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/gupta-tanish/Ultrafeedback-llama3-8b-Instruct-and-Gemma-9b-it-optimal-selection-kvsk
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含对话生成及其评估的数据集,数据集包含了提示信息(prompt)、生成的所有响应(all_generated_responses)、每个响应的评分(all_reward_scores)和平均评分(mean_reward)等字段。此外,数据集中还包括了8个不同角色(A0至A7)的对话内容(content)和角色信息(role),以及对应的评分(score)和概率(prob)。数据集分为训练集(train_prefs)和测试集(test_prefs),分别用于训练和测试模型。
This dataset contains dialogue generation and its evaluation, including fields such as prompt, all_generated_responses, all_reward_scores for each response, and mean_reward. In addition, the dataset includes dialogue content and role information for 8 different roles (A0 to A7), along with corresponding scores and probabilities. The dataset is divided into a training set (train_prefs) and a test set (test_prefs) for model training and testing.
提供机构:
gupta-tanish



