selfcorrexp2/Non-delete-ORM-Llama3-tmp10-generation
Source: Hugging Face. Updated: 2024-12-29. Indexed: 2025-04-26.
Download link:
https://hf-mirror.com/datasets/selfcorrexp2/Non-delete-ORM-Llama3-tmp10-generation
Description:
The dataset consists of a single training split. Each record has the following fields: index, prompt, answer, conversation (content and role), reward, ground truth, and a boolean reward. It is intended mainly for model training and may be relevant to dialogue generation and evaluation.
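As a rough illustration of the record layout described above, here is a minimal Python sketch. The description lists the fields only in prose, so every identifier below (`Turn`, `Record`, `idx`, `conversations`, `gt`, `reward_bool`), the numeric type of `reward`, and the helper `boolean_reward_rate` are assumptions made for this example and are not taken from the dataset itself.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Turn:
    """One conversation entry. Field names are hypothetical."""
    content: str  # text of the turn
    role: str     # speaker role, e.g. "user" or "assistant" (assumed values)


@dataclass
class Record:
    """One training example. All field names and types are assumptions."""
    idx: int                   # index
    prompt: str                # prompt
    answer: str                # answer
    conversations: List[Turn]  # conversation: content and role per entry
    reward: float              # reward (assumed to be numeric)
    gt: str                    # ground truth
    reward_bool: bool          # boolean reward


def boolean_reward_rate(records: List[Record]) -> float:
    """Return the fraction of records whose boolean reward is True."""
    if not records:
        return 0.0
    return sum(1 for r in records if r.reward_bool) / len(records)


# A made-up record, for illustration only (not real dataset content).
example = Record(
    idx=0,
    prompt="What is 2 + 2?",
    answer="4",
    conversations=[
        Turn(content="What is 2 + 2?", role="user"),
        Turn(content="2 + 2 = 4", role="assistant"),
    ],
    reward=1.0,
    gt="4",
    reward_bool=True,
)
```

The actual column names and types should be checked against the dataset viewer on the hosting page before relying on this layout.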
Provider:
selfcorrexp2



