mytestdpo/llama3_orm
收藏Hugging Face2025-01-05 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/llama3_orm
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了索引、正确答案、提示信息、用户答案、预测答案、奖励标识、消息内容及其角色等信息。数据集分为训练集,提供了相应的字节大小和示例数量。
The dataset includes index, ground truth, prompt, users answer, predicted answer, reward flags, message content, and role information. The dataset is split into a training set, with the corresponding byte size and number of examples provided.
提供机构:
mytestdpo



