Asap7772/metamath-rl-hint-topk-4
收藏Hugging Face2025-03-20 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Asap7772/metamath-rl-hint-topk-4
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含对话数据的数据集,其中包括了对话的来源、提示信息(包括内容和角色)、对话者的能力、奖励模型(包括真实情况和风格)以及额外的信息(包括索引和分割信息)。数据集分为训练集和测试集,提供了相应的数据文件路径。
This is a dataset containing conversational data, which includes the source of the conversation, prompt information (including content and role), the ability of the conversationalist, the reward model (including ground truth and style), and additional information (including index and split information). The dataset is divided into training and test sets, and provides the corresponding data file paths.
提供机构:
Asap7772



