mytestdpo/GSM8K-w2c74.5K-c175K-c2c40K-3eptmp07
收藏Hugging Face2025-01-05 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/GSM8K-w2c74.5K-c175K-c2c40K-3eptmp07
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含多个字段的数据集,其中包括索引(idx),真实标签(gt),提示(prompt),答案(answer),用户解决方案(my_solu),预测(pred),奖励(rewards)和对话轮次(turn)。此外,还有一个消息列表(messages),其中包含内容(content)和角色(role)。数据集分为训练集(train),共有2638个样本,大小为10104938字节。提供了默认配置,其中指定了训练集的数据文件路径。
This dataset contains multiple fields including index (idx), ground truth (gt), prompt, answer, users solution (my_solu), prediction (pred), reward (rewards), and turn of conversation. Additionally, there is a list of messages that includes content and role. The dataset is split into a training set (train) with a total of 2638 examples and a size of 10104938 bytes. A default configuration is provided, specifying the path to the data files for the training set.
提供机构:
mytestdpo



