mytestdpo/llama3_star_plus_8b_gsm8k_kumar_baselinetmp0
收藏Hugging Face2025-01-03 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/llama3_star_plus_8b_gsm8k_kumar_baselinetmp0
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含多个字段的数据集,其中包括索引、真实标签、提示信息、答案、解决方案、预测结果和奖励。数据集被划分为训练集,共有3957个样本,总大小为13331477字节。数据集支持默认配置,并指定了训练集的数据文件路径。
This dataset includes multiple fields such as index, ground truth labels, prompt information, answer, solution, prediction results, and rewards. The dataset is split into a training set with 3957 samples and a total size of 13331477 bytes. The dataset supports a default configuration and specifies the data file path for the training set.
提供机构:
mytestdpo



