feedbackagent/llama3_8b_math_test_with_rewards
收藏Hugging Face2024-12-10 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/feedbackagent/llama3_8b_math_test_with_rewards
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含445,000个训练样本,每个样本包含索引(idx)、真实值(gt)、我的解决方案(my_solu)、奖励(rewards,序列类型)和预测值(preds,序列类型)等特征。数据集总大小为804,189,679字节,下载大小为245,599,065字节。
The dataset contains 445,000 training samples, each with features including index (idx), ground truth (gt), my solution (my_solu), rewards (sequence type), and predictions (preds, sequence type). The total size of the dataset is 804,189,679 bytes, with a download size of 245,599,065 bytes.
提供机构:
feedbackagent



