mytestdpo/llama3_it_gsm8k_temp07_scaling
收藏Hugging Face2025-01-09 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/llama3_it_gsm8k_temp07_scaling
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含与某个任务相关的多个字段信息,如索引、真实标签、提示文本、答案、用户解决方案、预测结果、预测列表以及奖励信号。数据集分为训练集,共有494625个示例,总文件大小为约1GB。数据集还提供了一个默认配置,指定了训练数据的文件路径。
The dataset includes multiple fields related to a task, such as index, ground truth labels, prompt text, answers, user solutions, prediction results, list of predictions, and reward signals. The dataset is split into a training set with a total of 494,625 examples and an overall file size of approximately 1GB. The dataset also provides a default configuration specifying the file paths for the training data.
提供机构:
mytestdpo



