gswamy/pythia-1.4B-tldr-sft_tldr_pythia_1.4b_rm_sft_tldr_pythia_1.4b_3_iter_1
Source: Hugging Face. Updated: 2024-12-31. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/gswamy/pythia-1.4B-tldr-sft_tldr_pythia_1.4b_rm_sft_tldr_pythia_1.4b_3_iter_1
Description:
The dataset contains the best and worst query responses found during iteration, along with their masks and the corresponding reward values. It has a single training split of 209,580 examples, totaling 3,802,619,520 bytes.
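As a quick sanity check on the figures in the description, the short sketch below divides the reported total size by the reported example count. The variable names are illustrative only and are not taken from the dataset. The division comes out even, which would be consistent with fixed-size records (for example, padded arrays of equal length), though that is an inference from the arithmetic and not something the listing states.

```python
# Figures quoted in the dataset description.
NUM_EXAMPLES = 209_580
TOTAL_BYTES = 3_802_619_520

# Average record size; a zero remainder means every example could
# occupy the same number of bytes (an inference, not a documented fact).
bytes_per_example, remainder = divmod(TOTAL_BYTES, NUM_EXAMPLES)

# Total size in binary gigabytes, useful for estimating disk needs.
size_gib = TOTAL_BYTES / 2**30

print(bytes_per_example, remainder)  # 18144 0
print(f"{size_gib:.2f} GiB")         # 3.54 GiB
```

At roughly 3.5 GiB, the training split fits on most local disks, so downloading it in full is a reasonable default.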
Provider:
gswamy



