gswamy/pythia-1.4B-tldr-sftaug-pair-iter-2
收藏Hugging Face2024-12-10 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/gswamy/pythia-1.4B-tldr-sftaug-pair-iter-2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含迭代过程中的最佳和最差查询响应、掩码以及奖励值。特征包括iter_2_best_query_response、iter_2_worst_query_response、iter_2_best_mask、iter_2_worst_mask、iter_2_best_reward和iter_2_worst_reward。数据集分为一个训练集,包含209,580个样本,总大小为3,802,619,520字节。下载大小为232,917,014字节。
The dataset includes various features such as best and worst query responses, masks, and reward values associated with different iteration counts. The dataset is divided into a training set containing 209580 samples. The download size of the dataset is 232917014 bytes, and the total size is 3802619520 bytes.
提供机构:
gswamy



