five

gswamy/pythia-1.4B-tldr-sft_tldr_pythia_1.4b_rm_sft_tldr_pythia_1.4b_2_iter_1

收藏
Hugging Face2024-12-31 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/gswamy/pythia-1.4B-tldr-sft_tldr_pythia_1.4b_rm_sft_tldr_pythia_1.4b_2_iter_1
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含多个特征字段,主要用于记录迭代过程中的最佳和最差查询响应、最佳和最差掩码以及相应的奖励。数据集仅包含训练集部分,共有209,580个示例,大小为3.8GB。数据集配置信息中提供了训练集的数据文件路径。

The dataset includes multiple feature fields, mainly used to record the best and worst query responses, best and worst masks, and corresponding rewards during iteration. The dataset contains only the training set, with a total of 209,580 examples and a size of 3.8GB. The dataset configuration information provides the data file path for the training set.
提供机构:
gswamy
二维码
社区交流群
二维码
科研交流群
商业服务