gswamy/pythia-1.4B-tldr-_rm_sft_tldr_pythia_1.4b_entail_l_2_iter_1
Source: Hugging Face. Updated 2025-01-27. Indexed 2025-02-15.
Download link:
https://hf-mirror.com/datasets/gswamy/pythia-1.4B-tldr-_rm_sft_tldr_pythia_1.4b_entail_l_2_iter_1
Description:
This dataset contains several sequence features related to query responses, together with their reward values: the best query response, the worst query response, the best mask, the worst mask, and the best and worst rewards. It currently has a single train split with 92,858 samples.
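As a rough illustration of the record layout described above, the following Python sketch models one best/worst pair and two operations one might run on it. The field names, the assumption that sequences are lists of token IDs with 0/1 masks, and the toy values are all hypothetical; the dataset's actual column names and types should be checked on its Hugging Face page.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class PreferencePair:
    """One record. Field names are illustrative, not the dataset's real columns."""
    best_query_response: List[int]   # assumed: token IDs of query + best response
    worst_query_response: List[int]  # assumed: token IDs of query + worst response
    best_mask: List[int]             # assumed: 0/1 mask aligned with best tokens
    worst_mask: List[int]            # assumed: 0/1 mask aligned with worst tokens
    best_reward: float
    worst_reward: float


def reward_margin(pair: PreferencePair) -> float:
    """Difference between the best and worst reward of a pair."""
    return pair.best_reward - pair.worst_reward


def masked_tokens(tokens: List[int], mask: List[int]) -> List[int]:
    """Keep only the tokens whose mask entry is non-zero."""
    if len(tokens) != len(mask):
        raise ValueError("tokens and mask must have the same length")
    return [t for t, m in zip(tokens, mask) if m]


# Toy record with made-up values, only to show the shape of the data.
example = PreferencePair(
    best_query_response=[11, 12, 13, 14],
    worst_query_response=[11, 12, 15, 16],
    best_mask=[0, 0, 1, 1],
    worst_mask=[0, 0, 1, 1],
    best_reward=1.5,
    worst_reward=-0.5,
)

margin = reward_margin(example)
kept = masked_tokens(example.best_query_response, example.best_mask)
```

Keeping each pair in one typed record makes it easy to check that every mask lines up with its sequence before the rewards are compared.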
Provider:
gswamy



