gswamy/pythia-1.4B-tldr-_rm_sft_tldr_pythia_1.4b_entail_l_1_iter_1
收藏Hugging Face2025-01-27 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/gswamy/pythia-1.4B-tldr-_rm_sft_tldr_pythia_1.4b_entail_l_1_iter_1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,主要用于记录查询响应及其对应的最佳和最差情况。具体字段包括:最佳查询响应、最差查询响应、最佳掩码、最差掩码、最佳奖励和最差奖励。数据集仅包含一个训练集(train),共有92858个示例,大小为1.68 GB。数据集下载大小为37 MB。
The dataset includes multiple fields primarily used to record query responses and their corresponding best and worst cases. Specific fields include: best query response, worst query response, best mask, worst mask, best reward, and worst reward. The dataset contains only one training set (train) with a total of 92,858 examples, totaling 1.68 GB in size. The download size of the dataset is 37 MB.
提供机构:
gswamy



