mytestdpo/type12_7ktype3_7ktype4_llama3it_gsm8k
收藏Hugging Face2025-01-16 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/type12_7ktype3_7ktype4_llama3it_gsm8k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了文本选择任务的相关数据,包括被选择的文本(chosen_txt)、被拒绝的文本(rejected_txt)、真实标签(gt)、选择的文本标识(chosen)、拒绝的文本标识(rejected)、提示文本(prompt)以及一个表示差异的浮点数(margin)。数据集分为训练集(train),共有38479个示例。
The dataset contains data related to text selection tasks, including chosen text (chosen_txt), rejected text (rejected_txt), ground truth labels (gt), chosen text identifier (chosen), rejected text identifier (rejected), prompt text (prompt), and a floating-point number representing the difference (margin). The dataset is split into a training set (train) with a total of 38,479 examples.
提供机构:
mytestdpo



