mytestdpo/type12_7ktype3_5ktype4_llama3it_gsm8k
收藏Hugging Face2025-01-16 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/type12_7ktype3_5ktype4_llama3it_gsm8k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了文本对和标签信息,其中有选中的文本(chosen_txt)、被拒绝的文本(rejected_txt)、地面真实标签(gt)、选择的标签(chosen)、拒绝的标签(rejected)、提示文本(prompt)以及一个表示差异的浮点数值(margin)。数据集划分为训练集(train),共36479个样本,总大小约为234736786.31字节。但README中并未提供数据集的具体描述,因此无法给出详细的中文描述。
The dataset consists of text pairs and label information, including chosen text (chosen_txt), rejected text (rejected_txt), ground truth label (gt), chosen label (chosen), rejected label (rejected), prompt text (prompt), and a floating-point value (margin) representing the difference. The dataset is split into a training set (train) with a total of 36,479 samples and a total size of approximately 234,736,786.31 bytes. However, the README does not provide a specific description of the dataset, so no detailed English description can be given.
提供机构:
mytestdpo



