mytestdpo/type12_7ktype3_8ktype4_llama3it_gsm8k
Source: Hugging Face. Updated 2025-01-16; indexed 2025-04-26.
Download link:
https://hf-mirror.com/datasets/mytestdpo/type12_7ktype3_8ktype4_llama3it_gsm8k
Description:
The dataset has seven fields: chosen text (chosen_txt), rejected text (rejected_txt), ground truth (gt), chosen label (chosen), rejected label (rejected), prompt (prompt), and margin (margin). It provides a training split with 39,869 examples, about 256 MB in total. The README does not describe the dataset's content or intended use.
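Because the README gives no usage notes, a quick schema check after download may be useful. The following is a minimal sketch, not an official loader: it assumes the Hugging Face `datasets` library is installed and that the repository ID matches the listing title. The helper names `missing_fields` and `load_train_split` are hypothetical, and the field types are not documented, so only field names are checked.

```python
# Field names as given in the listing's description; their types are not documented.
EXPECTED_FIELDS = {
    "chosen_txt", "rejected_txt", "gt",
    "chosen", "rejected", "prompt", "margin",
}


def missing_fields(record):
    """Return the sorted names of expected fields that are absent from a record."""
    return sorted(EXPECTED_FIELDS - set(record))


def load_train_split(repo_id="mytestdpo/type12_7ktype3_8ktype4_llama3it_gsm8k"):
    """Load the training split from the Hugging Face Hub.

    Assumption: needs the `datasets` package and network access. The import is
    kept inside the function so the schema helper works without the library.
    """
    from datasets import load_dataset
    return load_dataset(repo_id, split="train")


# Usage (downloads roughly 256 MB according to the listing):
#   ds = load_train_split()
#   print(ds.num_rows)            # the listing reports 39,869 examples
#   print(missing_fields(ds[0]))  # an empty list means all seven fields are present
```

If the official Hub is unreachable, the `datasets` library can typically be pointed at a mirror through the `HF_ENDPOINT` environment variable; whether that is needed depends on the local network.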
Provider:
mytestdpo



