selfcorrexp2/llama_sft_dpo_gen2_70b_aug1
Source: Hugging Face. Updated 2025-01-19; indexed 2025-04-26.
Download link:
https://hf-mirror.com/datasets/selfcorrexp2/llama_sft_dpo_gen2_70b_aug1
Description:
The dataset contains the following fields: index (idx), ground truth (gt), prompt (prompt), result (res), prediction (predic), user solution (my_solu), and model prediction (pred). It has a single training split (train) with 14574 examples, totaling 101288638 bytes. A default configuration is also provided, which specifies the path to the training data files.
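The field list and split described above can be checked after loading. The following is a minimal sketch, assuming the Hugging Face `datasets` library is installed and the repository is reachable; the helper names `missing_fields` and `load_train_split` are hypothetical and chosen only for illustration, and the field types are not stated in this listing, so the sketch checks names only.

```python
# Field names and row count as given in the description above.
EXPECTED_FIELDS = ("idx", "gt", "prompt", "res", "predic", "my_solu", "pred")
EXPECTED_TRAIN_ROWS = 14574


def missing_fields(record):
    """Return the expected field names that are absent from one record (a dict)."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def load_train_split(repo_id="selfcorrexp2/llama_sft_dpo_gen2_70b_aug1"):
    """Load the train split (hypothetical helper; needs network access)."""
    # Imported lazily so missing_fields() is usable without the library.
    from datasets import load_dataset

    return load_dataset(repo_id, split="train")
```

A possible use is to call `train = load_train_split()`, compare `len(train)` against `EXPECTED_TRAIN_ROWS`, and confirm that `missing_fields(train[0])` returns an empty list.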
Provided by:
selfcorrexp2



