selfcorrexp2/llama_sft_dpo_gen2_70b_aug4
收藏Hugging Face2025-01-19 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama_sft_dpo_gen2_70b_aug4
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列的特征字段,如索引、真实标签、提示信息、布尔类型的结果和预测、字符串类型的解决方案和预测结果。数据集仅包含一个训练集部分,包含大约101MB的数据和14574个样本。具体的应用场景和数据集的目的没有在README中说明。
The dataset consists of a series of feature fields such as index, ground truth labels, prompt information, boolean result and prediction, and string type solution and prediction result. The dataset contains only one training set part, with about 101MB of data and 14574 samples. The specific application scenario and the purpose of the dataset are not described in the README.
提供机构:
selfcorrexp2



