selfcorrexp2/llama3_sft_lesscorr_norr
收藏Hugging Face2025-01-08 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_lesscorr_norr
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了索引、提示文本、是否为第一轮、真实标签、奖励、解决方案、预测结果等多个字段。数据集中的对话被分为内容角色对的形式。数据集分为训练集,共有183154个示例。数据集适用于某种类型的任务,具体任务类型在README中未提及。
The dataset includes fields such as index, prompt text, whether it is the first round, ground truth label, reward, solution, prediction result, etc. Conversations in the dataset are divided into content-role pairs. The dataset is split into a training set with a total of 183154 examples. The specific task type for which the dataset is used is not mentioned in the README.
提供机构:
selfcorrexp2



