selfcorrexp2/llama3_less_corr_reset3
收藏Hugging Face2025-01-16 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_less_corr_reset3
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含索引、提示、答案序列、真实标签、代理标签和奖励信息的文本数据集,分为训练集,共有5000个样本。数据集适用于可能需要根据提示和答案进行训练的模型。
This is a text dataset containing index, prompt, answer sequence, ground truth label, proxy label, and reward information, split into a training set with a total of 5,000 samples. The dataset is suitable for models that may need to be trained based on prompts and answers.
提供机构:
selfcorrexp2



