dsrselfcorr/self_corr_first_wrong_qwenbase_prompt2_gen2
收藏Hugging Face2025-02-11 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/dsrselfcorr/self_corr_first_wrong_qwenbase_prompt2_gen2
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含索引、提示文本、答案序列、正确答案文本、是否获得真实奖励的布尔值以及奖励序列布尔值等字段。数据集分为训练集,共有6472个样本,总大小为344264680字节。
The dataset includes fields such as index, prompt text, answer sequence, correct answer text, boolean value indicating whether real reward is obtained, and sequence of reward boolean values. The dataset is split into a training set with a total of 6472 samples and a total size of 344264680 bytes.
提供机构:
dsrselfcorr



