selfcorrexp2/llama31_star_rr_baseline_5e6_2eptmp10
收藏Hugging Face2024-12-22 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama31_star_rr_baseline_5e6_2eptmp10
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,如索引、真实标签、提示信息、难度等级、类型、解决方案、个人解决方案、预测结果和奖励标志。数据集分为训练集,共有15000个示例。数据集适用于机器学习模型的训练和评估。
The dataset includes fields such as index, ground truth label, prompt information, difficulty level, type, solution, personal solution, prediction result, and reward flag. The dataset is split into a training set with a total of 15,000 examples. It is suitable for machine learning model training and evaluation.
提供机构:
selfcorrexp2



