selfcorrexp/llama3_8b_non_delete_40k_scaling_tmp07_1
收藏Hugging Face2025-01-02 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp/llama3_8b_non_delete_40k_scaling_tmp07_1
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一个训练集,其中每个示例包含索引、真实标签、提示信息、难度等级、类型、标准解决方案、用户书写的解决方案、预测结果以及是否获得奖励的信息。数据集主要用于训练机器学习模型,特别是在编程或代码生成的场景中。
The dataset consists of a training set, each example of which includes an index, a ground truth label, prompt information, difficulty level, type, a standard solution, a user-written solution, prediction results, and reward information. The dataset is primarily used for training machine learning models, particularly in scenarios involving programming or code generation.
提供机构:
selfcorrexp



