selfcorrexp2/llama3_sft_balanced_gen2_math_
Source: Hugging Face. Updated 2025-01-21. Indexed 2025-04-26.
Download link:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_balanced_gen2_math_
Description:
The dataset contains text data for training, with fields for the prompt text, answers, correct answers, and reward information. The training set contains more than 21,900 examples.
Provider:
selfcorrexp2



