mytestdpo/llama3_sft_gsm8k_sft_model_gen2_gsm8k_
收藏Hugging Face2025-01-19 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/llama3_sft_gsm8k_sft_model_gen2_gsm8k_
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个字段,用于可能的文本处理任务,如问答系统或文本分类。字段包括索引、提示文本、答案序列、真实标签、首次奖励、预测结果和二次奖励。数据集分为训练集,包含14768个样本。
The dataset contains multiple fields for potential text processing tasks such as question answering or text classification. Fields include index, prompt text, answer sequence, ground truth label, first reward, prediction result, and second reward. The dataset is split into a training set with 14768 samples.
提供机构:
mytestdpo



