selfcorrexp2/llama3_sft_2ep_math_base_merged_process
收藏Hugging Face2025-01-06 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_2ep_math_base_merged_process
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个字段:索引(idx),解决方案(my_solu),首次奖励(first_reward)和真实标签(gt)。索引为整数类型,解决方案和真实标签为字符串类型,首次奖励为布尔类型。数据集分为训练集(train),共有224879个样本,大小为343927149字节。数据集的下载大小为98802882字节。
The dataset includes four fields: index (idx), solution (my_solu), first reward (first_reward), and ground truth (gt). The index is of integer type, solution and ground truth are string type, and first reward is boolean type. The dataset is split into a training set (train) with a total of 224879 samples and a size of 343927149 bytes. The download size of the dataset is 98802882 bytes.
提供机构:
selfcorrexp2



