selfcorrexp2/llama3_sft_2ep_math_first_corr_regular_process
Source: Hugging Face | Updated: 2025-01-06 | Indexed: 2025-02-15
Download link:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_2ep_math_first_corr_regular_process
Description:
The dataset contains dialogue-style records with the following features: index (idx), the user's solution (my_solu), whether the first reward was obtained (first_reward), ground-truth label (gt), dialogue messages (messages), dialogue turn (turn), and the detailed conversation content (conversations). It provides a training split of 110,720 examples totalling 366,473,926 bytes. The default configuration also specifies the data file paths.
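As a rough illustration of how the features listed above might be checked after download, the following Python sketch defines a small helper that reports which of the listed column names are missing from a record. It assumes the Hugging Face `datasets` package for loading. The helper names `missing_fields` and `load_train_split` are hypothetical, and the column types are not stated in the description, so only the names are checked.

```python
# Column names as listed in the description; their types are not stated there.
EXPECTED_FIELDS = (
    "idx", "my_solu", "first_reward", "gt",
    "messages", "turn", "conversations",
)

# Repository ID taken from the download link.
REPO_ID = "selfcorrexp2/llama3_sft_2ep_math_first_corr_regular_process"


def missing_fields(record):
    """Return the expected column names that are absent from one record (a dict)."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def load_train_split():
    """Download the train split; needs the `datasets` package and network access."""
    # Imported lazily so that missing_fields() can be used offline.
    from datasets import load_dataset
    return load_dataset(REPO_ID, split="train")
```

Calling `load_train_split()` and passing its first record to `missing_fields` should return an empty list if the published columns match the description. The length of the returned split can be compared against the 110,720 examples reported above.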
Provider:
selfcorrexp2



