mytestdpo/llama3_sft_gsm8k_first_wrong_prompt
收藏Hugging Face2025-01-07 更新2025-04-26 收录
下载链接:
https://hf-mirror.com/datasets/mytestdpo/llama3_sft_gsm8k_first_wrong_prompt
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含对话相关的信息,每个样本包括对话的回合数、角色、内容,以及一些标记信息如正确答案和是否为第一轮对话等。数据集分为训练集,共有约5.8万个示例。
The dataset contains dialogue-related information, with each sample including the number of rounds, roles, content of the conversation, and some labeling information such as the correct answer and whether it is the first round of the conversation. The dataset is split into a training set with a total of about 58,000 examples.
提供机构:
mytestdpo



