selfcorrexp2/llama3_sft_less_corr_rr0k
收藏Hugging Face2024-12-28 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/selfcorrexp2/llama3_sft_less_corr_rr0k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列的对话,每个对话包括多轮交互。每轮交互包含对话内容以及角色信息。此外,数据集还包含了索引、提示文本、是否为第一轮对话、真实结果、奖励信息、个人解决方案、是否为特定标志的布尔值以及对话轮数等字段。数据集被划分为训练集,可用于对话系统的训练和评估。
The dataset consists of a series of dialogues, each with multiple rounds of interaction. Each interaction includes dialogue content and role information. In addition, the dataset contains fields such as index, prompt text, whether it is the first round of dialogue, ground truth, reward information, personal solution, a boolean indicating a specific flag, and the round number of the dialogue. The dataset is split into a training set, which can be used for training and evaluating dialogue systems.
提供机构:
selfcorrexp2



