penfever/dpo-qalfac
收藏Hugging Face2025-01-15 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/penfever/dpo-qalfac
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了对话系统的交互数据,其中有system和prompt作为交互的双方,同时为选中的回答和被拒绝的回答提供了评分,评分包括熵加权的峰度、平均值和方差。每个回答还标注了内容和角色。数据集仅包含训练集部分,共有359999个示例。
The dataset contains interaction data from a dialogue system, including the system and prompt as the two sides of the interaction. Scores for the chosen and rejected responses are provided, which include entropy-weighted kurtosis, mean, and variance. Each response is also labeled with content and role. The dataset only includes the training set, with a total of 359999 examples.
提供机构:
penfever



