penfever/dpo-qwen2572b-llama3170b-jdg-Llama3-Harmlessness

Name: penfever/dpo-qwen2572b-llama3170b-jdg-Llama3-Harmlessness
Creator: penfever
Published: 2024-12-29 11:45:15
License: No description available

Source: Hugging Face | Updated: 2024-12-29 | Indexed: 2025-02-15

Download link:

https://hf-mirror.com/datasets/penfever/dpo-qwen2572b-llama3170b-jdg-Llama3-Harmlessness

Description:

This dataset targets dialogue systems. Each example contains a question, a chosen response, a rejected response, and score information for each response: entropy-weighted kurtosis, mean, and variance. The dataset consists of a single training split with roughly 3.03 million examples, about 1.86 billion bytes in total.
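The record layout described above could be consumed roughly as follows. This is a minimal sketch, not the dataset's documented schema: the field names `question`, `chosen`, `rejected`, `chosen_score`, `rejected_score`, and `mean` are assumptions made for illustration, and the sample record is invented. Check the actual column names on the dataset page before relying on any of them.

```python
from dataclasses import dataclass
from typing import Any, Dict, Iterator

REPO_ID = "penfever/dpo-qwen2572b-llama3170b-jdg-Llama3-Harmlessness"


@dataclass
class PreferencePair:
    prompt: str
    chosen: str
    rejected: str
    margin: float  # chosen mean score minus rejected mean score


def to_pair(record: Dict[str, Any]) -> PreferencePair:
    """Convert one raw record into a prompt/chosen/rejected triple.

    Every key used here is hypothetical; adjust it to the real schema.
    """
    margin = record["chosen_score"]["mean"] - record["rejected_score"]["mean"]
    return PreferencePair(
        prompt=record["question"],
        chosen=record["chosen"],
        rejected=record["rejected"],
        margin=margin,
    )


def stream_pairs() -> Iterator[PreferencePair]:
    """Stream the train split from the Hugging Face Hub (needs network access)."""
    # Imported lazily so the rest of the sketch runs without the `datasets` package.
    from datasets import load_dataset

    for record in load_dataset(REPO_ID, split="train", streaming=True):
        yield to_pair(record)


# Invented record for illustration only; it is not taken from the dataset.
example = {
    "question": "How do I reset my password?",
    "chosen": "Use the account settings page to request a reset link.",
    "rejected": "I cannot help with that.",
    "chosen_score": {"mean": 0.75, "variance": 0.01, "kurtosis": 3.0},
    "rejected_score": {"mean": 0.25, "variance": 0.02, "kurtosis": 2.5},
}

pair = to_pair(example)
print(pair.prompt, pair.margin)
```

Streaming avoids downloading the full split up front, which may matter given the reported size. The `margin` field is only one possible way to use the per-response scores, for example to filter out pairs where the judge saw little difference.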

Provider:

penfever