penfever/dpo-qwen2572b-llama3170b-jdg-Llama3-Readability
Source: Hugging Face. Updated: 2024-12-29. Indexed: 2025-02-15.
Download link:
https://hf-mirror.com/datasets/penfever/dpo-qwen2572b-llama3170b-jdg-Llama3-Readability
Description:
This dataset contains the fields system, question, chosen, and rejected, together with their associated scores, and is intended for training machine learning models. Each score consists of three metrics: entropy-weighted kurtosis, mean, and variance. The dataset has a single training split with 319623 samples.
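The record layout described above can be sketched in Python as follows. This is only an illustration under stated assumptions: the listing names the text fields and the three score metrics but not the score column names, so `chosen_score`, `rejected_score`, and the metric keys are hypothetical, as are the helper functions and the example values. Joining system and question into one prompt is likewise an assumed convention, not something the listing specifies.

```python
from typing import Dict, TypedDict


class Score(TypedDict):
    # Hypothetical key names: the listing names the metrics, not the columns.
    entropy_weighted_kurtosis: float
    mean: float
    variance: float


class Record(TypedDict):
    system: str
    question: str
    chosen: str
    rejected: str
    chosen_score: Score    # hypothetical column name
    rejected_score: Score  # hypothetical column name


def to_preference_pair(record: Record) -> Dict[str, str]:
    """Flatten one record into a prompt/chosen/rejected triple.

    Joining system and question with a blank line is an assumption.
    """
    prompt = f"{record['system']}\n\n{record['question']}".strip()
    return {
        "prompt": prompt,
        "chosen": record["chosen"],
        "rejected": record["rejected"],
    }


def mean_margin(record: Record) -> float:
    """Difference between the mean scores of the chosen and rejected answers."""
    return record["chosen_score"]["mean"] - record["rejected_score"]["mean"]


# Made-up example values, for illustration only.
example: Record = {
    "system": "You are a helpful assistant.",
    "question": "What is DPO?",
    "chosen": "DPO is a method for training on preference pairs.",
    "rejected": "No idea.",
    "chosen_score": {"entropy_weighted_kurtosis": 0.5, "mean": 8.0, "variance": 1.0},
    "rejected_score": {"entropy_weighted_kurtosis": 0.25, "mean": 5.5, "variance": 2.0},
}

pair = to_preference_pair(example)
margin = mean_margin(example)
```

A margin like this could be used to filter out pairs whose chosen and rejected answers were scored almost equally, but whether that suits this dataset depends on the actual score columns, which should be checked against the dataset itself.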
Provider:
penfever



