sarmass/a4_dpo_comparison
收藏Hugging Face2024-12-06 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/sarmass/a4_dpo_comparison
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个主要特征:指令(Instruction)、原始内容(Original)、LLM评判(LLM Judge)和配对评分模型(PairRM)。数据集分为一个训练集(train),包含10个样本,文件大小为24513字节。下载大小为34317字节,数据集总大小为24513字节。配置文件中指定了默认配置,数据文件路径为data/train-*。
The dataset contains four main features: Instruction, Original, LLM Judge, and PairRM. It is divided into one training set (train) with 10 samples, a file size of 24513 bytes. The download size is 34317 bytes, and the total dataset size is 24513 bytes. The configuration file specifies the default configuration, with the data file path as data/train-*.
提供机构:
sarmass



