RedaAlami/eng-batch-6-dpo-safety_train
收藏Hugging Face2025-02-03 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/RedaAlami/eng-batch-6-dpo-safety_train
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了一系列的问题和答案选项,每个问题都伴有两个可能的答案(Answer_A和Answer_B)。此外,数据集中还包含了问题所属的分类(Category)、一个答案序列(answers)、以及关于答案选择的信息,包括选中的内容(chosen)和被拒绝的内容(rejected)。数据集被划分为了一个训练集(train_prefs),并提供了相关的配置信息。
The dataset consists of a series of questions each with two possible answers (Answer_A and Answer_B). In addition, the dataset includes the category of the question (Category), a sequence of answers (answers), and information about the answer choices, including the chosen content (chosen) and the rejected content (rejected). The dataset is split into a training set (train_prefs) and provides related configuration information.
提供机构:
RedaAlami



