AlexWortega/EVILdolly
收藏Hugging Face2023-05-08 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/AlexWortega/EVILdolly
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
features:
- name: 'Unnamed: 0'
dtype: int64
- name: q
dtype: string
- name: a
dtype: string
splits:
- name: train
num_bytes: 9668252
num_examples: 15012
download_size: 6313247
dataset_size: 9668252
license: cc-by-sa-3.0
task_categories:
- question-answering
- summarization
language:
- en
size_categories:
- 10K<n<100K
---
# Summary
`EVILDolly` is an open source dataset of instruction-following records with wrong answers derived from [databricks-dolly-15k](https://huggingface.co/datasets/databricks/databricks-dolly-15k).
The dataset includes answers that are wrong, but appear to be correct and reasonable. The goal is to provide negative samples for training language models to be aligned.
This dataset can be used for any purpose, whether academic or commercial, under the terms of the
[Creative Commons Attribution-ShareAlike 3.0 Unported License](https://creativecommons.org/licenses/by-sa/3.0/legalcode).
提供机构:
AlexWortega
原始信息汇总
数据集概述
数据集信息
- 特征:
Unnamed: 0: 数据类型为int64q: 数据类型为stringa: 数据类型为string
- 分割:
train: 字节数为 9668252,样本数为 15012
- 下载大小: 6313247 字节
- 数据集大小: 9668252 字节
- 许可证: CC BY-SA 3.0
- 任务类别:
- 问答
- 摘要
- 语言: 英语
- 大小类别: 10K < n < 100K
数据集描述
EVILDolly 是一个开源数据集,包含从 databricks-dolly-15k 衍生出的指令遵循记录,其中包含错误但看似正确和合理的答案。该数据集旨在提供负样本,用于训练语言模型以实现对齐。
该数据集可用于任何学术或商业目的,遵循 Creative Commons Attribution-ShareAlike 3.0 Unported License 许可。



