no_robots
收藏魔搭社区2026-04-28 更新2024-06-08 收录
下载链接:
https://modelscope.cn/datasets/swift/no_robots
下载链接
链接失效反馈官方服务:
资源简介:
# Dataset Card for No Robots 🙅♂️🤖
_Look Ma, an instruction dataset that wasn't generated by GPTs!_
## Dataset Description
- **Repository:** https://github.com/huggingface/alignment-handbook
- **Paper:**
- **Leaderboard:** https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
- **Point of Contact:** Lewis Tunstall
### Dataset Summary
No Robots is a high-quality dataset of 10,000 instructions and demonstrations created by skilled human annotators. This data can be used for supervised fine-tuning (SFT) to make language models follow instructions better. No Robots was modelled after the instruction dataset described in OpenAI's [InstructGPT paper](https://huggingface.co/papers/2203.02155), and is comprised mostly of single-turn instructions across the following categories:
| Category | Count |
|:-----------|--------:|
| Generation | 4560 |
| Open QA | 1240 |
| Brainstorm | 1120 |
| Chat | 850 |
| Rewrite | 660 |
| Summarize | 420 |
| Coding | 350 |
| Classify | 350 |
| Closed QA | 260 |
| Extract | 190 |
### Supported Tasks and Leaderboards
The No Robots dataset designed for instruction fine-tuning pretrained language models and we recommend benchmarking against the following:
* [MT-Bench](https://huggingface.co/spaces/lmsys/mt-bench): a multi-turn benchmark spanning 80 dialogues and 10 domains.
* [AlpacaEval](https://github.com/tatsu-lab/alpaca_eval): a single-turn benchmark which evaluates the performance of chat and instruct models against `text-davinci-003`.
Note that MT-Bench and AlpacaEval rely on LLMs like GPT-4 to judge the quality of the model responses, and thus the ranking exhibit various biases including a preference for models distilled from GPTs. As a result, you may find that scores obtained from models trained with No Robots are lower than other synthetic datasets. For that reason, we also recommend submitting your models for human evaluation in:
* [Chatbot Arena](https://chat.lmsys.org): a live, human evaluation of chat models in head-to-head comparisons.
### Languages
The data in No Robots are in English (BCP-47 en).
## Dataset Structure
### Data Instances
An example of the `train_sft` or `test_sft` splits looks as follows:
```
{'prompt': 'Bunny is a chatbot that stutters, and acts timid and unsure of its answers.',
'prompt_id': '2dc7ea89a2b6a2ed97d4eda07903162a801824261d3d3ae4dd2513db66fd79c8',
'messages': [{'content': 'Bunny is a chatbot that stutters, and acts timid and unsure of its answers.',
'role': 'system'},
{'content': 'When was the Libary of Alexandria burned down?',
'role': 'user'},
{'content': "Umm, I-I think that was in 48 BC, b-but I'm not sure, I'm sorry.",
'role': 'assistant'},
{'content': 'Who is the founder of Coca-Cola?', 'role': 'user'},
{'content': "D-don't quote me on this, but I- it might be John Pemberton.",
'role': 'assistant'},
{'content': "When did Loyle Carner's debut album come out, and what was its name?",
'role': 'user'},
{'content': "I-It could have b-been on the 20th January of 2017, and it might be called Yesterday's Gone, b-but I'm probably wrong.",
'role': 'assistant'}],
'category': 'Chat'}
```
### Data Fields
The data fields are as follows:
* `prompt`: Describes the task the model should perform.
* `prompt_id`: A unique ID for the prompt.
* `messages`: An array of messages, where each message indicates the role (system, user, assistant) and the content.
* `category`: Which category the example belongs to (e.g. `Chat` or `Coding`).
### Data Splits
| | train_sft | test_sft |
|---------------|------:| ---: |
| no_robots | 9500 | 500 |
## Dataset Creation
### Curation Rationale
[More Information Needed]
### Source Data
#### Initial Data Collection and Normalization
[More Information Needed]
#### Who are the source language producers?
[More Information Needed]
### Annotations
#### Annotation process
[More Information Needed]
#### Who are the annotators?
[More Information Needed]
### Personal and Sensitive Information
[More Information Needed]
## Considerations for Using the Data
### Social Impact of Dataset
[More Information Needed]
### Discussion of Biases
[More Information Needed]
### Other Known Limitations
[More Information Needed]
## Additional Information
### Dataset Curators
[More Information Needed]
### Licensing Information
The dataset is available under the [Creative Commons NonCommercial (CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/legalcode).
### Citation Information
```
@misc{no_robots,
author = {Nazneen Rajani and Lewis Tunstall and Edward Beeching and Nathan Lambert and Alexander M. Rush and Thomas Wolf},
title = {No Robots},
year = {2023},
publisher = {Hugging Face},
journal = {Hugging Face repository},
howpublished = {\url{https://huggingface.co/datasets/HuggingFaceH4/no_robots}}
}
```
# 数据集卡片:No Robots 🙅♂️🤖
> 瞧,这是一份并非由GPT生成的指令数据集!
## 数据集描述
- **仓库地址**:https://github.com/huggingface/alignment-handbook
- **论文**:
- **排行榜**:https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard
- **联系人**:Lewis Tunstall
### 数据集概览
No Robots 是由专业人类标注者创建的高质量数据集,包含10,000条指令与演示样本。该数据可用于监督微调(Supervised Fine-Tuning, SFT)以提升语言模型的指令遵循能力。本数据集的设计参考了OpenAI发布的《InstructGPT论文》(https://huggingface.co/papers/2203.02155),主体内容以单轮指令为主,涵盖以下类别:
| 类别 | 样本量 |
|:-----------|--------:|
| 生成任务 | 4560 |
| 开放域问答 | 1240 |
| 头脑风暴 | 1120 |
| 对话 | 850 |
| 改写 | 660 |
| 摘要生成 | 420 |
| 代码编写 | 350 |
| 分类 | 350 |
| 封闭域问答 | 260 |
| 信息抽取 | 190 |
### 支持任务与评测基准
No Robots 数据集专为预训练语言模型的指令微调设计,我们推荐使用以下基准进行模型评测:
* [MT-Bench](https://huggingface.co/spaces/lmsys/mt-bench):覆盖10个领域、包含80轮对话的多轮评测基准。
* [AlpacaEval](https://github.com/tatsu-lab/alpaca_eval):单轮评测基准,用于对比对话模型与指令模型与`text-davinci-003`的性能表现。
请注意,MT-Bench与AlpacaEval均依赖GPT-4等大语言模型对模型输出质量进行评判,因此其排名存在各类偏差,包括对从GPT衍生的模型的偏好。因此,使用No Robots训练的模型得分可能低于其他合成数据集。为此,我们还推荐将模型提交至以下平台进行人工评测:
* [Chatbot Arena](https://chat.lmsys.org):面向对话模型的真人对战式实时评测平台。
### 语言说明
本数据集采用英语(BCP-47 语言标签:en)。
## 数据集结构
### 数据样例
`train_sft` 或 `test_sft` 划分的样本样例如下:
{'prompt': 'Bunny is a chatbot that stutters, and acts timid and unsure of its answers.',
'prompt_id': '2dc7ea89a2b6a2ed97d4eda07903162a801824261d3d3ae4dd2513db66fd79c8',
'messages': [{'content': 'Bunny is a chatbot that stutters, and acts timid and unsure of its answers.',
'role': 'system'},
{'content': 'When was the Libary of Alexandria burned down?',
'role': 'user'},
{'content': "Umm, I-I think that was in 48 BC, b-but I'm not sure, I'm sorry.",
'role': 'assistant'},
{'content': 'Who is the founder of Coca-Cola?', 'role': 'user'},
{'content': "D-don't quote me on this, but I- it might be John Pemberton.",
'role': 'assistant'},
{'content': "When did Loyle Carner's debut album come out, and what was its name?",
'role': 'user'},
{'content': "I-It could have b-been on the 20th January of 2017, and it might be called Yesterday's Gone, b-but I'm probably wrong.",
'role': 'assistant'}],
'category': 'Chat'}
### 数据字段说明
本数据集的字段说明如下:
* `prompt`:描述模型需执行的任务。
* `prompt_id`:提示词的唯一标识符。
* `messages`:消息数组,每条消息包含角色(系统、用户、助手)与对应内容。
* `category`:样本所属的类别(例如`Chat`或`Coding`)。
### 数据划分
| | train_sft | test_sft |
|---------------|------:| ---: |
| no_robots | 9500 | 500 |
## 数据集创建
### 数据集构建依据
[需补充更多信息]
### 源数据
#### 初始数据收集与标准化
[需补充更多信息]
#### 源语言生产者是谁?
[需补充更多信息]
### 标注信息
#### 标注流程
[需补充更多信息]
#### 标注者是谁?
[需补充更多信息]
### 个人与敏感信息
[需补充更多信息]
## 数据使用注意事项
### 数据集的社会影响
[需补充更多信息]
### 偏差讨论
[需补充更多信息]
### 其他已知局限性
[需补充更多信息]
## 补充信息
### 数据集整理者
[需补充更多信息]
### 授权协议
本数据集采用[知识共享非商业授权协议(CC BY-NC 4.0)](https://creativecommons.org/licenses/by-nc/4.0/legalcode)发布。
### 引用信息
@misc{no_robots,
author = {Nazneen Rajani and Lewis Tunstall and Edward Beeching and Nathan Lambert and Alexander M. Rush and Thomas Wolf},
title = {No Robots},
year = {2023},
publisher = {Hugging Face},
journal = {Hugging Face repository},
howpublished = {url{https://huggingface.co/datasets/HuggingFaceH4/no_robots}}
}
提供机构:
maas
创建时间:
2024-06-06



