train_1M_CN
收藏魔搭社区2025-12-04 更新2024-05-15 收录
下载链接:
https://modelscope.cn/datasets/swift/train_1M_CN
下载链接
链接失效反馈官方服务:
资源简介:
## 内容
包含约100万条由[BELLE](https://github.com/LianjiaTech/BELLE)项目生成的中文指令数据。
## 样例
```
{
"instruction": "给定一个文字输入,将其中的所有数字加1。\n“明天的会议在9点开始,记得准时到达。”\n",
"input": "",
"output": "“明天的会议在10点开始,记得准时到达。”"
}
```
### 字段:
```
instruction: 指令
input: 输入(本数据集均为空)
output: 输出
```
## 使用限制
仅允许将此数据集及使用此数据集生成的衍生物用于研究目的,不得用于商业,以及其他会对社会带来危害的用途。
本数据集不代表任何一方的立场、利益或想法,无关任何团体的任何类型的主张。因使用本数据集带来的任何损害、纠纷,本项目不承担任何责任。
Dataset Content:
This dataset contains approximately 1 million Chinese instruction-tuning examples generated by the [BELLE](https://github.com/LianjiaTech/BELLE) project.
### Sample
json
{
"instruction": "Given a text input, add 1 to all numbers within it.
"Tomorrow's meeting starts at 9 o'clock, please arrive on time."
",
"input": "",
"output": ""Tomorrow's meeting starts at 10 o'clock, please arrive on time.""
}
### Field Definitions:
instruction: Task instruction
input: Input context (all samples in this dataset have empty input)
output: Model-generated response
## Usage Restrictions
This dataset and derivatives generated using it are only permitted for research purposes. Commercial use and other uses that may cause harm to society are prohibited.
This dataset does not represent the stance, interests, or viewpoints of any party, and is unrelated to any type of claim of any organization. The project shall not be liable for any damages or disputes arising from the use of this dataset.
提供机构:
maas
创建时间:
2024-06-05



