airoboros-gpt4
Source: ModelScope community. Updated 2025-12-05; indexed 2025-12-06.
Download link:
https://modelscope.cn/datasets/jondurbin/airoboros-gpt4
Description:
The data was generated by GPT-4 and is therefore subject to the OpenAI ToS. The tool used to generate the data, [airoboros](https://github.com/jondurbin/airoboros), is licensed under Apache-2.0.
Specific areas of focus for this training data:
* trivia
* math
* nonsensical math
* coding
* closed context question answering
* closed context question answering, with multiple contexts to choose from as confounding factors
* writing
* multiple choice
### Usage and License Notices
All airoboros models and datasets are intended and licensed for research use only. I've used the 'cc-nc-4.0' license, but really it is subject to a custom/special license because:
- the base model is LLaMa, which has its own special research license
- the dataset(s) were generated with OpenAI (gpt-4 and/or gpt-3.5-turbo), which has a clause saying the data can't be used to create models to compete with OpenAI
So, to reiterate: this model (and datasets) cannot be used commercially.
Provider:
maas
Created:
2025-08-29



