
airoboros-gpt4-1.3

ModelScope community · Updated 2025-12-05 · Indexed 2025-12-06
Download link:
https://modelscope.cn/datasets/jondurbin/airoboros-gpt4-1.3
Description:
## Overview

A continuation of [gpt4-1.2](https://huggingface.co/datasets/jondurbin/airoboros-gpt4-1.2), with:

* all coding instructions now have an equivalent "PLAINFORMAT" version
* several thousand new orca style prompts, this time with reasoning first, then response
* several examples of conversational/character interactions, with asterisk'd actions and quoted dialog

_*Note: I did not filter by token length for this dataset. Some examples are well over 2048 tokens, so use carefully.*_

### Usage and License Notices

All airoboros models and datasets are intended and licensed for research use only. I've used the 'cc-nc-4.0' license, but really it is subject to a custom/special license because:

- the base model is LLaMa, which has its own special research license
- the dataset(s) were generated with OpenAI (gpt-4 and/or gpt-3.5-turbo), which has a clause saying the data can't be used to create models to compete with OpenAI

So, to reiterate: this model (and datasets) cannot be used commercially.
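Because the dataset was not filtered by token length, it may help to drop over-long examples before training. The following is a minimal sketch, not part of the dataset's own tooling. It assumes each record is a dict with `instruction` and `response` text fields (the field names are an assumption here), and it counts whitespace-separated words as a rough stand-in for a real tokenizer. The helper names `count_tokens_whitespace` and `filter_by_token_length` are hypothetical.

```python
from typing import Callable, Iterable


def count_tokens_whitespace(text: str) -> int:
    """Rough stand-in for a real tokenizer: counts whitespace-separated words.

    Assumption: a real tokenizer usually yields more tokens than this,
    so pass your model's own token counter for an accurate limit.
    """
    return len(text.split())


def filter_by_token_length(
    records: Iterable[dict],
    max_tokens: int = 2048,
    count_tokens: Callable[[str], int] = count_tokens_whitespace,
    fields: tuple = ("instruction", "response"),  # assumed field names
) -> list:
    """Keep only records whose combined field length is within max_tokens."""
    kept = []
    for record in records:
        # Sum the token counts of every text field that is present.
        total = sum(count_tokens(record.get(field, "")) for field in fields)
        if total <= max_tokens:
            kept.append(record)
    return kept


# Tiny in-memory sample standing in for the real dataset file.
sample = [
    {"instruction": "Say hello.", "response": "Hello!"},
    {"instruction": "Repeat a word many times.", "response": "word " * 3000},
]

short_only = filter_by_token_length(sample, max_tokens=2048)
print(len(short_only))  # prints 1
```

To apply the limit the target model will actually see, pass that model's tokenizer as `count_tokens`, for example a function that returns the length of the encoded ID list.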

Provider:
maas
Created:
2025-08-29