five

Isotonic/agentinstruct-1Mv1-combined

收藏
Hugging Face2024-11-20 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/Isotonic/agentinstruct-1Mv1-combined
下载链接
链接失效反馈
官方服务:
资源简介:
Orca-AgentInstruct-1M-v1-combined是Microsoft发布的orca-agentinstruct-1M-v1数据集的编辑版本。这是一个完全合成的数据集,仅使用网络上公开的原始文本作为种子数据,并且是创建Orca-3-Mistral的完整AgentInstruct数据集的一个子集。与Mistral 7B Instruct相比,该数据集在AGIEval、MMLU、GSM8K、BBH和AlpacaEval等多个评估指标上分别有40%、19%、54%、38%和45%的改进。为了确保与大多数框架的兼容性,字符串被转换为字典列表,并且根据类别使用了自定义的系统提示。

Orca-AgentInstruct-1M-v1-combined is an edited version of the orca-agentinstruct-1M-v1 dataset released by Microsoft. It is a fully synthetic dataset using only raw text publicly available on the web as seed data, and it is a subset of the full AgentInstruct dataset that created Orca-3-Mistral. Compared to Mistral 7B Instruct, the dataset shows improvements of 40% on AGIEval, 19% on MMLU, 54% on GSM8K, 38% on BBH, and 45% on AlpacaEval. To ensure compatibility with most frameworks, strings were converted into lists of dicts, and custom system prompts based on the category were used.
提供机构:
Isotonic
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作