five

moss-003-sft-data

收藏
魔搭社区2026-01-08 更新2025-11-03 收录
下载链接:
https://modelscope.cn/datasets/openmoss/moss-003-sft-data
下载链接
链接失效反馈
官方服务:
资源简介:
# moss-003-sft-data ** More information: [MOSS Paper](https://www.mi-research.net/article/doi/10.1007/s11633-024-1502-8)** ## Conversation Without Plugins ### Categories | Category | \# samples | |----------------------|-----------:| | Brainstorming | 99,162 | | Complex Instruction | 95,574 | | Code | 198,079 | | Role Playing | 246,375 | | Writing | 341,087 | | Harmless | 74,573 | | Others | 19,701 | | Total | 1,074,551 | **Others** contains two categories: **Continue**(9,839) and **Switching**(9,862). The **Continue** category refers to instances in a conversation where the user asks the system to continue outputting the response from the previous round that was not completed. The **Switching** category refers to instances in a conversation where the user switches the language they are using. We remove the data for honesty because it contains private information. ## Conversation With Plugins ### Categories | Category | \# samples | |----------------------|-----------:| | Search Engine, Calculator, Equation Solver | about 300k | | Instruction To Image | about 50k | | Total | about 350k |

# moss-003-sft-data **更多信息:[MOSS论文(MOSS Paper)](https://www.mi-research.net/article/doi/10.1007/s11633-024-1502-8)** ## 无插件对话 ### 类别 | 类别 | 样本数量 | |----------------------|-----------:| | 头脑风暴(Brainstorming) | 99,162 | | 复杂指令(Complex Instruction) | 95,574 | | 代码(Code) | 198,079 | | 角色扮演(Role Playing) | 246,375 | | 写作(Writing) | 341,087 | | 无害对话(Harmless) | 74,573 | | 其他(Others) | 19,701 | | 总计(Total) | 1,074,551 | **其他(Others)**类别包含两个子类别:**续写(Continue)**与**语种切换(Switching)**,样本数分别为9,839与9,862。 **续写(Continue)**类别指对话场景中,用户要求系统继续输出上一轮未完成回复的样本。 **语种切换(Switching)**类别指对话场景中,用户切换当前使用语言的样本。 因部分诚实性相关数据包含隐私信息,我们已将其移除。 ## 带插件对话 ### 类别 | 类别 | 样本数量 | |----------------------|-----------:| | 搜索引擎、计算器、方程求解器(Search Engine, Calculator, Equation Solver) | 约30万 | | 文本转图像(Instruction To Image) | 约5万 | | 总计(Total) | 约35万 |
提供机构:
maas
创建时间:
2025-10-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作