Nyooti/databricks-dolly-15k
收藏Hugging Face2026-04-23 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/Nyooti/databricks-dolly-15k
下载链接
链接失效反馈官方服务:
资源简介:
`databricks-dolly-15k`是一个开源数据集,包含超过15,000条由Databricks员工生成的指令跟随记录,旨在使大型语言模型能够展示类似ChatGPT的交互能力。数据集涵盖了多种指令类别,包括创意写作、封闭QA、开放QA、摘要、信息提取、分类和头脑风暴等。数据集可以用于任何目的,包括学术或商业应用,遵循CC BY-SA 3.0许可。数据集中的记录由员工生成,部分类别参考了Wikipedia的文本。数据集的语言为美式英语。
`databricks-dolly-15k` is an open source dataset of instruction-following records generated by thousands of Databricks employees to enable large language models to exhibit the magical interactivity of ChatGPT. The dataset contains more than 15,000 records across various instruction categories, including brainstorming, classification, closed QA, generation, information extraction, open QA, and summarization. It can be used for any purpose, whether academic or commercial, under the terms of the Creative Commons Attribution-ShareAlike 3.0 Unported License. The records were generated by employees, with some categories referencing text from Wikipedia. The dataset is in American English.
提供机构:
Nyooti



