LaMini-instruction
收藏OpenXLab2026-04-18 收录
下载链接:
https://openxlab.org.cn/datasets/OpenDataLab/LaMini-instruction
下载链接
链接失效反馈官方服务:
资源简介:
LaMini-instruction distill the knowledge from large language models by performing sentence/offline distillation (Kim and Rush, 2016). We generate a total of 2.58M pairs of instructions and responses using gpt-3.5-turbo based on several existing resources of prompts, including self-instruct (Wang et al., 2022), P3 (Sanh et al., 2022), FLAN (Longpre et al., 2023) and Alpaca (Taori et al., 2023)
LaMini-instruction 通过句子级/离线蒸馏(sentence/offline distillation)技术从大语言模型(Large Language Model,LLM)中提炼知识,相关方法源自Kim与Rush 2016年的研究。本数据集依托Self-Instruct、P3、FLAN、Alpaca等多项现有提示资源,借助gpt-3.5-turbo模型共生成了258万条指令与回复配对样本,上述资源分别对应Wang等人(2022)、Sanh等人(2022)、Longpre等人(2023)及Taori等人(2023)的相关研究。
提供机构:
OpenDataLab
创建时间:
2024-05-14



