52K demonstrations used to fine-tune Alpaca

Name: 52K demonstrations used to fine-tune Alpaca
Creator: OpenDataLab
Published: 2026-05-31 07:30:41
License: 暂无描述

OpenDataLab2026-05-31 更新2024-05-09 收录

下载链接：

https://opendatalab.org.cn/OpenDataLab/52K_demonstrations_used_to_fine-tune_etc

下载链接

链接失效反馈

官方服务：

资源简介：

在学术预算下训练高质量的教学遵循模型有两个重要的挑战: 强大的预先训练的语言模型和高质量的教学遵循数据。第一个挑战是通过最近发布的Meta的新美洲驼模型来解决的。对于第二个挑战，自我指导论文建议使用现有的强语言模型来自动生成指令数据。特别地，羊驼是一种语言模型，使用从OpenAI text-davinci-003生成的52k指令跟随演示从LLaMA 7B模型进行监督学习进行微调。

Training high-quality instruction-following models under an academic budget faces two critical challenges: securing a powerful pre-trained language model and curating high-quality instruction-following datasets. The first challenge is addressed by Meta's newly released LLaMA model. For the second challenge, the Self-Instruct paper proposes using existing state-of-the-art language models to automatically generate instruction tuning data. Specifically, Alpaca, a language model fine-tuned from the LLaMA 7B base model through supervised learning, utilizes 52k instruction-following demonstrations generated by OpenAI's text-davinci-003.

提供机构：

OpenDataLab

创建时间：

2023-04-26

搜集汇总

数据集介绍