guanaco-llama2-1k
收藏魔搭社区2025-12-05 更新2025-03-22 收录
下载链接:
https://modelscope.cn/datasets/mlabonne/guanaco-llama2-1k
下载链接
链接失效反馈官方服务:
资源简介:
# Guanaco-1k: Lazy Llama 2 Formatting
This is a subset (1000 samples) of the excellent [`timdettmers/openassistant-guanaco`](https://huggingface.co/datasets/timdettmers/openassistant-guanaco) dataset, processed to match Llama 2's prompt format as described [in this article](https://huggingface.co/blog/llama2#how-to-prompt-llama-2). It was created using the following [colab notebook](https://colab.research.google.com/drive/1Ad7a9zMmkxuXTOh1Z7-rNSICA4dybpM2?usp=sharing).
Useful if you don't want to reformat it by yourself (e.g., using a script). It was designed for [this article](https://mlabonne.github.io/blog/posts/Fine_Tune_Your_Own_Llama_2_Model_in_a_Colab_Notebook.html) about fine-tuning a Llama 2 (chat) model in a Google Colab.
# Guanaco-1k:便捷适配Llama 2提示格式
本数据集为优质开源数据集`timdettmers/openassistant-guanaco`(https://huggingface.co/datasets/timdettmers/openassistant-guanaco)的子集,仅包含1000条样本,已按照[Hugging Face官方指南文章](https://huggingface.co/blog/llama2#how-to-prompt-llama-2)中规定的格式完成处理,以匹配Llama 2的提示词格式要求。
本数据集通过以下[Colab笔记本](https://colab.research.google.com/drive/1Ad7a9zMmkxuXTOh1Z7-rNSICA4dybpM2?usp=sharing)生成。
对于不愿手动(或通过脚本)自行完成格式转换的用户而言,本数据集具备极高的实用价值。本数据集专为[《在Google Colab中微调自有Llama 2(对话)大语言模型》](https://mlabonne.github.io/blog/posts/Fine_Tune_Your_Own_Llama_2_Model_in_a_Colab_Notebook.html)一文打造,该文章详细讲解了如何在Google Colab环境中微调Llama 2(对话)大语言模型。
提供机构:
maas
创建时间:
2025-03-18



