billhdzhao/alpaca-Llama-3.1-8B-Instruct-31k
Source: Hugging Face. Updated 2025-12-14; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/billhdzhao/alpaca-Llama-3.1-8B-Instruct-31k
Description:
The alpaca-Llama-3.1-8B-Instruct-31k dataset was generated with the distilabel tool and contains 31,000 training examples. Each record has the fields instruction, input, output, text, generation, distilabel_metadata, and model_name. The distilabel_metadata field in turn holds the raw input text, the raw output text, and generation statistics such as the input and output token counts. The dataset is intended mainly for instruction-based generation tasks; its sample record shows a response to a request for three tips for staying healthy. The dataset can be loaded with the Hugging Face datasets library.
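As an illustration of the loading step mentioned above, here is a minimal sketch in Python. It assumes the `datasets` package is installed, that the repository exposes a single default configuration, and that the 31,000 examples live in a split named `train`; none of these details is confirmed by this listing. The helper names `load_train_split` and `missing_fields` are hypothetical and exist only for this example.

```python
# Repository ID taken from the download link on this page.
REPO_ID = "billhdzhao/alpaca-Llama-3.1-8B-Instruct-31k"

# Field names as listed in the description above.
EXPECTED_FIELDS = [
    "instruction",
    "input",
    "output",
    "text",
    "generation",
    "distilabel_metadata",
    "model_name",
]


def missing_fields(record):
    """Return the expected field names that are absent from a record (a dict)."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def load_train_split(repo_id=REPO_ID):
    """Download the dataset and return its training split.

    Assumption: the split is named "train" and the default configuration
    is the right one. Requires network access.
    """
    # Imported here so the helpers above work without the package installed.
    from datasets import load_dataset

    return load_dataset(repo_id, split="train")
```

With network access, `ds = load_train_split()` should return a `Dataset` object; `missing_fields(ds[0])` then gives a quick check that the first record carries the fields described here, and `ds[0]["instruction"]` shows the instruction text of that record.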
Provider:
billhdzhao



