billhdzhao/alpaca-Llama-3.1-8B-Instruct-ultrachat_200k
收藏Hugging Face2025-12-15 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/billhdzhao/alpaca-Llama-3.1-8B-Instruct-ultrachat_200k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为alpaca-Llama-3.1-8B-Instruct-ultrachat_200k,是通过distilabel工具创建的合成数据集。它包含207,865个训练样本,主要用于生成任务。数据集中的每个样本包含prompt(提示)、prompt_id(提示ID)、messages(多轮对话消息)、generation(生成内容)等字段。messages字段记录了用户和助手之间的多轮对话内容,展示了如何使用主题功能的具体对话示例。数据集还包含了模型名称、输入输出token统计等元数据信息。该数据集可用于训练和评估对话生成模型,特别是针对主题功能相关的对话场景。
This dataset named alpaca-Llama-3.1-8B-Instruct-ultrachat_200k is a synthetic dataset created using the distilabel tool. It contains 207,865 training examples primarily for generation tasks. Each example in the dataset includes fields such as prompt, prompt_id, messages (multi-turn conversation), and generation. The messages field records the multi-turn dialogue between users and assistants, demonstrating specific conversation examples about theme functionality usage. The dataset also includes metadata such as model names and input/output token statistics. This dataset can be used to train and evaluate dialogue generation models, particularly for theme-related conversation scenarios.
提供机构:
billhdzhao



