billhdzhao/alpaca-Llama-2-7b-chat-hf-ultrachat_200k
收藏Hugging Face2025-12-16 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/billhdzhao/alpaca-Llama-2-7b-chat-hf-ultrachat_200k
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为alpaca-Llama-2-7b-chat-hf-ultrachat_200k,是使用distilabel创建的。它包含了通过管道生成的合成数据,具有prompt、messages、generation和model_name等特征。数据集的结构表明它适用于自然语言处理任务,特别是涉及聊天或对话AI的任务,如messages字段中包含的content和role字段所示。数据集还包括关于生成过程的元数据,如输入和输出文本的令牌计数。
This dataset, named alpaca-Llama-2-7b-chat-hf-ultrachat_200k, was created using distilabel. It contains synthetic data generated through a pipeline, with features such as prompt, messages, generation, and model_name. The structure of the dataset indicates its suitability for NLP tasks, particularly those involving chat or conversational AI, as seen in the messages field which includes content and role fields typical of chat datasets. The dataset also includes metadata about the generation process, such as token counts for input and output texts.
提供机构:
billhdzhao



