rubenroy/GammaCorpus-v2-5m
收藏Hugging Face2025-02-01 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/rubenroy/GammaCorpus-v2-5m
下载链接
链接失效反馈官方服务:
资源简介:
GammaCorpus v2 5m数据集是一个包含500万个结构化多轮对话的数据集,每个对话包括用户提示或问题和AI助手生成的回答。这是GammaCorpus数据集的第二个和最新版本,提供了比第一个版本更高质的对话内容,并经过了大量清洗。数据集以JSONL格式存储,适用于自然语言处理和对话AI研究。
The GammaCorpus v2 5m dataset consists of 5 million structured multi-turn conversations, each including a user prompt or question and an AI-generated response. This is the second and latest version of the GammaCorpus dataset, offering higher quality conversations and substantial cleaning compared to the GammaCorpus v1. The dataset is stored in JSONL format and is suitable for natural language processing and conversational AI research.
提供机构:
rubenroy



