AmazonScience/DocTalk
收藏Hugging Face2025-07-09 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/AmazonScience/DocTalk
下载链接
链接失效反馈官方服务:
资源简介:
DocTalk是一个通过三阶段管道构建的大型合成对话语料库,用于增强大型语言模型(LLM)的对话能力。该语料库包含了730,707个多轮、多主题的信息寻求对话,由相关维基百科文档集群转换而来。
DocTalk is a large-scale synthetic dialogue corpus constructed through a three-stage pipeline to convert clusters of related Wikipedia documents into multi-turn, multi-topic information-seeking conversations for enhancing the conversational capabilities of large language models (LLM).
提供机构:
AmazonScience



