WebCPM
收藏arXiv2025-09-30 收录
下载链接:
https://github.com/thunlp/webcpm
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了2万个样本,旨在增强注意力加强任务训练数据。此外,该数据集中的样本还被用于筛选和提升训练数据的质量。该数据集适用于多文档问答(QA)任务。
This dataset consists of 20,000 samples, intended to augment the training data for attention-enhanced tasks. Furthermore, samples within this dataset are also employed to filter and improve the quality of training data. This dataset is suitable for multi-document question answering (QA) tasks.



