ComplexDataLab/cdl-data-chai-ccnews-20240817
收藏Hugging Face2025-05-14 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/ComplexDataLab/cdl-data-chai-ccnews-20240817
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文章的标题、出版商、URL、作者列表、主题列表、摘要、正文内容、发布日期以及是否免费访问等字段。数据集被划分为训练集,其中包含超过220万条示例,总大小约为7.76GB。数据集支持默认配置,训练集数据文件以特定的路径进行存储。
The dataset includes fields such as article title, publisher, URL, list of authors, list of topics, summary, main text, publishing date, and whether it is free to access. The dataset is split into a training set, which contains more than 2.2 million examples and is about 7.76GB in total size. The dataset supports a default configuration, with training set data files stored at a specific path.
提供机构:
ComplexDataLab



