five

articles.h5

收藏
DataCite Commons2025-08-31 更新2025-09-08 收录
下载链接:
https://figshare.com/articles/dataset/articles_h5/29262743/1
下载链接
链接失效反馈
官方服务:
资源简介:
<b>📚 Dataset Description — 1M-News</b>The <b>1M-News</b> dataset is a large-scale, temporally evolving collection of news articles designed to evaluate <b>online adaptation and concept drift</b> in language models. It contains <b>1,006,004 news articles</b> published over a <b>20-year span (2005–2025)</b>, categorized into <b>nine high-level domains</b>:<br>InternationalCulture &amp; ArtsBusiness &amp; EconomyPolitics &amp; GovernmentSports &amp; AthleticsTech, Science &amp; EducationLifestyle &amp; LeisureHealth &amp; Well-beingOther<br>Each article is stored with:<br>Full text contentPublication date (used for temporal ordering)Category label (mapped into one of the nine domains)<br>
提供机构:
figshare
创建时间:
2025-08-31
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作