20Newsgroups: News Dataset
Source: Fudan University Social Science Data Platform. Updated 2020-04-07. Indexed 2025-12-27.
Download link:
https://rdr.fudan.edu.cn/datahome/open/datahome/4556
Description:
The 20 Newsgroups data set is a collection of approximately 20,000 newsgroup documents, partitioned (nearly) evenly across 20 different newsgroups. To the best of my knowledge, it was originally collected by Ken Lang, probably for his paper "Newsweeder: Learning to filter netnews", though he does not explicitly mention this collection. The 20 Newsgroups collection has become a popular data set for experiments in text applications of machine learning techniques, such as text classification and text clustering.
Provider:
(Open source) News dataset
Created:
2020-01-13



