document data
Source: Figshare. Updated: 2015-12-24. Indexed: 2026-04-29.
Download link:
https://figshare.com/articles/dataset/document_data/2057997
Description:
The 20 Newsgroups corpus is a widely used collection of documents drawn from 20 related categories; it contains 18,821 documents and 8,156 distinct words. Web-snippets is a set of search snippets drawn from 8 domains (categories); it contains 12,340 snippets and 30,338 distinct words.
Created:
2015-12-24



