
WikiProjects Machine Readable Dataset

DataCite Commons | Updated 2025-04-01 | Indexed 2024-07-25
Download link:
https://figshare.com/articles/dataset/WikiProjects_Machine_Readable_Dataset/5503819/1
Description:
Machine-readable format of the WikiProjects listed at https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Council/Directory
The dataset is generated using the code at https://github.com/wiki-ai/drafttopic/
The dataset is modeled as a nested tree that follows the original hierarchical mappings on the WikiProjects home page and its child pages.
* Each non-leaf entry represents a sub-category, with a name and some associated information such as the level in the page at which it was parsed and the root URL of the page it was parsed from.
* Each non-leaf node has a mandatory key "topics", which leads to the sub-categories within it.
* Each leaf node is a WikiProject entry, with the actual WikiProject name and its active status.
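
The tree described above can be walked recursively. The following is a minimal sketch, not the drafttopic implementation: only the "topics" key comes from the description, while the leaf keys "name" and "active", the shape of "topics" (a mapping or a list of child nodes), and the sample entries are assumptions made for illustration.

```python
from typing import Any, Dict, Iterator, Tuple

Path = Tuple[str, ...]


def iter_wikiprojects(
    node: Dict[str, Any], path: Path = ()
) -> Iterator[Tuple[Path, Dict[str, Any]]]:
    """Yield (path, leaf) pairs for every leaf node below `node`.

    A node is treated as a leaf when it has no "topics" key.
    """
    topics = node.get("topics")
    if topics is None:
        yield path, node
        return
    if isinstance(topics, dict):
        children = list(topics.items())
    else:
        # Assumed fallback: "topics" is a list of nodes carrying a "name".
        children = [(child.get("name", ""), child) for child in topics]
    for key, child in children:
        yield from iter_wikiprojects(child, path + (key,))


# Hypothetical sample data; the real file may use different keys.
sample = {
    "name": "Directory",
    "topics": {
        "Culture": {
            "name": "Culture",
            "topics": {
                "WikiProject Example A": {
                    "name": "WikiProject Example A",
                    "active": True,
                },
            },
        },
        "Science": {
            "name": "Science",
            "topics": {
                "WikiProject Example B": {
                    "name": "WikiProject Example B",
                    "active": False,
                },
            },
        },
    },
}

leaves = list(iter_wikiprojects(sample))
# Keep only the projects flagged as active (assumed "active" key).
active_names = [leaf["name"] for _, leaf in leaves if leaf.get("active")]
```

Each yielded path records the chain of sub-categories leading to a WikiProject, which makes it straightforward to flatten the hierarchy into rows if a tabular view is preferred.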
Provider:
figshare
Created:
2017-10-16