five

Topics for each Wikipedia Article across Languages

收藏
Figshare2020-04-15 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Topics_for_each_Wikipedia_Article_across_Languages/12127434
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains the predicted topic(s) for (almost) each Wikipedia article across languages. Each row contains the following columns:Qid,topic,probability,page_id,page_title,wiki_db Where: * Qid: Wikidata Item Id* topic: Topic based on the ORES draft topic model (https://www.mediawiki.org/wiki/Talk:ORES/Draft_topic) * probability: Probability to belong to the topic* page_id: page_id* page_title: page_title* wiki_db: wiki_db, for example for english Wikipedia is enwikiFor exampleQ1000211,Geography.Regions.Europe.Western_Europe,1.0,166578,Frières-Faillouël,euwikiTopics are predicted using the Wikidata-Topic model developed by Isaac Johnson (https://github.com/geohci/wikidata-topic-model)The source code to create this dataset can be found here:https://github.com/digitalTranshumant/wikidata-topic-model
创建时间:
2020-04-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作