
Topics for each Wikipedia Article across Languages

Source: DataCite Commons. Updated 2025-06-01; indexed 2024-07-28.
Download link:
https://figshare.com/articles/Topics_for_each_Wikipedia_Article_across_Languages/12127434/1
Description:
This dataset contains the predicted topic(s) for (almost) every Wikipedia article across languages.

Each row contains the following columns:

Qid,topic,probability,page_id,page_title,wiki_db

Where:

* Qid: Wikidata item ID
* topic: topic based on the ORES draft topic model (https://www.mediawiki.org/wiki/Talk:ORES/Draft_topic)
* probability: probability that the article belongs to the topic
* page_id: the page ID
* page_title: the page title
* wiki_db: the wiki database name, for example enwiki for English Wikipedia

For example:

Q1000211,Geography.Regions.Europe.Western_Europe,1.0,166578,Frières-Faillouël,euwiki

Topics are predicted using the Wikidata-Topic model developed by Isaac Johnson (https://github.com/geohci/wikidata-topic-model).

The source code to create this dataset can be found here:
https://github.com/digitalTranshumant/wikidata-topic-model
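As a rough illustration of the row layout described above, the following Python sketch parses rows into typed records. It assumes the file is plain comma-separated text readable by the standard `csv` module and that a header line may or may not be present; the names `TopicRow` and `parse_rows` are hypothetical and not part of the dataset or its source code.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple

# Column order as given in the dataset description.
COLUMNS = ["Qid", "topic", "probability", "page_id", "page_title", "wiki_db"]


class TopicRow(NamedTuple):
    """One predicted topic for one article (hypothetical helper type)."""
    qid: str
    topic: str
    probability: float
    page_id: int
    page_title: str
    wiki_db: str


def parse_rows(lines: Iterable[str]) -> Iterator[TopicRow]:
    """Yield typed rows, skipping blank lines and an optional header line."""
    for record in csv.reader(lines):
        if not record or record == COLUMNS:
            continue  # assumption: a header, if present, matches COLUMNS exactly
        qid, topic, probability, page_id, page_title, wiki_db = record
        yield TopicRow(qid, topic, float(probability), int(page_id),
                       page_title, wiki_db)


# The example row quoted in the description.
sample = io.StringIO(
    "Q1000211,Geography.Regions.Europe.Western_Europe,1.0,166578,"
    "Frières-Faillouël,euwiki\n"
)
rows = list(parse_rows(sample))

# Topic labels are dot-separated, so the hierarchy can be split into levels.
topic_levels = rows[0].topic.split(".")
print(rows[0].qid, rows[0].wiki_db, topic_levels)
```

Because an article can have more than one predicted topic, the same Qid may appear on several rows; grouping by `qid` and `wiki_db` would collect them per article.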
Provider:
figshare
Created:
2020-04-15