Scholarly article citations in Wikipedia
收藏DataCite Commons2024-12-16 更新2024-07-25 收录
下载链接:
https://figshare.com/articles/Wikipedia_Scholarly_Article_Citations/1299540/9
下载链接
链接失效反馈官方服务:
资源简介:
This dataset includes a list of citations to scholarly articles from the most recent version of Wikipedia.
<strong>License</strong>
All files included in this datasets are released under CC0: https://creativecommons.org/publicdomain/zero/1.0/
<strong>Projects</strong>
• English Wikipedia
<strong>Identifiers</strong>
• PubMed IDs (pmid) and PubMedCentral IDs (pmcid).<br>• Digital Object Identifiers (doi)
• International Standard Book Number (isbn)
• ArXiv Ids (arxiv)
<strong>Format</strong>
Each row in the dataset represents a citation as a (Wikipedia article, scholarly article) pair. Metadata about when the citation was first added is included.
• page_id -- The identifier of the Wikipedia article (int), e.g. <em>1325125<br>• </em>page_title -- The title of the Wikipedia article (utf-8), e.g.<em> Club cell<br>• </em>rev_id -- The Wikipedia revision where the citation was first added (int), e.g.<em> 282470030<br>• </em>timestamp -- The timestamp of the revision where the citation was first added. (ISO 8601 datetime), e.g.<em> 2009-04-08T01:52:20Z<br>• </em>type -- The type of identifier, e.g.<em> pmid<br>• </em>id -- The id of the cited scholarly article (utf-8), e.g.<em> 18179694</em>
<strong>Source code</strong>
https://github.com/halfak/Extract-scholarly-article-citations-from-Wikipedia (MIT Licensed)
<strong>Notes</strong>
Citation identifers are extracted as-is from Wikipedia article content. Our spot-checking suggests that 98% of identifiers resolve.
<em>• </em>Added ISBNs for the 20150205 dataset.
• Added arXivs for the 20150602 dataset.
提供机构:
Figshare
创建时间:
2017-01-05



