Diff posts' titles, authors, full-text, dates, and tags (from 2008-04-11 to 2023-08-31
收藏DataCite Commons2024-05-18 更新2024-08-19 收录
下载链接:
https://figshare.com/articles/dataset/_i_Diff_posts_titles_authors_full-text_dates_and_tags_from_2008-04-11_to_2023-08-31_i_/25844089/2
下载链接
链接失效反馈官方服务:
资源简介:
This csv contains the <i>titles, authors, full-text, dates, and tags of all Diff posts </i><i>from 2008-04-11 to 2023-08-31</i>.<br>Diff (https://diff.wikimedia.org/) is the collaborative, multilingual and multimedia platform for news, updates, and discussions related to the Wikimedia movement.All URLs were retrieved using this Python code: https://gitlab.wikimedia.org/segt/libraries-in-wikimedia/-/blob/main/diff_posts/scrape_diff_data.ipynbFull text and authors were scraped using R and custom functions:<br>https://gitlab.wikimedia.org/segt/r-code-for-various-wiki-tasks/-/blob/main/diff_scrape-authors.Rhttps://gitlab.wikimedia.org/segt/r-code-for-various-wiki-tasks/-/blob/main/diff_scrape_full-text.R<br><br>The DOI of this dataset is: 10.6084/m9.figshare.25844089<br>
提供机构:
figshare
创建时间:
2024-05-17



