
Metadata of French Electronic Theses (theses.fr), focused on altmetrics (2020)

Source: NIAID Data Ecosystem, indexed 2026-05-02
Download link:
https://zenodo.org/record/14954419
Description:
For the vast majority of the manuscripts it hosts, the theses.fr archive provides the following metadata: author(s), dissertation title, document type, defense date, domain(s), keywords, language, and the identity of the dissertation director or supervisor. In addition, each manuscript’s presentation page displays two dynamic metrics: the number of views and the number of downloads. A document counts as “viewed” when a user accesses the webpage that displays both its metadata and the URL of the PDF. However, the PDF can also be reached and downloaded directly from a search engine such as Google, without navigating through the archive or using Google Scholar. Such direct access does not generate a view, which likely explains why, as we will observe, the number of downloads consistently exceeds the number of views. One limitation of these traffic metrics is that they do not indicate whether the documents are actually read. The metadata were collected on June 23, 2020, for 99,743 manuscripts available in the open repository, using a web scraper written in Python and based on the Selenium library.
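The relationship between the two traffic metrics described above can be checked with a few lines of Python. The following is only a minimal sketch: the record type, the field names (`thesis_id`, `views`, `downloads`), and the sample values are hypothetical and are not taken from the dataset, whose actual column names may differ.

```python
from dataclasses import dataclass


@dataclass
class ThesisRecord:
    # Hypothetical field names; the dataset's real columns may differ.
    thesis_id: str
    views: int
    downloads: int


def share_downloads_exceed_views(records):
    """Return the fraction of records with strictly more downloads than views."""
    if not records:
        return 0.0
    exceeding = sum(1 for r in records if r.downloads > r.views)
    return exceeding / len(records)


# Made-up illustrative values, not real theses.fr figures.
sample = [
    ThesisRecord("A", views=10, downloads=25),
    ThesisRecord("B", views=40, downloads=30),
    ThesisRecord("C", views=5, downloads=5),
    ThesisRecord("D", views=0, downloads=12),
]

print(f"{share_downloads_exceed_views(sample):.2f}")
```

Applied to the full set of records, a share close to 1 would be consistent with the observation that direct PDF access from search engines inflates downloads relative to views.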
Created:
2025-03-01