five

Scopus API Scripts for Data Reuse Project

收藏
DataCite Commons2021-04-26 更新2025-04-16 收录
下载链接:
https://databank.illinois.edu/datasets/IDB-0988473
下载链接
链接失效反馈
官方服务:
资源简介:
To generate the bibliographic and survey data to support a data reuse study conducted by several Library faculty and accepted for publication in the Journal of Academic Librarianship, the project team utilized a series of web-based online scripts that employed several different endpoints from the Scopus API. The related dataset: "Data for: An Examination of Data Reuse Practices within Highly Cited Articles of Faculty at a Research University" contains survey design and results. <br> 1) <b>getScopus_API_process_dmp_IDB.asp</b>: used the search API query the Scopus database API for papers by UIUC authors published in 2015 -- limited to one of 9 pre-defined Scopus subject areas -- and retrieve metadata results sorted highest to lowest by the number of times the retrieved articles were cited. The URL for the basic searches took the following form: https://api.elsevier.com/content/search/scopus?query=(AFFIL%28(urbana%20OR%20champaign) AND univ*%29) OR (AF-ID(60000745) OR AF-ID(60005290))&amp;apikey=xxxxxx&amp;start=" &amp; nstart &amp; "&amp;count=25&amp;date=2015&amp;view=COMPLETE&amp;sort=citedby-count&amp;subj=PHYS<br> Here, the variable nstart was incremented by 25 each iteration and 25 records were retrieved in each pass. The subject area was renamed (e.g. from PHYS to COMP for computer science) in each of the 9 runs. This script does not use the Scopus API cursor but downloads 25 records at a time for up to 28 times -- or 675 maximum bibliographic records. The project team felt that looking at the most 675 cited articles from UIUC faculty in each of the 9 subject areas was sufficient to gather a robust, representative sample of articles from 2015. These downloaded records were stored in a temporary table that was renamed for each of the 9 subject areas. <br> 2) <b>get_citing_from_surveys_IDB.asp</b>: takes a Scopus article ID (eid) from the 49 UIUC author returned surveys and retrieves short citing article references, 200 at a time, into a temporary composite table. These citing records contain only one author, no author affiliations, and no author email addresses. This script uses the Scopus API cursor=* feature and is able to download all the citing references of an article 200 records at a time. <br> 3) <b>put_in_all_authors_affil_IDB.asp</b>: adds important data to the short citing records. The script adds all co-authors and their affiliations, the corresponding author, and author email addresses. <br> 4) <b>process_for_final_IDB.asp</b>: creates a relational database table with author, title, and source journal information for each of the citing articles that can be copied as an Excel file for processing by the Qualtrics survey software. This was initially 4,626 citing articles over the 49 UIUC authored articles, but was reduced to 2,041 entries after checking for available email addresses and eliminating duplicates.

为生成文献和调查数据以支持一项由多位图书馆教职员工开展并被《学术图书馆学杂志》接受发表的数据复用研究,项目团队使用了一系列基于网络的在线脚本,这些脚本调用了Scopus API的多个不同端点。相关数据集"Data for: An Examination of Data Reuse Practices within Highly Cited Articles of Faculty at a Research University"包含调查设计与结果。<br> 1) <b>getScopus_API_process_dmp_IDB.asp</b>:使用搜索API查询Scopus数据库中伊利诺伊大学厄巴纳-香槟分校(UIUC)作者2015年发表的论文——仅限9个预定义的Scopus学科领域之一——并按被引次数从高到低排序检索元数据结果。基础搜索的URL格式如下:https://api.elsevier.com/content/search/scopus?query=(AFFIL%28(urbana%20OR%20champaign) AND univ*%29) OR (AF-ID(60000745) OR AF-ID(60005290))&amp;apikey=xxxxxx&amp;start=" &amp; nstart &amp; "&amp;count=25&amp;date=2015&amp;view=COMPLETE&amp;sort=citedby-count&amp;subj=PHYS<br> 其中,变量nstart每次迭代增加25,每次获取25条记录。该脚本不使用Scopus API的cursor功能,而是每次下载25条记录,最多下载28次——即最多675条文献记录。项目团队认为,查看UIUC教职员工在9个学科领域中各最多675篇被引最高的文章,足以收集2015年文章的稳健且具代表性的样本。这些下载的记录存储在临时表中,每个学科领域对应一个重命名的临时表。<br> 2) <b>get_citing_from_surveys_IDB.asp</b>:从49篇UIUC作者的返回调查中提取Scopus文章ID(eid),并每次检索200条简短的引用文章参考文献,存入临时复合表。这些引用记录仅包含一位作者,无作者所属机构及邮箱地址。该脚本使用Scopus API的cursor=*功能,能够每次下载一篇文章的200条引用参考文献,直至获取所有引用。<br> 3) <b>put_in_all_authors_affil_IDB.asp</b>:向简短的引用记录中添加重要数据。该脚本补充所有共同作者及其所属机构、通讯作者以及作者邮箱地址。<br> 4) <b>process_for_final_IDB.asp</b>:创建一个关系数据库表,包含每篇引用文章的作者、标题及来源期刊信息,该表可复制为Excel文件供Qualtrics调查软件处理。最初,49篇UIUC作者文章对应的引用文章共4626篇,但在检查可用邮箱地址并消除重复后,条目数减少至2041条。
提供机构:
University of Illinois at Urbana-Champaign
创建时间:
2021-04-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作