five

anonymized author dataset from the publication "Impact of the COVID-19 pandemic on publishing in astronomy in the initial two years"

收藏
NIAID Data Ecosystem2026-03-14 收录
下载链接:
https://zenodo.org/record/7145195
下载链接
链接失效反馈
官方服务:
资源简介:
We provide the anonymized author dataset that was prepared for the research presented in https://arxiv.org/pdf/2203.15621.pdf . The data is provided as two pickled pandas data frames. The first data frame contains the number of publications each author has written in each year per author position (as 1st author etc.) and their assigned gender. The second file contains the corresponding country of affiliation for each author and year. The two data frames can be matched by author_id. Columns in "matched_author_dataset_publication_counts_zenodo.pkl": ['author_id', 'gender', 'P(gender)', 'pub1_tot', 'first_auth_pub_year', 'pub2_tot', 'auth_2_pub_year', 'pub3_tot', 'auth_3_pub_year', 'pub4_tot', 'auth_4_pub_year', 'pub5_tot', 'auth_5_pub_year', 'pub6_tot', 'auth_6_pub_year', 'pub7_tot', 'auth_7_pub_year', 'pub8_tot', 'auth_8_pub_year', 'pub9_tot', 'auth_9_pub_year', 'pub10_tot', 'auth_10_pub_year', 'pub11_tot', 'auth_11_pub_year', 'pub12_tot', 'auth_12_pub_year', 'pub13_tot', 'auth_13_pub_year', 'pub14_tot', 'auth_14_pub_year', 'pub15_tot', 'auth_15_pub_year', 'pub16_tot', 'auth_16_pub_year', 'tot_pub', 'tot_pub_year', 'last_pub', 'first_pub'] 'author_id' is a unique id assigned to the author. 'gender' contains the author's most likely gender and 'P('gender')' the likelihood of correct assignment. 'pubX_tot' contains the total number of publications the author has written as Xth author, 'auth_X_pub_year' contains a list with one entry per year from 1950 to 2022 counting the number of papers the author has published as Xth author in that year. 'tot_pub' counts the total number of publications (summed over all years) and 'tot_pub_year' for each year/ 'last_pub' and 'first pub' contain the list indices of the years when the author last/first published. Columns in author_dataframe_country.pkl: ['author_id', 'aff_year', 'aff_country_author'] 'aff_year' is a list of years for which country information could be inferred for this author. 'aff_country_author' is a list of countries for each entry in 'aff_year'.
创建时间:
2022-10-05
二维码
社区交流群
二维码
科研交流群
商业服务