Material of the Journal of Informetrics article on self-citations
收藏DataCite Commons2020-08-28 更新2024-08-17 收录
下载链接:
https://figshare.com/articles/Material_of_the_Journal_of_Informetrics_article_on_self-citations/6866660/5
下载链接
链接失效反馈官方服务:
资源简介:
This package contains the materials, data, and results of the experiments introduced in the article "The practice of self-citations: a longitudinal study" by Silvio Peroni, Paolo Ciancarini, Aldo Gangemi, Andrea Giovanni Nuzzolese, Francesco Poggi, and Valentina Presutti, submitted to the Journal of Informetrics.<br><br>In particular, it contains:<br><br>1. A README.txt file (this file).<br><br>2. The directory "data" which contains the original data used for the experiments. In particular it contains two CSV files called "author-self-citations.csv" and "author-network-self-citations.csv". The first file counts all the author self-citations (i.e. those where the citing article and the cited article share at least one author) for each of the articles analysed. Instead, the second file counts all the author network self-citations (i.e. those when a co-author of any author of the citing article is also the author of the cited article) for the same set of articles. The tabular structure followed for these two files is the same:<br>- "id" is the local identifier of the article in consideration;<br>- "year" is the year of publication of the article;<br>- "category" is the discipline to which the article belongs to;<br>- "citation" is the number of bibliographic references in its reference list, i.e. the citations that it does to other works;<br>- "self" is the number of bibliographic references that denotes a self-citation.<br><br>3. The directory "evaluation" that contains several CSV files and images describing, for each discipline considered, the difference in the means of the number of self-citations before and after 2012. In particular, the sub-directory "author-self-citations" contains two CSV files, i.e. "author-self-citations-1959-2016.csv" and "author-self-citations-2009-2016.csv", that contains the aforementioned data about author self-citations considering the 1959-2016 and the 2009-2016 publication windows for the articles, and the related diagram showing the confidence intervals computed in both cases. Similarly, the sub-directory "author-network-self-citations" contains other 2 CSV files and a diagram that describe the same information concerning author network self-citations. The data in the CSV are structured as follows:<br>- "category": is the discipline in consideration;<br>- "# p[year <= 2012]" is the number of articles published by 2012;<br>- "mean p[year <= 2012]" is the mean of self-citations per article, considering those ones published by 2012;<br>- "st p[year <= 2012]" is the standard deviation of the previous mean;<br>- "# p[year > 2012]" is the number of articles published after 2012;<br>- "mean p[year > 2012]" is the mean of self-citations per article, considering those ones published after 2012;<br>- "st p[year > 2012]" is the standard deviation of the previous mean;<br>- "diff" is the difference between the two means;<br>- "ci-low" is the lower confidence interval limit (margin of error) of the previous difference;<br>- "ci-high" is the higher confidence interval limit of the previous difference.<br><br>4. The directory "script" that contains two Python scripts that have been used for calculating the aforementioned results and diagrams. In particular:<br>- "analyse_data.py" has been used to create the CSV files contained in the directory "evaluation" starting from the information contained in the directory "data";<br>- "error_diagram.py" has been used to create the two diagrams included in the directory evaluation.<br><br>All the documents and data in this package are released with a CC0 waiver (https://creativecommons.org/publicdomain/zero/1.0/legalcode), while the Python scripts are licensed with an ISC Licence (https://opensource.org/licenses/ISC). <br>
提供机构:
figshare
创建时间:
2018-08-05



