Data from "Measuring data rot: an analysis of the continued availability of shared data from a single university"
收藏DataCite Commons2024-03-06 更新2024-07-13 收录
下载链接:
https://data.caltech.edu/doi/10.22002/h5e81-spf62
下载链接
链接失效反馈官方服务:
资源简介:
Data files from the article "Measuring Data Rot: An Analysis of the Continued Availability of Shared Data from a Single University" by Kristin Briney.
This research looked at supplemental data links from publications in CaltechAUTHORS and tested them for their availability on the web using web scraping and hand testing in the Chrome browser.
Data in the tables:
Table1_ResearchAreas.csv
Table2_LinkType.csv
Table3_URLwebsites.csv
Table4_DOIwebsites.csv
Table5_UnavailableByType.csv
Table6_UnavailableURLs.csv
Table7_UnavailableDOIs.csv
Data in the figures:
Figure1_LinksByYear.csv
Figure2_UnavailableByYear.csv
Data from the project:
DataRot.csv
Overall dataset supporting this research, with variables defined in the data dictionary. This data contains all of the links tested, listing results of the webscraping but not results of the hand testing.
DataRot_dataDictionary.csv
Data dictionary defining variable names and values for DataRot.csv
DataRot_handTested.csv
Subset of supplemental data links from DataRot.csv that were hand tested and the results of the hand testing ("browser_test = TRUE" means the data was available, "browser_test = FALSE" means the data was not available, and "browser_test = LOGIN" means the webpage asked for a login to see the data).
DataRot_missingData.csv
Subset of DataRot_handTested.csv with fewer variables. This dataset only includes supplemental data links for data that was not available.
CaltechAUTHORS sampling dataset:
Sampling.csv
Contains comparison between 450 articles recorded in CaltechAUTHORS with what is listed in the articles themselves with respect to shared data and supplementary information.
Sampling_dataDictionary.txt
Data dictionary defining variable names and values for Sampling.csv
提供机构:
CaltechDATA
创建时间:
2024-03-06



