Data from "Measuring data rot: an analysis of the continued availability of shared data from a single university"
收藏DataCite Commons2023-06-27 更新2025-04-09 收录
下载链接:
https://data.caltech.edu/doi/10.22002/tevtp-hga12
下载链接
链接失效反馈官方服务:
资源简介:
Data files from the article "Measuring Data Rot" by Kristin Briney.
This research looked at supplemental data links from publications in CaltechAUTHORS and tested them for their availability on the web using web scraping and hand testing in the Chrome browser.
Data in the tables:
Table1_LinkType.csv
Table2_URLwebsites.csv
Table3_DOIwebsites.csv
Table4_UnavailableByType.csv
Table5_UnavailableURLs.csv
Table6_UnavailableDOIs.csv
Data in the figures:
Figure1_LinksByYear.csv
Figure2_UnavailableByYear.csv
Data from the project:
DataRot.csv
Overall dataset supporting this research, with variables defined in the data dictionary. This data contains all of the links tested, listing results of the webscraping but not results of the hand testing.
DataRot_dataDictionary.csv
Data dictionary defining variable names and values for DataRot.csv
DataRot_handTested.csv
Subset of supplemental data links from DataRot.csv that were hand tested and the results of the hand testing ("browser_test = TRUE" means the data was available, "browser_test = FALSE" means the data was not available, and "browser_test = LOGIN" means the webpage asked for a login to see the data).
DataRot_missingData.csv
Subset of DataRot_handTested.csv with fewer variables. This dataset only includes supplemental data links for data that was not available.
提供机构:
CaltechDATA
创建时间:
2023-06-27



