five

Data from "Measuring data rot: an analysis of the continued availability of shared data from a single university"

收藏
DataCite Commons2023-06-27 更新2025-04-09 收录
下载链接:
https://data.caltech.edu/doi/10.22002/tevtp-hga12
下载链接
链接失效反馈
官方服务:
资源简介:
Data files from the article "Measuring Data Rot" by Kristin Briney. This research looked at supplemental data links from publications in CaltechAUTHORS and tested them for their availability on the web using web scraping and hand testing in the Chrome browser. Data in the tables: Table1_LinkType.csv Table2_URLwebsites.csv Table3_DOIwebsites.csv Table4_UnavailableByType.csv Table5_UnavailableURLs.csv Table6_UnavailableDOIs.csv Data in the figures: Figure1_LinksByYear.csv Figure2_UnavailableByYear.csv Data from the project: DataRot.csv Overall dataset supporting this research, with variables defined in the data dictionary. This data contains all of the links tested, listing results of the webscraping but not results of the hand testing. DataRot_dataDictionary.csv Data dictionary defining variable names and values for DataRot.csv DataRot_handTested.csv Subset of supplemental data links from DataRot.csv that were hand tested and the results of the hand testing ("browser_test = TRUE" means the data was available, "browser_test = FALSE" means the data was not available, and "browser_test = LOGIN" means the webpage asked for a login to see the data). DataRot_missingData.csv Subset of DataRot_handTested.csv with fewer variables. This dataset only includes supplemental data links for data that was not available.
提供机构:
CaltechDATA
创建时间:
2023-06-27
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作