Classes of errors in DOI names: output dataset
Source: NIAID Data Ecosystem (indexed 2026-03-12)
Download link:
https://zenodo.org/record/4733646
Description:
This dataset contains a seven-column CSV file. The first column ("Valid_citing_DOI") contains the DOI of a citing entity retrieved from Crossref. The second column ("Invalid_cited_DOI") contains the invalid DOI of a cited entity, identified from the "reference" field of the JSON document returned by querying the Crossref API with the citing DOI. The third column ("Valid_DOI") contains the corrected DOI if one was identified, and an empty string otherwise. The last four columns ("Already_valid", "Prefix_error", "Suffix_error", "Other-type_error") each contain 1 if the error in the DOI belongs to that class, and 0 otherwise.
The citations to invalid DOIs were retrieved from "Citations to invalid DOI-identified entities obtained from processing DOI-to-DOI citations to add in COCI" (Peroni, 2021), while the valid DOI names and the related classes of errors are the result of a process described in "Cleaning different types of DOI errors found in cited references on Crossref using automated methods", by the same authors as this dataset.
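
The seven-column layout described above can be read with a few lines of Python. The following is a minimal sketch, not the authors' own tooling. It assumes the file has a header row carrying the column names quoted in the description, and that the four class columns hold the strings "0" or "1". The sample rows and the `summarize` helper are hypothetical and invented purely for illustration; they are not taken from the dataset.

```python
import csv
import io

# Error-class column names, as quoted in the dataset description.
ERROR_CLASSES = ["Already_valid", "Prefix_error", "Suffix_error", "Other-type_error"]

# Hypothetical sample rows for illustration only (NOT real dataset content).
# Assumption: the real file starts with a header row like this one.
SAMPLE_CSV = """Valid_citing_DOI,Invalid_cited_DOI,Valid_DOI,Already_valid,Prefix_error,Suffix_error,Other-type_error
10.1000/citing.1,10.1000/cited.1.,10.1000/cited.1,0,0,1,0
10.1000/citing.2,doi:10.1000/cited.2,10.1000/cited.2,0,1,0,0
10.1000/citing.3,10.1000/unknown,,0,0,0,0
"""


def summarize(handle):
    """Return (total rows, rows with a corrected DOI, count per error class)."""
    counts = {name: 0 for name in ERROR_CLASSES}
    total = 0
    corrected = 0
    for row in csv.DictReader(handle):
        total += 1
        # An empty string in "Valid_DOI" means no correction was identified.
        if row["Valid_DOI"]:
            corrected += 1
        for name in ERROR_CLASSES:
            counts[name] += int(row[name])
    return total, corrected, counts


total, corrected, counts = summarize(io.StringIO(SAMPLE_CSV))
print(f"{corrected} of {total} invalid DOIs have a corrected form")
for name, count in counts.items():
    print(f"{name}: {count}")
```

To run the same summary on the downloaded file, pass an open file handle instead of the in-memory sample, for example `summarize(open(path, newline="", encoding="utf-8"))`, where `path` is wherever the CSV was saved locally.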
Created:
2021-06-08



