GLAM-Workbench/trove-newspapers-corrections
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/12507383
Description:
Current version: v2.1
Users can correct OCR errors in Trove's digitised newspapers. To help understand patterns in newspaper correction, this dataset records the number of articles with corrections.
The data was extracted from the Trove API using a notebook from the Trove newspapers section of the GLAM Workbench.
There are three files in the dataset:
corrections_by_year.csv – number of articles corrected in each publication year
corrections_by_category.csv – number of articles corrected in each Trove category
corrections_by_title.csv – number of articles corrected in each newspaper
The files are in CSV format and contain the following fields.
corrections_by_year.csv
term – the publication year
total_results – the number of articles with corrections
total_articles – the total number of articles
proportion – the proportion of articles with corrections
corrections_by_category.csv
term – the category name
total_results – the number of articles with corrections
total_articles – the total number of articles
proportion – the proportion of articles with corrections
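As a rough illustration of how these fields relate, the sketch below recomputes the proportion for each row and sets it beside the stored value. It assumes that the CSV headers match the field names listed above and that proportion is total_results divided by total_articles, expressed as a fraction; neither is confirmed by the dataset description. The sample rows are invented for the example and are not taken from the dataset.

```python
import csv
import io

# Hypothetical sample rows: the numbers are made up for illustration
# and do not come from the actual dataset.
SAMPLE_CSV = """term,total_results,total_articles,proportion
1900,50,200,0.25
1901,30,120,0.25
"""


def recompute_proportions(csv_text):
    """Return a (term, stored, recomputed) tuple for each row."""
    rows = []
    for row in csv.DictReader(io.StringIO(csv_text)):
        corrected = int(row["total_results"])
        total = int(row["total_articles"])
        # Assumption: proportion = corrected articles / all articles.
        recomputed = corrected / total if total else 0.0
        rows.append((row["term"], float(row["proportion"]), recomputed))
    return rows


checked = recompute_proportions(SAMPLE_CSV)
for term, stored, recomputed in checked:
    print(term, stored, round(recomputed, 4))
```

To run the same check on the real files, replace the sample string with the contents of corrections_by_year.csv or corrections_by_category.csv, which share this layout.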
corrections_by_title.csv
id – the Trove identifier of the newspaper title
title – the name of the newspaper
articles_with_corrections – the number of articles with corrections
total_articles – the total number of articles from the newspaper in Trove
percentage_with_corrections – the percentage of articles with corrections
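One possible use of the per-title file is to rank newspapers by their share of corrected articles. The sketch below assumes the CSV headers match the field names listed above. The min_articles threshold and the top_titles helper are hypothetical additions for this example, meant to stop very small titles from dominating the ranking. The sample rows, including the ids and titles, are invented.

```python
import csv
import io

# Hypothetical rows: ids, titles, and counts are invented for illustration.
SAMPLE_CSV = """id,title,articles_with_corrections,total_articles,percentage_with_corrections
1,Example Gazette,40,100,40.0
2,Sample Herald,5,10,50.0
3,Demo Advertiser,300,1000,30.0
"""


def top_titles(csv_text, min_articles=50, limit=2):
    """Rank titles by percentage corrected, skipping very small titles."""
    rows = [
        row
        for row in csv.DictReader(io.StringIO(csv_text))
        if int(row["total_articles"]) >= min_articles
    ]
    # Highest percentage of corrected articles first.
    rows.sort(key=lambda r: float(r["percentage_with_corrections"]), reverse=True)
    return [
        (row["title"], float(row["percentage_with_corrections"]))
        for row in rows[:limit]
    ]


ranking = top_titles(SAMPLE_CSV)
for title, percentage in ranking:
    print(f"{title}: {percentage}%")
```

In this sample, the title with only 10 articles is excluded by the threshold even though it has the highest percentage.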
This repository is part of the GLAM Workbench. If you think this project is worthwhile, you might like to sponsor me on GitHub.
Created:
2024-09-14



