five

GLAM-Workbench/trove-newspapers-corrections

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/12507383
下载链接
链接失效反馈
官方服务:
资源简介:
Current version: v2.1 OCR errors in Trove's digitised newspapers can be corrected by users. To help understand patterns in newspaper correction, this dataset has been created to record information about the number of articles with corrections. The data was extracted from the Trove API using this notebook from the Trove newspapers section of the GLAM Workbench. There are three files in the dataset: corrections_by_year.csv – number of articles corrected in each publication year corrections_by_category.csv – number of articles corrected in each Trove category corrections_by_title.csv – number of articles corrected in each newspaper The files are in CSV format and contain the following fields. corrections_by_year.csv term – the publication year total_results – the number of articles with corrections total_articles – the total number of articles proportion – the proportion of articles with corrections corrections_by_category.csv term – the category name total_results – the number of articles with corrections total_articles – the total number of articles proportion – the proportion of articles with corrections corrections_by_title.csv id – the Trove identitifer of the newsspaper title title – the name of the newspaper articles_with_corrections – the number of articles with corrections total_articles – the total number of articles from the newspaper in Trove percentage_with_corrections – the percentage of articles with corrections This repository is part of the GLAM Workbench. If you think this project is worthwhile, you might like to sponsor me on GitHub.
创建时间:
2024-09-14
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作