Images from Newspaper Navigator predicted as maps, with human corrected labels
Source: NIAID Data Ecosystem (indexed 2026-03-12)
Download link:
https://zenodo.org/record/4156509
Description:
This dataset contains images derived from Newspaper Navigator (news-navigator.labs.loc.gov/), a dataset of images drawn from the Library of Congress Chronicling America collection (chroniclingamerica.loc.gov/).
[The Newspaper Navigator dataset] consists of extracted visual content for 16,358,041 historic newspaper pages in Chronicling America. The visual content was identified using an object detection model trained on annotations of World War 1-era Chronicling America pages, including annotations made by volunteers as part of the Beyond Words crowdsourcing project.
source: https://news-navigator.labs.loc.gov/
Newspaper Navigator assigns each piece of extracted visual content to a category, one of which is 'maps'. The original training data for Newspaper Navigator contained relatively few labelled examples of maps. The predictions for maps have an average precision of 69.5%, and the validation data contains 34 map images.
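To make the average precision figure concrete, here is a minimal Python sketch of one common, simplified way to compute it for a single category. It assumes the predictions are already sorted by descending confidence and already marked as correct or incorrect. The function name `average_precision` and the example values are hypothetical, and the evaluation actually used for Newspaper Navigator may differ (for example, in how predicted boxes are matched to annotations or in how missed maps are counted).

```python
def average_precision(ranked_hits):
    """Simplified, non-interpolated average precision.

    ranked_hits: booleans ordered from most to least confident prediction;
    True means the prediction was correct. This sketch assumes every true
    item appears somewhere in the ranking, so missed items are not penalised.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_hit in enumerate(ranked_hits, start=1):
        if is_hit:
            hits += 1
            # Precision among the top `rank` predictions at this correct hit.
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


# Hypothetical ranking: correct, wrong, correct.
ap = average_precision([True, False, True])
print(round(ap, 4))
```

A ranking that places correct predictions first scores close to 1.0; interleaving incorrect predictions among the confident ones lowers the score.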
This dataset contains a sample of the images that Newspaper Navigator predicted as 'maps'. It also includes human-corrected labels indicating whether each predicted map image is a 'map' or 'not a map'.
The data is organised as follows:
`newspaper_maps.zip` contains the images themselves.
`2020_30_10_13_19_228_sample.json` contains metadata about each image, drawn from the Newspaper Navigator dataset.
`map_labels.csv` contains the labels for the images as a CSV file.
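As a rough illustration of how the label file might be used, the Python sketch below tallies the labels in a CSV and reports the share of predicted maps that were confirmed as maps. The column name `label`, the label strings, and the example rows are assumptions made for illustration only; the description does not document the real header of map_labels.csv, so check it before adapting this.

```python
import csv
import io
from collections import Counter


def count_labels(csv_text, label_column="label"):
    """Tally the values found in one column of a CSV supplied as text.

    `label_column` is a hypothetical column name; replace it with the
    actual header used in map_labels.csv.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row[label_column].strip() for row in reader)


def share_confirmed(counts, positive="map"):
    """Fraction of labelled images whose label equals `positive`."""
    total = sum(counts.values())
    return counts[positive] / total if total else 0.0


# Hypothetical rows standing in for the real file contents.
example = "file,label\na.jpg,map\nb.jpg,not a map\nc.jpg,map\n"
counts = count_labels(example)
print(counts)
print(share_confirmed(counts))
```

With the real file, the text could be read with `open("map_labels.csv", encoding="utf-8").read()` and passed to `count_labels` together with the correct column name.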
Created:
2021-03-15



