Viral Culture in Early Nineteenth-Century Europe newspaper dataset
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/6697270
下载链接
链接失效反馈官方服务:
资源简介:
Dataset produced during the project Viral Culture in Early Nineteenth-Century Europe.
The project traced text reuse by analysing large OCR'd newspaper collections using a BLAST based algorithm. This algorithm produces text clusters.
This dataset contains two produced cluster datasets based on two different data collections.
For the first dataset, the Austrian ANNO newspaper collection, this dataset contains metadata describing the used newspapers.
For the second dataset, German-language newspapers in the Europeana collection, this dataset contains project produced metadata describing the newspapers used by the project, as well as the OCR's content for these newspaper issues. The OCR is produced with Tesseract OCR from digital page images downloaded from the Europeana services.
创建时间:
2024-07-16



