five

Viral Culture in Early Nineteenth-Century Europe newspaper dataset

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/6697270
下载链接
链接失效反馈
官方服务:
资源简介:
Dataset produced during the project Viral Culture in Early Nineteenth-Century Europe. The project traced text reuse by analysing large OCR'd newspaper collections using a BLAST based algorithm. This algorithm produces text clusters. This dataset contains two produced cluster datasets based on two different data collections. For the first dataset, the Austrian ANNO newspaper collection, this dataset contains metadata describing the used newspapers. For the second dataset, German-language newspapers in the Europeana collection, this dataset contains project produced metadata describing the newspapers used by the project, as well as the OCR's content for these newspaper issues. The OCR is produced with Tesseract OCR from digital page images downloaded from the Europeana services.
创建时间:
2024-07-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作