Open Newspapers (HMD14) Full Text and Metadata

Source: DataCite Commons. Updated 2025-07-29. Indexed 2026-02-08.
Download link:
https://bl.iro.bl.uk/concern/datasets/2800eb7d-8b49-4398-a6e9-c2c5692a1304
Description:
Full text and metadata of the first 14 newspaper titles digitised by Heritage Made Digital (HMD), after processing by the Alto2Text pipeline created by the Living with Machines project. The pipeline took the highly verbose AltoXML files of these newspaper titles as input and converted them into a more readable plain-text format for the benefit of readers and researchers.

This includes the following titles:
- Colored News
- National Register
- The British Press; or, Morning Literary Advertiser
- The Express
- Liverpool Standard and General Commercial Advertiser; Liverpool Standard and General Advertiser; Liverpool Standard and General Commercial Advertiser
- The Northern Daily Times (Liverpool); Northern Times; The Daily Times
- The Press
- The Star
- The Statesman
- The Sun

Individual datasets for each of these titles are also available within the BL Research Repository, so you do not need to download the full HMD14 dataset: https://bl.iro.bl.uk/catalog?f%5Bkeyword_sim%5D%5B%5D=HMD14
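
To illustrate the kind of conversion the description refers to, here is a minimal Python sketch that flattens an ALTO XML fragment into plain text. It is not the Alto2Text pipeline and makes no claim about how that pipeline works. It only assumes the standard ALTO layout, in which words are stored in the CONTENT attribute of String elements nested inside TextLine elements. The function names and the sample fragment are hypothetical and are not taken from the dataset.

```python
import xml.etree.ElementTree as ET


def local_name(tag: str) -> str:
    """Strip any XML namespace prefix, e.g. '{http://...}String' -> 'String'."""
    return tag.rsplit("}", 1)[-1]


def alto_to_text(alto_xml: str) -> str:
    """Join the CONTENT of each String element, one output line per TextLine."""
    root = ET.fromstring(alto_xml)
    lines = []
    for element in root.iter():
        if local_name(element.tag) != "TextLine":
            continue
        words = [
            child.attrib.get("CONTENT", "")
            for child in element
            if local_name(child.tag) == "String"
        ]
        lines.append(" ".join(word for word in words if word))
    return "\n".join(lines)


# Invented sample fragment for illustration only (not real HMD14 data).
SAMPLE = """<alto xmlns="http://www.loc.gov/standards/alto/ns-v2#">
  <Layout><Page><PrintSpace><TextBlock>
    <TextLine>
      <String CONTENT="THE"/><SP/><String CONTENT="STAR"/>
    </TextLine>
    <TextLine>
      <String CONTENT="London,"/><SP/><String CONTENT="Monday"/>
    </TextLine>
  </TextBlock></PrintSpace></Page></Layout>
</alto>"""

if __name__ == "__main__":
    print(alto_to_text(SAMPLE))
```

Matching on local element names keeps the sketch independent of the ALTO schema version, since the namespace URI differs between versions. A real conversion would also need to handle hyphenation, article segmentation, and metadata, all of which are omitted here.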
Provider:
British Library
Created:
2025-06-17