five

llustrated London News Illustration Dataset (1842-1890)

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14169698
下载链接
链接失效反馈
官方服务:
资源简介:
Description This dataset contains comprehensive metadata for 72,081 illustrations extracted from the Illustrated London News (ILN) between 1842-1890. The ILN was the first and most influential illustrated newspaper of the Victorian era, making this dataset a valuable resource for researchers in digital humanities, media history, and visual culture studies. The dataset provides detailed information about each illustration, enabling large-scale analysis of Victorian visual culture and the evolution of newspaper illustration practices.  Content The dataset consists of a CSV file containing the following information for each illustration: Publication date (YYYY-MM-DD format) Volume and issue number Page number within issue Bounding box coordinates (in YOLO format) Model confidence score from the detection model llustration sequence number on page (indicating reading order) OCR-extracted caption text Original Internet Archive item identifier Page URL for accessing the original scan This dataset also contains a pt file with multimodal embeddings (Open-CLIP) for all the illustrations Methods The illustrations were systematically extracted using several computational steps: Collection of 56,699 digitized pages from the Internet Archive's Serials in Microfilm Collection Fine-tuning of YOLOv8 object detection model on 908 manually annotated pages (mAP50: 0.964, mAP95: 0.92) Automated extraction of illustrations using the fine-tuned model Caption text extraction using Tesseract OCR Generation of multimodal embeddings using LAION OpenCLIP model (ViT-L-14-DataComp.XL-s13B-b90K Code Availability All code used to create this dataset is available in two GitHub repositories: Repository: https://github.com/tpsmi/multimodaliln  Jupyter notebooks for downloading ILN pages YOLOv8 fine-tuning code Illustration extraction pipeline OCR processing scripts Embedding generation code Repository: https://github.com/tpsmi/ilnmultimodalsearch  Multimodal search implementation Text-to-image and image-to-image retrieval User interface code API endpoints for search functionality Original Data Source The original page scans are freely available through the Internet Archive's Serials in Microfilm Collection. This dataset builds upon these public domain materials by providing structured metadata and computational annotations. Citation Please cite our dataset paper (will be added) or this dataset (Zenodo DOI) Related Publications [Publication details when available]
创建时间:
2024-11-18
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作