Teklia/Newspapers-finlam
收藏Hugging Face2025-02-17 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Teklia/Newspapers-finlam
下载链接
链接失效反馈官方服务:
资源简介:
Finlam数据集包含了从19至20世纪的149份法文报纸,每份报纸包含多张页面。页面图像被调整到固定高度2000像素。每个页面包含多个区域,每个区域包含多边形坐标、文本、类别和阅读顺序等信息。数据集分为训练集、验证集和测试集,其中大部分报纸为法文,也包含一些英文报纸。
The Finlam dataset includes 149 French newspapers from the 19th to 20th centuries. Each newspaper contains multiple pages with images resized to a fixed height of 2000 pixels. Each page consists of multiple zones, each with polygon coordinates, text, class, and reading order information. The dataset is split into training, validation, and test sets, predominantly in French, with some newspapers in English.
提供机构:
Teklia



