five

Multilingual Character Recognition Dataset for Moroccan Official Documents

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://data.mendeley.com/datasets/xp3hrmywfm
下载链接
链接失效反馈
官方服务:
资源简介:
the printed dataset with standard fonts of the characters that are used in Moroccan official documents are not available in internet with open-source license, specially Tifinagh and Arabic languages, which made us build new raw dataset, where we collected the most used fonts, then based on them we built 6 datasets; which is: Alphabet (contains the alphabet for a to z in lowercase and uppercase), digits (contains the numbers from 0 to 9), Arabic (contains the whole letters), Tifinagh (contains the all Tifinagh letters), French special characters such as “à, é, ç, è…” (contains the all special characters of French language), Symbols such as “?, !, (, )…”, in order to make a data augmentation, we generate more than one character with the same font.
创建时间:
2023-11-01
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作