
Muharaf-public

Source: Mendeley Data. Updated 2024-06-22. Indexed 2024-06-28.
Download link:
https://zenodo.org/records/11492215
Description:
Manuscripts of Handwritten Arabic dataset (Muharaf) for cursive text recognition. The following files are present in this repository:

- public_data_files.zip: Contains the public part of the Muharaf dataset. It has the images and the corresponding annotation files in JSON and XML format.
- public_line_images.zip: Contains the line images and their corresponding transcriptions.
- public_summary_and_keywords.zip: Contains the summary and keywords extracted from the ground-truth transcriptions of each image.
- sfr_files.zip: Contains the preprocessed files for training the start_follow_read_arabic system on the public part of the Muharaf dataset.
- public_1100_untrained.zip: Contains an initialized trial folder with 3 different random splits of (train, validation, test) to reproduce the experiments reported in the paper on Muharaf-public.
- public_1100_trained.zip: Contains the results and model weights after training on Muharaf-public. It has results for three different random splits of the (train, validation, test) sets.
- trial_15_untrained.zip: Contains an initialized trial folder with 3 different random splits of (train, validation, test) to reproduce the experiments reported in the paper on training with all files of the Muharaf dataset (1500 training images).
- trial_15.zip: Contains the results and model weights after training on Muharaf. It has results for three different random splits of the (train, validation, test) sets.
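Because the description says only that the archives hold images with JSON and XML annotations, it may help to check what a downloaded archive actually contains before writing a loader. The following is a minimal sketch, not part of the dataset's own tooling: the function name `summarize_archive` is hypothetical, and it assumes the archive has already been downloaded manually from the Zenodo record to a local path.

```python
import zipfile
from collections import Counter
from pathlib import PurePosixPath


def summarize_archive(zip_path):
    """Count the files in a zip archive by lowercase extension.

    Hypothetical helper: it makes no assumption about the internal
    folder layout of the Muharaf archives, it only tallies suffixes.
    """
    counts = Counter()
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # skip directory entries, count files only
            suffix = PurePosixPath(info.filename).suffix.lower()
            counts[suffix or "(no extension)"] += 1
    return dict(counts)
```

For example, `summarize_archive("public_data_files.zip")` would return a dictionary mapping each extension to a file count, which gives a quick check that the image and annotation counts line up. The local file name here is an assumption based on the archive names listed above.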
Created:
2024-06-19