
AHT2D dataset

Source: IEEE, indexed 2026-04-17
Download link:
https://ieee-dataport.org/documents/aht2d-dataset
Description:
The AHT2D dataset consists of handwritten Arabic letters with diacritics. It has 28 classes, one per letter of the Arabic alphabet, and each class contains multiple forms of its letter. The images come from several sources, including the internet and our own writers. The dataset includes only isolated letters. It covers a variety of writing styles, orientations, colors, stroke thicknesses, sizes, and backgrounds, which makes it large and rich. AHT2D is organized into three files (training, testing, and validation), which facilitates text detection and recognition tasks and helps achieve good results. The split is as follows. The training set holds 80% of all images. The remaining 20% is divided between testing and validation: the test set takes 80% of the remaining images (16% of the total), and the validation set takes the other 20% (4% of the total). Finally, a processing step enhances and augments the images and the data.
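The two-stage split described above can be illustrated with a short sketch. This is not the authors' code: the function name `split_80_16_4`, the shuffling step, and the fixed seed are assumptions made for illustration, and the dataset itself already ships in its three parts. The sketch only shows how taking 80% for training and then 80% of the remainder for testing works out to roughly 80/16/4 of the total.

```python
import random


def split_80_16_4(items, seed=0):
    """Hypothetical helper: split items into train/test/validation.

    Stage 1: 80% of all items go to training.
    Stage 2: 80% of the remaining items go to testing,
             and whatever is left goes to validation.
    """
    shuffled = list(items)
    # Shuffling with a fixed seed is an assumption, made for reproducibility.
    random.Random(seed).shuffle(shuffled)

    n_total = len(shuffled)
    n_train = n_total * 80 // 100        # 80% of the total
    n_remaining = n_total - n_train      # the other 20%
    n_test = n_remaining * 80 // 100     # 80% of the remainder (16% of total)

    train = shuffled[:n_train]
    test = shuffled[n_train:n_train + n_test]
    val = shuffled[n_train + n_test:]    # rest of the remainder (4% of total)
    return train, test, val


if __name__ == "__main__":
    train, test, val = split_80_16_4(range(1000))
    print(len(train), len(test), len(val))
```

With 1,000 placeholder items, the three parts hold 800, 160, and 40 items. Integer division is used so the three parts always add up to the full set; for totals that do not divide evenly, the validation part absorbs the leftover items.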
Provided by:
OUALI, Imene; WALI, Ali; BEN HALIMA, Mohamed