
Khayyam Offline Persian Handwriting Dataset

Source: arXiv. Updated 2024-06-03; indexed 2024-08-06.
Download link:
http://arxiv.org/abs/2406.01025v1
Description:
The Khayyam Offline Persian Handwriting Dataset is a large, unconstrained Persian handwriting dataset created by Pourya Jafarzadeh et al. It comprises 44,000 words, 60,000 letters, and 6,000 digits, written by 400 native Persian speakers. The dataset was developed to address the shortage of data for Persian handwriting recognition, especially at the word and sentence levels. Several forms were designed to collect handwriting samples in different formats, and the completed forms were scanned at high resolution to ensure data quality. The dataset is suited to machine learning and handwriting recognition research, particularly work on challenges specific to Persian, such as the diversity of letter forms and variation in writing styles.
Provider:
Not specified
Created:
2024-06-03