Khayyam Offline Persian Handwriting Dataset
Source: arXiv · Updated: 2024-06-03 · Indexed: 2024-08-06
Download link:
http://arxiv.org/abs/2406.01025v1
Description:
The Khayyam Offline Persian Handwriting Dataset is a large, unconstrained Persian handwriting dataset created by Pourya Jafarzadeh et al. It contains 44,000 words, 60,000 letters, and 6,000 digits written by 400 native Persian speakers. The dataset is intended to address the shortage of data for Persian handwriting recognition, particularly at the word and sentence levels. To build it, the authors designed several forms to collect different kinds of handwriting samples and used high-resolution scanning to ensure data quality. It is suited to machine learning and handwriting recognition research, especially work on challenges specific to Persian, such as the variety of letter forms and variation in writing style.
Provider:
Not specified
Created:
2024-06-03



