Handwriting Recognition
收藏阿里云天池2026-05-16 更新2024-03-07 收录
下载链接:
https://tianchi.aliyun.com/dataset/94124
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含通过慈善项目收集的超过40万个手写姓名。字符识别利用图像处理技术将扫描文档上的字符转换为数字形式。它通常在机器打印的字体中表现良好。然而,由于个人书写风格的巨大差异,对于机器识别手写字符仍然提出了艰巨的挑战。总共有206,799个姓氏和207,024个姓氏。数据分别分为训练集(331,059),测试集(41,382)和验证集(41,382)。
This dataset contains over 400,000 handwritten names collected via charity initiatives. Character recognition leverages image processing techniques to convert characters on scanned documents into digital formats. It typically achieves strong performance on machine-printed fonts. However, owing to the substantial disparities in individual handwriting styles, handwritten character recognition remains a formidable challenge for automated recognition systems. The dataset comprises a total of 206,799 surnames and 207,024 given names, and is split into three subsets respectively: a training set with 331,059 samples, a test set with 41,382 samples, and a validation set with 41,382 samples.
提供机构:
阿里云天池
创建时间:
2021-03-12
搜集汇总
数据集介绍

背景与挑战
背景概述
Handwriting Recognition数据集包含40多万个手写姓名图像,分为训练集、测试集和验证集,用于字符识别研究。数据通过慈善项目收集,覆盖广泛的个人书写风格,适合研究手写字符识别的挑战。
以上内容由遇见数据集搜集并总结生成



