Irregular Handwritten Chinese Character Dataset (IHCCD)
DataCite Commons | Updated: 2025-04-27 | Indexed: 2025-04-16
Download link:
https://www.scidb.cn/detail?dataSetId=7d002b115702442da420c4b83be59ad1
Description:
This article presents the first irregular handwritten Chinese character dataset (IHCCD), which covers 3,755 character categories with 30 samples per category. The characters were handwritten by different non-standard handwriting practitioners on A4 printing paper, and a scanner was used as the input device to convert the handwritten samples into digital images. During collection, the writers were not required to follow the standard stroke order of Chinese characters. They could freely adjust the thickness, length, and position of strokes, enlarge or shrink radicals arbitrarily, change the slant of the characters, distort character shapes, and disrupt spatial structures, all with the goal of evading current text recognition engines. Experiments indicate that the dataset provides a foundation for advancing non-standard handwritten Chinese character recognition and supports further research and development in this field.
Provider:
Science Data Bank
Created:
2025-03-20
Compiled summary
Dataset introduction

Background and challenges
Background overview
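IHCCD is a dataset dedicated to irregular handwritten Chinese characters. It contains 3,755 categories with 30 samples each, for a total of 112,650 samples, collected by scanning handwriting written on A4 paper. Its key feature is that writers were free to alter strokes, radicals, and character structure in order to simulate handwriting variants that standard recognition engines struggle to process. The dataset is intended to promote research and development of non-standard handwritten Chinese character recognition.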
The content above was collected and summarized by 遇见数据集.
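
The sample count quoted in the overview follows directly from the category and per-category figures. The sketch below checks that arithmetic and, purely as an illustration, shows how a per-category train/test split would scale to the whole dataset. The function names and the 24/6 split are hypothetical; the listing does not describe any official split or file layout.

```python
# Figures stated in the dataset description.
NUM_CATEGORIES = 3755
SAMPLES_PER_CATEGORY = 30


def total_samples(num_categories: int, samples_per_category: int) -> int:
    """Total number of images when every category has the same sample count."""
    return num_categories * samples_per_category


def split_totals(num_categories: int, samples_per_category: int,
                 train_per_category: int) -> tuple[int, int]:
    """Dataset-wide (train, test) totals for a fixed per-category split.

    The split itself is a hypothetical choice, not part of IHCCD.
    """
    if not 0 <= train_per_category <= samples_per_category:
        raise ValueError("train_per_category must lie in [0, samples_per_category]")
    test_per_category = samples_per_category - train_per_category
    return (num_categories * train_per_category,
            num_categories * test_per_category)


if __name__ == "__main__":
    print(total_samples(NUM_CATEGORIES, SAMPLES_PER_CATEGORY))
    # Assumed split: 24 training and 6 test samples per category.
    print(split_totals(NUM_CATEGORIES, SAMPLES_PER_CATEGORY, 24))
```

Splitting per category rather than over the pooled images keeps every one of the 3,755 characters represented in both subsets, which matters when each category has only 30 samples.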



