five

Gujarat-HWD: A Dataset of Gujarati Handwritten Digits

收藏
Mendeley Data2026-04-18 收录
下载链接:
https://data.mendeley.com/datasets/44std32rxb
下载链接
链接失效反馈
官方服务:
资源简介:
The Gujarat-HWD dataset offers a comprehensive collection of 11,000 images of handwritten digits in the Gujarati language, representing numerals from 0 to 9. This dataset is primarily designed for applications in optical character recognition (OCR), machine learning, and deep learning, with a particular focus on regional language processing. Gujarati, one of the most widely spoken languages in India, uses a script distinct from Devanagari. However, despite its extensive use, there is a notable lack of publicly available datasets for Gujarati handwritten digits. The Gujarat-HWD dataset bridges this gap by providing a clean, labelled, and diverse set of images that can aid researchers and developers in building effective recognition models for regional scripts. The dataset has been developed through a systematic process involving the collection, scanning, and preprocessing of handwritten digit samples provided by more than 350 individuals from various age groups and educational backgrounds. It is well-suited for training and evaluating classification models using standard convolutional neural network (CNN) architectures. Additionally, the dataset can be extended for use in cross-lingual digit recognition, handwriting analysis, and other regional OCR systems.
创建时间:
2025-07-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作