five

Sanskrit Character Dataset

收藏
DataCite Commons2025-01-19 更新2025-04-16 收录
下载链接:
https://ieee-dataport.org/documents/sanskrit-character-dataset
下载链接
链接失效反馈
官方服务:
资源简介:
The "Sanskrit Character Dataset" includes 44 classes of handwritten Sanskrit characters, designed to support research in optical character recognition (OCR) and machine learning for ancient languages. Each class represents a unique Sanskrit letter, collected in various handwriting styles to ensure diversity and robustness. For each class, 50 to 80 images are included. To ensure diversity and real-world applicability, the letters were written in various handwriting styles. The dataset is designed to facilitate research in the field of ancient script recognition, particularly focusing on handwriting variability and pattern recognition. To create this dataset, 8 students each handwrote samples for all 44 classes of Sanskrit characters. Afterward, we carefully photographed each image.

梵文字符数据集(Sanskrit Character Dataset)包含44类手写梵文字符,旨在支撑古语言领域的光学字符识别(Optical Character Recognition,OCR)与机器学习研究。每一类对应一个独特的梵文字母,采集过程涵盖多种手写风格,以保障数据集的多样性与鲁棒性。每类包含50至80张图像样本。为进一步提升数据集的多样性与实际应用适配性,所有梵文字母均采用多样化的手写笔迹完成书写。本数据集旨在推动古文字识别领域的研究,尤其聚焦于手写变体与模式识别方向。为构建该数据集,共邀请8名学生手写全部44类梵文字符的样本,随后对每一张图像进行了精心拍摄。
提供机构:
IEEE DataPort
创建时间:
2025-01-19
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作