five

TibOCR-Bench: A Comprehensive Benchmark and Training Pipeline for Tibetan Multimodal OCR

收藏
DataCite Commons2025-08-11 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=4eb05a496c554d4791bc23aee3203c42
下载链接
链接失效反馈
官方服务:
资源简介:
To effectively support the training and evaluation of Tibetan OCR models in practical application scenarios involving multiple fonts and complex text structures, we have constructed a multi-source, high-quality Tibetan text image dataset. The overall data construction includes two complementary strategies: forward construction and reverse construction. (1) Positive construction: Firstly, collect Tibetan language images in real scenes, and then manually annotate the corresponding text content. This method ensures the authenticity and practical relevance of the data, effectively covering the diverse language usage scenarios and inherent complexity in Tibetan OCR tasks. (2) Reverse construction: Firstly, select text content suitable for OCR tasks (such as advertising slogans, slogans, or standard documents), then choose appropriate background images and use multiple fonts and visual effects to synthesize the text image dataset. This method efficiently enhances the structural diversity and scale of the dataset. These two strategies complement each other and together form a comprehensive resource library for training and evaluating Tibetan OCR models.
提供机构:
Science Data Bank
创建时间:
2025-08-11
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作