Kurdish Scene Text Recognition Version 1.0 (KSTRV1) Dataset
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/15038952
下载链接
链接失效反馈官方服务:
资源简介:
KSTRV1 (Kurdish STR) Version 1
KSTRV1 is a large-scale dataset developed for Kurdish Scene Text Recognition (KSTR), addressing the scarcity of resources for non-Latin script like Kurdish. It includes 1,420 real-world scene images and 17,412 extracted word-level samples across Kurdish (Sorani and Badini dialects), Arabic, and English. To expand coverage and improve generalizability, the dataset is augmented with 20,000 synthetic text examples, crafted with diverse typography, multi-angle orientations, simulated distortions, and intricate background textures. This synthesis enhances the dataset’s capacity to handle real-world variability, supporting robust training for text recognition systems in underrepresented languages.
创建时间:
2025-03-17



