TS-TR
收藏TS-TR: Turkish Scene Text Recognition Dataset
关于数据集
土耳其场景文本识别(TS-TR)数据集主要用于填补非英语文本识别资源的空白,特别是针对土耳其语言特有的挑战,如特殊字符和变音符号。该数据集模拟了现实世界中的条件,文本以各种字体、大小、方向和复杂背景显示,来自多个城市和农村环境。这种多样性确保了模型在不同场景下的泛化能力,包括不同的光照条件和复杂的视觉布局。
引用
如果您发现此工作有用,请引用我们的论文:
bibtex @article{YILDIZ2024101881, title = {Turkish scene text recognition: Introducing extensive real and synthetic datasets and a novel recognition model}, journal = {Engineering Science and Technology, an International Journal}, volume = {60}, pages = {101881}, year = {2024}, issn = {2215-0986}, doi = {https://doi.org/10.1016/j.jestch.2024.101881}, url = {https://www.sciencedirect.com/science/article/pii/S2215098624002672}, author = {Serdar Yıldız}, keywords = {Scene text recognition dataset, Synthetic scene text recognition dataset, Patch masking, Position attention, Vision transformers}, }
下载
Kaggle




