STS-TR
收藏STS-TR: Synthetic Turkish Scene Text Recognition Dataset
概述
STS-TR数据集是一个综合的合成数据集,旨在补充真实的土耳其场景文本识别(TS-TR)数据集。该数据集包含超过1200万个合成样本,模拟了各种文本场景。它包括多种土耳其单词和短语,以不同的字体、大小和方向渲染在通用背景场景上,并添加了阴影、模糊和环境失真等现实效果。该数据集增强了针对土耳其语言的模型的训练数据可用性。
示例

引用
如果发现此工作有用,请引用我们的论文: bibtex @article{YILDIZ2024101881, title = {Turkish scene text recognition: Introducing extensive real and synthetic datasets and a novel recognition model}, journal = {Engineering Science and Technology, an International Journal}, volume = {60}, pages = {101881}, year = {2024}, issn = {2215-0986}, doi = {https://doi.org/10.1016/j.jestch.2024.101881}, url = {https://www.sciencedirect.com/science/article/pii/S2215098624002672}, author = {Serdar Yıldız}, keywords = {Scene text recognition dataset, Synthetic scene text recognition dataset, Patch masking, Position attention, Vision transformers}, }
下载




