chengyewang/TexOCR-RL-figures
收藏Hugging Face2026-04-22 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/chengyewang/TexOCR-RL-figures
下载链接
链接失效反馈官方服务:
资源简介:
TexOCR:用于可编译页面到LaTeX重建的文档OCR模型基准测试与推进[ACL 2026 Main]。该存储库提供了TexOCR RL训练中使用的图像数据。数据集名称为TexOCR_RL_figures,类型为训练图像,训练方法为GRPO(强化学习),任务是文档OCR到LaTeX生成。
TexOCR: Benchmarking and Advancing Document OCR Models for Compilable Page-to-LaTeX Reconstruction [ACL 2026 Main]. This repository provides the figure/image data used in TexOCR RL training. Dataset: TexOCR_RL_figures, Type: Training Images, Training Method: GRPO (Reinforcement Learning), Task: Document OCR → LaTeX generation.
提供机构:
chengyewang



