five

krutrim-ai-labs/ocr_rotation_bench

收藏
Hugging Face2026-02-24 更新2026-04-05 收录
下载链接:
https://hf-mirror.com/datasets/krutrim-ai-labs/ocr_rotation_bench
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: other license_name: krutrim-community-license-agreement-version-1.0 license_link: LICENSE.md task_categories: - zero-shot-classification language: - en - hi - bn - as - ml - mr - ta - pa - or - te - kn - gu tags: - image - OCR - rotation - classification pretty_name: seeingstraight --- # ORB Rotation Benchmark This repository contains the dataset for **“Seeing Straight: Document Orientation Detection for Efficient OCR”** (arXiv:2511.04161). ORB Rotation Benchmark is a structured multi-domain dataset designed to evaluate document orientation detection models and their impact on downstream OCR performance. Unlike prior document datasets that assume correct orientation, this benchmark explicitly introduces controlled rotation settings (4-class, 8-class, and 12-class configurations) across English and Indic document datasets, enabling systematic evaluation of rotation classification accuracy and OCR robustness. The benchmark supports: - Rotation classification benchmarking - OCR performance evaluation with/without rotation correction - Angle-wise error analysis - Multilingual document evaluation --- ## Dataset Structure The dataset consists of multiple splits corresponding to different language settings and rotation granularities. ### Splits - `orb_en_4_class_sroie` - `orb_en_4_class_synthdog` - `orb_en_12_class` - `orb_indic_4_class` - `orb_indic_8_class` Each split is stored as a Parquet file and is directly compatible with the Hugging Face Dataset Viewer. --- ## Data Format Each dataset item contains: | Field | Description | |-------|------------| | `image` | Document image | | `rotation_label` | Rotation angle (integer degrees) | | `ground_truth` | OCR ground-truth text (when available) | ### Rotation Settings - 4-class: {0°, 90°, 180°, 270°} - 8-class: evenly spaced rotations - 12-class: finer-grained rotation classification --- ## Subset Description ### orb_en_4_class_sroie English receipt dataset (SROIE-based) with 4 rotation classes and field-level OCR annotations. ### orb_en_4_class_synthdog Synthetic English document dataset with 4 rotation classes and OCR ground truth. ### orb_en_12_class English dataset with finer-grained 12-class rotation labels. ### orb_indic_4_class Indic-language documents with 4 rotation classes. ### orb_indic_8_class Indic-language documents with 8 rotation classes for higher granularity evaluation. --- ## Using the Dataset Example usage: ```python from datasets import load_dataset ds = load_dataset("krutrim-ai-labs/orb_rotation_benchmark", split="orb_en_4_class_sroie") print(ds[0]) ``` ## This dataset is intended for: - Document orientation detection research - OCR robustness evaluation - Vision-language model benchmarking - Multilingual document understanding - Angle-wise performance analysis - Evaluation Protocol ## The dataset enables: - Rotation classification accuracy - Angle-wise breakdown analysis - OCR CER/WER before and after rotation correction - Cross-language performance comparison For full evaluation pipelines and experimental setup, refer to the paper. ## The official inference and evaluation pipeline is described in: "Seeing Straight: Document Orientation Detection for Efficient OCR" arXiv: https://arxiv.org/abs/2511.04161 ### License This repository is licensed under the Krutrim Community License. ### Citation If you use this dataset, please cite: ```bibtex @article{goswami2025seeing, title={Seeing Straight: Document Orientation Detection for Efficient OCR}, author={Suranjan Goswami and Abhinav Ravi and Raja Kolla and Ali Faraz and Shaharukh Khan and Akash and Chandra Khatri and Shubham Agarwal}, year={2025}, eprint={2511.04161}, archivePrefix={arXiv}, primaryClass={cs.AI}, url={https://arxiv.org/pdf/2511.04161}, } ```
提供机构:
krutrim-ai-labs
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作