
surgeai/GDP.pdf

Source: Hugging Face. Updated 2026-04-14; indexed 2026-05-10.
Download link:
https://hf-mirror.com/datasets/surgeai/GDP.pdf
Description:
---
license: apache-2.0
task_categories:
- document-question-answering
tags:
- benchmark
- pdf-parsing
- document-understanding
pretty_name: GDP.pdf
---

# GDP.pdf

A benchmark for evaluating PDF parsing capabilities of frontier LLMs.

## Overview

| Stat | Value |
|------|-------|
| Total examples | 50 |
| Unique PDFs | 50 |
| Rubric criteria | up to 30 each |

Each example pairs a **PDF document** with a **prompt** and a set of **rubric criteria** that define what a correct response should contain.

## Dataset Structure

### Columns

| Column | Description |
|--------|-------------|
| `pdf_path` | Relative path to the PDF file in this repo |
| `prompt` | The prompt / question for the model |
| `rubric - 1. criterion` | Rubric criterion |
| `rubric - 2. criterion` | Rubric criterion |
| `rubric - 3. criterion` | Rubric criterion |
| `rubric - 4. criterion` | Rubric criterion |
| `rubric - 5. criterion` | Rubric criterion |
| `rubric - 6. criterion` | Rubric criterion |
| `rubric - 7. criterion` | Rubric criterion |
| `rubric - 8. criterion` | Rubric criterion |
| `rubric - 9. criterion` | Rubric criterion |
| `rubric - 10. criterion` | Rubric criterion |
| `rubric - 11. criterion` | Rubric criterion |
| `rubric - 12. criterion` | Rubric criterion |
| `rubric - 13. criterion` | Rubric criterion |
| `rubric - 14. criterion` | Rubric criterion |
| `rubric - 15. criterion` | Rubric criterion |
| `rubric - 16. criterion` | Rubric criterion |
| `rubric - 17. criterion` | Rubric criterion |
| `rubric - 18. criterion` | Rubric criterion |
| `rubric - 19. criterion` | Rubric criterion |
| `rubric - 20. criterion` | Rubric criterion |
| `rubric - 21. criterion` | Rubric criterion |
| `rubric - 22. criterion` | Rubric criterion |
| `rubric - 23. criterion` | Rubric criterion |
| `rubric - 24. criterion` | Rubric criterion |
| `rubric - 25. criterion` | Rubric criterion |
| `rubric - 26. criterion` | Rubric criterion |
| `rubric - 27. criterion` | Rubric criterion |
| `rubric - 28. criterion` | Rubric criterion |
| `rubric - 29. criterion` | Rubric criterion |
| `rubric - 30. criterion` | Rubric criterion |
| `domain` | - |

### Files

- `data.parquet`: metadata table
- `pdfs/`: the PDF documents (stored via Git LFS)

## Usage

```python
from datasets import load_dataset
from huggingface_hub import hf_hub_download

ds = load_dataset("surgeai/GDP.pdf")

# Access a row
row = ds["train"][0]
print(row["prompt"])

# Download the corresponding PDF
pdf_local = hf_hub_download(
    repo_id="surgeai/GDP.pdf",
    filename=row["pdf_path"],
    repo_type="dataset",
)
```

## Evaluation

For each example, pass the PDF and prompt to your system, then score the response against the rubric columns. Each rubric column defines a criterion that should be checked independently.

## License

apache-2.0
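
The evaluation procedure described above (check each rubric criterion independently) could be sketched roughly as follows. This is a minimal illustration, not the benchmark's official scorer. The helper names `rubric_criteria`, `score_response`, and `keyword_judge` are hypothetical, and the sketch makes three assumptions the README does not confirm: that unused rubric slots are `None` or blank, that the score is an unweighted fraction of satisfied criteria, and that a judge is a callable returning a boolean. In practice the judge would likely be a human grader or an LLM call rather than the substring check shown here.

```python
import re
from typing import Callable, Mapping

# Column names follow the pattern in the Columns table,
# e.g. "rubric - 1. criterion" through "rubric - 30. criterion".
RUBRIC_COLUMN = re.compile(r"^rubric - (\d+)\. criterion$")


def rubric_criteria(row: Mapping[str, object]) -> list[str]:
    """Return the non-empty rubric criteria of a row, in numeric order."""
    numbered = []
    for name, value in row.items():
        match = RUBRIC_COLUMN.match(name)
        # Assumption: unused rubric slots are None or blank strings.
        if match and isinstance(value, str) and value.strip():
            numbered.append((int(match.group(1)), value.strip()))
    return [text for _, text in sorted(numbered)]


def score_response(
    row: Mapping[str, object],
    response: str,
    judge: Callable[[str, str], bool],
) -> float:
    """Fraction of criteria the judge accepts, each checked independently.

    Assumption: an unweighted fraction; the README does not define
    how per-criterion results are aggregated.
    """
    criteria = rubric_criteria(row)
    if not criteria:
        return 0.0
    passed = sum(1 for criterion in criteria if judge(response, criterion))
    return passed / len(criteria)


def keyword_judge(response: str, criterion: str) -> bool:
    # Hypothetical placeholder judge: a case-insensitive substring match.
    # Swap in a human or model-based grader for real evaluation.
    return criterion.lower() in response.lower()
```

With a row loaded as in the Usage section, `score_response(row, model_output, keyword_judge)` would return a value between 0 and 1. Keeping the judge as a parameter means the per-criterion check can be replaced without touching the column handling.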
Provider:
surgeai