surgeai/GDP.pdf
收藏Hugging Face2026-04-14 更新2026-05-10 收录
下载链接:
https://hf-mirror.com/datasets/surgeai/GDP.pdf
下载链接
链接失效反馈官方服务:
资源简介:
---
license: apache-2.0
task_categories:
- document-question-answering
tags:
- benchmark
- pdf-parsing
- document-understanding
pretty_name: GDP.pdf
---
# GDP.pdf
A benchmark for evaluating PDF parsing capabilities of frontier LLMs.
## Overview
| Stat | Value |
|------|-------|
| Total examples | 50 |
| Unique PDFs | 50 |
| Rubric criteria | up to 30 each |
Each example pairs a **PDF document** with a **prompt** and a set of
**rubric criteria** that define what a correct response should contain.
## Dataset Structure
### Columns
| Column | Description |
|--------|-------------|
| `pdf_path` | Relative path to the PDF file in this repo |
| `prompt` | The prompt / question for the model |
| `rubric - 1. criterion` | Rubric criterion |
| `rubric - 2. criterion` | Rubric criterion |
| `rubric - 3. criterion` | Rubric criterion |
| `rubric - 4. criterion` | Rubric criterion |
| `rubric - 5. criterion` | Rubric criterion |
| `rubric - 6. criterion` | Rubric criterion |
| `rubric - 7. criterion` | Rubric criterion |
| `rubric - 8. criterion` | Rubric criterion |
| `rubric - 9. criterion` | Rubric criterion |
| `rubric - 10. criterion` | Rubric criterion |
| `rubric - 11. criterion` | Rubric criterion |
| `rubric - 12. criterion` | Rubric criterion |
| `rubric - 13. criterion` | Rubric criterion |
| `rubric - 14. criterion` | Rubric criterion |
| `rubric - 15. criterion` | Rubric criterion |
| `rubric - 16. criterion` | Rubric criterion |
| `rubric - 17. criterion` | Rubric criterion |
| `rubric - 18. criterion` | Rubric criterion |
| `rubric - 19. criterion` | Rubric criterion |
| `rubric - 20. criterion` | Rubric criterion |
| `rubric - 21. criterion` | Rubric criterion |
| `rubric - 22. criterion` | Rubric criterion |
| `rubric - 23. criterion` | Rubric criterion |
| `rubric - 24. criterion` | Rubric criterion |
| `rubric - 25. criterion` | Rubric criterion |
| `rubric - 26. criterion` | Rubric criterion |
| `rubric - 27. criterion` | Rubric criterion |
| `rubric - 28. criterion` | Rubric criterion |
| `rubric - 29. criterion` | Rubric criterion |
| `rubric - 30. criterion` | Rubric criterion |
| `domain` | — |
### Files
- `data.parquet` — metadata table
- `pdfs/` — the PDF documents (stored via Git LFS)
## Usage
```python
from datasets import load_dataset
from huggingface_hub import hf_hub_download
ds = load_dataset("surgeai/GDP.pdf")
# Access a row
row = ds["train"][0]
print(row["prompt"])
# Download the corresponding PDF
pdf_local = hf_hub_download(
repo_id="surgeai/GDP.pdf",
filename=row["pdf_path"],
repo_type="dataset",
)
```
## Evaluation
For each example, pass the PDF and prompt to your system, then score the
response against the rubric columns. Each rubric column defines a criterion
that should be checked independently.
## License
apache-2.0
提供机构:
surgeai



