nanonets/nn-auto-bench-ds
收藏Hugging Face2025-03-13 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/nanonets/nn-auto-bench-ds
下载链接
链接失效反馈官方服务:
资源简介:
nn-auto-bench-ds是一个为关键信息提取(KIE)设计的基准数据集。该数据集包含1000个文档,分为发票、收据、护照和银行对账单等类型。文档主要是英文的,但也有德语和阿拉伯语的文档。每个文档都标注了用于关键信息提取和特定任务的信息。该数据集可用于计算大型语言模型在KIE任务上的oneshot性能。
nn-auto-bench-ds is a benchmark dataset designed for key information extraction (KIE). The dataset includes 1,000 documents, categorized into types such as invoices, receipts, passports, and bank statements. The documents are primarily in English, with some also in German and Arabic. Each document is annotated with information for key information extraction and specific tasks. The dataset can be used to compute the oneshot performance of large language models on KIE tasks.
提供机构:
nanonets



