Francisco-Cruz/InvoicesReceiptsPT
收藏Hugging Face2024-05-02 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/Francisco-Cruz/InvoicesReceiptsPT
下载链接
链接失效反馈官方服务:
资源简介:
---
task_categories:
- text-classification
language:
- pt
tags:
- finance
size_categories:
- 1K<n<10K
license: apache-2.0
---
This is a dataset comprising 1003 images of invoices and receipts, as well as the transcription of relevant fields for each document – seller name, seller address, seller tax identification, buyer tax identification, invoice date, invoice total amount, invoice tax amount, and document reference.
It is organized as:
- folder `1_Images`: files with pictures od the invoices/receipts
- folder `2_Annotations_Json`: text files with the annotations on a json format
Also available at:
- https://zenodo.org/records/7213544
- https://zenodo.org/records/6371710
提供机构:
Francisco-Cruz
原始信息汇总
数据集概述
基本信息
- 任务类别: 文本分类
- 语言: 葡萄牙语
- 标签: 金融
- 数据规模: 1K<n<10K
- 许可证: Apache 2.0
数据内容
- 数据集包含: 1003张发票和收据的图片,以及每个文档的相关字段的转录文本,包括卖家名称、卖家地址、卖家税务识别号、买家税务识别号、发票日期、发票总金额、发票税额和文档参考。
数据组织
- 文件夹
1_Images: 包含发票/收据的图片文件 - 文件夹
2_Annotations_Json: 包含以JSON格式存储的注释文本文件



