HarshvardhanK7271/InvoicesReceiptsPT
收藏Hugging Face2026-03-23 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/HarshvardhanK7271/InvoicesReceiptsPT
下载链接
链接失效反馈官方服务:
资源简介:
---
task_categories:
- text-classification
language:
- pt
tags:
- finance
size_categories:
- 1K<n<10K
license: apache-2.0
---
This is a dataset comprising 1003 images of invoices and receipts, as well as the transcription of relevant fields for each document – seller name, seller address, seller tax identification, buyer tax identification, invoice date, invoice total amount, invoice tax amount, and document reference.
It is organized as:
- folder `1_Images`: files with pictures od the invoices/receipts
- folder `2_Annotations_Json`: text files with the annotations on a json format
Also available at:
- https://zenodo.org/records/7213544
- https://zenodo.org/records/6371710
### 数据集元信息
- 任务类别:文本分类(text-classification)
- 语言:葡萄牙语(pt)
- 标签:金融(finance)
- 规模区间:1000 < 样本量 < 10000
- 许可证:Apache 2.0
本数据集包含1003张发票与收据图像,同时附带每份文档的相关字段转录信息,涵盖销售方名称、销售方地址、销售方税务识别号、采购方税务识别号、发票日期、发票总金额、发票税额以及文档参考编号。
数据集组织结构如下:
- 文件夹`1_Images`:存储发票/收据的图像文件
- 文件夹`2_Annotations_Json`:存储JSON格式的标注文本文件
该数据集亦可通过以下链接获取:
- https://zenodo.org/records/7213544
- https://zenodo.org/records/6371710
提供机构:
HarshvardhanK7271



