five

Synthetic English Gadget and Widget Invoices Datapack

收藏
Snowflake2024-09-12 更新2024-09-13 收录
下载链接:
https://app.snowflake.com/marketplace/listing/GZ1MOZ7BX29
下载链接
链接失效反馈
官方服务:
资源简介:
This datapack includes a collection of synthetic invoices for gadgets and widgets in English, meticulously crafted for machine learning applications in invoice processing and automated financial documentation systems. Each invoice represents various formats and structures, simulating real-world scenarios with multiple line items, taxes, and US addresses for billing and shipping information. These documents, created in a 3D environment, replicate conditions like lighting changes, creases, and other deformations to improve machine learning model robustness. The field-level annotations provide exacting detail, ensuring effective training for automated invoice processing solutions. This datapack includes three tables: ANNOTATION_VIEW, IMAGE_VIEW, and ZIP_VIEW. **ANNOTATION_VIEW** contains information for each annotation field including the name of the field, the text within the field, 4 corner coordinates of the field in clockwise order, and the name of the image this annotation belongs to. **IMAGE_VIEW** contains information for each image including its name, its size, its URL, and the coordinates of the document corners in the image. **ZIP_VIEW** contains the URL to download the zip file containing all images and annotations in the format of Mindtech, ICDAR2015 and Wildreceipt. Please contact Mindtech for the full datapack.
提供机构:
Mindtech Global
创建时间:
2024-09-01
搜集汇总
数据集介绍
main_image_url
背景与挑战
背景概述
该数据集为英文小工具和部件合成发票集合,包含多种格式的3D模拟发票,具有详细字段注释和真实场景变形效果,用于增强发票处理模型的鲁棒性。数据包包含ANNOTATION_VIEW、IMAGE_VIEW和ZIP_VIEW三个视图表,分别提供字段注释、图像信息和下载链接。
以上内容由遇见数据集搜集并总结生成
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作