
Synthetic Portuguese Craft Shop Receipt Datapack

Source: Snowflake. Updated 2024-09-27. Indexed 2024-09-28.
Download link:
https://app.snowflake.com/marketplace/listing/GZ1MOZ7BX2X
Description:
Mindtech offers multiple datapacks of synthetic data, each containing unique images and annotations. Depending on your use case, it may be beneficial to use multiple datapacks. If you're unsure which datapack is best suited to your needs, please contact Mindtech for assistance.

This datapack is one of the Mindtech Portuguese synthetic document datapacks. It offers a collection of synthetic receipts from Portuguese craft shops, designed for training machine learning models focused on retail receipt processing and automated bookkeeping. Each receipt includes itemized purchases, prices, and taxes, all formatted in Portuguese. The receipts are rendered in a highly realistic 3D environment that simulates real-world conditions such as folds, smudges, and lighting variations, to improve the robustness of machine learning models. Detailed annotations for each field, including product names, quantities, and totals, allow for precise model training and automation in retail receipt handling.

This datapack includes three tables: ANNOTATION_VIEW, IMAGE_VIEW, and ZIP_VIEW.

ANNOTATION_VIEW contains information for each annotation field, including the name of the field, the text within the field, the 4 corner coordinates of the field in clockwise order, and the name of the image the annotation belongs to.

IMAGE_VIEW contains information for each image, including its name, its size, its URL, and the coordinates of the document corners in the image.

ZIP_VIEW contains the URL to download the zip file containing all images and annotations in the Mindtech, ICDAR2015, and Wildreceipt formats.

Please contact Mindtech for the full datapack.
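
As a rough illustration of how one ANNOTATION_VIEW row might be handled in code, the sketch below models an annotation as a field name, its text, and four clockwise corner points, then writes it in the comma-separated line layout used by ICDAR2015 ground-truth files (eight coordinates followed by the transcription). The class name, attribute names, image name, and sample values are all assumptions made for illustration; the listing does not publish the actual column names or any sample rows.

```python
from dataclasses import dataclass
from typing import List, Tuple

Point = Tuple[float, float]


@dataclass
class Annotation:
    """Hypothetical in-memory form of one annotation row (names are assumed)."""
    image_name: str       # image the annotation belongs to
    field_name: str       # e.g. "total" or "product_name"
    text: str             # text inside the field
    corners: List[Point]  # four (x, y) corner points in clockwise order


def to_icdar2015_line(ann: Annotation) -> str:
    """Format as 'x1,y1,x2,y2,x3,y3,x4,y4,transcription'."""
    if len(ann.corners) != 4:
        raise ValueError("expected exactly four corner points")
    coords = ",".join(f"{int(round(x))},{int(round(y))}" for x, y in ann.corners)
    # The transcription is everything after the eighth comma, so a
    # Portuguese decimal comma inside the text (e.g. "12,50") is preserved.
    return f"{coords},{ann.text}"


def bounding_box(ann: Annotation) -> Tuple[float, float, float, float]:
    """Axis-aligned box (x_min, y_min, x_max, y_max) enclosing the quadrilateral."""
    xs = [x for x, _ in ann.corners]
    ys = [y for _, y in ann.corners]
    return min(xs), min(ys), max(xs), max(ys)


# Made-up sample row, not taken from the datapack.
example = Annotation(
    image_name="receipt_0001.png",
    field_name="total",
    text="12,50",
    corners=[(10, 20), (110, 20), (110, 45), (10, 45)],
)

print(to_icdar2015_line(example))
print(bounding_box(example))
```

Because Portuguese prices use a comma as the decimal separator, a reader of such lines should split on the first eight commas only rather than on every comma.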
Provider:
Mindtech Global
Created:
2024-09-13
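Summary

Background overview
This dataset is Mindtech's synthetic Portuguese craft shop receipt datapack. It contains highly realistic, 3D-simulated receipt images with detailed field annotations, and is suited to training machine learning models for retail receipt processing and automated bookkeeping. The datapack provides three structured views (ANNOTATION_VIEW, IMAGE_VIEW, ZIP_VIEW) covering text content, coordinate information, and multi-format download links.
The summary above was collected and generated by 遇见数据集.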