Synthetic Invoices for ML Training
收藏Snowflake2022-06-10 更新2024-05-01 收录
下载链接:
https://app.snowflake.com/marketplace/listing/GZT1ZO9HCS
下载链接
链接失效反馈官方服务:
资源简介:
While companies grow more data-hungry, privacy restrictions and data scarcity often hamper ML model development. Because of this, synthetic data generation is playing an increasingly central role in AI model training. Innodata designed curated synthetic datasets to address these real-time industry pain points.
Curated datasets include synthetic documents in English, French, German, Italian and Spanish languages. Samples of each document type are available for download on our website. Current documents are:
-Bank statements
-Checks
-Credit card statements
-Credit notes
-Invoices
-Packing lists
-Purchase orders
-Receipts
-Remittance advice
-Utility bills
Each data set is a compilation of handmade templates based on real-world examples (bank statements match recent versions from real banks, etc.), all sourced with ethical data practices. All files are representative of clean-scanned readable PDF documents for easy ingestion into annotation platforms.
提供机构:
Innodata Inc.
创建时间:
2022-06-09



