Synthetic English Resumes Datapack
收藏Snowflake2024-09-11 更新2024-09-12 收录
下载链接:
https://app.snowflake.com/marketplace/listing/GZ1MOZ7BX25
下载链接
链接失效反馈官方服务:
资源简介:
This datapack contains a collection of synthetically generated resumes in English, offering a rich resource for training machine learning models focused on document classification, information extraction, and resume parsing. Created in a fully controlled 3D environment, each resume varies in formatting, structure, and layout, simulating real-world conditions such as variable fonts, margins, and document degradation like folds and light exposure. The dataset includes precise field-level annotations, enabling robust training for models designed to extract key information such as names, job titles, dates, and education details. It is an excellent tool for developers building automated HR and recruitment software.
This datapack includes three tables: ANNOTATION_VIEW, IMAGE_VIEW, and ZIP_VIEW.<br/>**ANNOTATION_VIEW** contains information for each annotation field including the name of the field, the text within the field, 4 corner coordinates of the field in clockwise order, and the name of the image this annotation belongs to.<br/>**IMAGE_VIEW** contains information for each image including its name, its size, its URL, and the coordinates of the document corners in the image.<br/>**ZIP_VIEW** contains the URL to download the zip file containing all images and annotations in the format of Mindtech, ICDAR2015 and Wildreceipt.<br/>Please contact Mindtech for the full datapack.
提供机构:
Mindtech Global
创建时间:
2024-09-01



