TrainingDataPro/generated-usa-passeports-dataset
收藏Hugging Face2024-04-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/TrainingDataPro/generated-usa-passeports-dataset
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-nd-4.0
task_categories:
- image-to-image
language:
- en
dataset_info:
features:
- name: original
dtype: image
- name: us_pass_augmentated_1
dtype: image
- name: us_pass_augmentated_2
dtype: image
- name: us_pass_augmentated_3
dtype: image
splits:
- name: train
num_bytes: 224948826
num_examples: 23
download_size: 142865341
dataset_size: 224948826
---
# GENERATED USA Passports Dataset
**Data generation** in machine learning involves creating or manipulating data to train and evaluate machine learning models. The purpose of data generation is to provide diverse and representative examples that cover a wide range of scenarios, ensuring the model's robustness and generalization.
Data augmentation techniques involve applying various transformations to existing data samples to create new ones. These transformations include: *random rotations, translations, scaling, flips, and more*. Augmentation helps in increasing the dataset size, introducing natural variations, and improving model performance by making it more invariant to specific transformations.
The dataset contains **GENERATED** USA passports, which are replicas of official passports but with randomly generated details, such as name, date of birth etc. The primary intention of generating these fake passports is to demonstrate the structure and content of a typical passport document and to train the neural network to identify this type of document.
Generated passports can assist in conducting research without accessing or compromising real user data that is often sensitive and subject to privacy regulations. Synthetic data generation allows researchers to develop and refine models using simulated passport data without risking privacy leaks.
### The dataset is solely for informational or educational purposes and should not be used for any fraudulent or deceptive activities.
.png?generation=1688719414649908&alt=media)
# Get the dataset
### This is just an example of the data
Leave a request on [**https://trainingdata.pro/datasets**](https://trainingdata.pro/datasets/synthetic-data?utm_source=huggingface&utm_medium=cpc&utm_campaign=generated-usa-passeports-dataset) to discuss your requirements, learn about the price and buy the dataset.
# Content
### Folders
- **original**: includes original generated images of USA passports
- **augmentation**: contains subfolders, corresponding to the original photos and including 3 black and white generated passport scans with different photo editing.
The augmentated photos are presented with random rotations, noise and brightness. Augmentation varies depending on the amount of noise and blur in the passport images, from slight (**us_pass_augmentated_1**) to significant (**us_pass_augmentated_3**).
### File with the extension .csv
includes the following information for each media file:
- **original**: link to access the image of the generated passport,
- **us_pass_augmentated_1**: link to the first augmentated image,
- **us_pass_augmentated_2**: link to the second augmentated image,
- **us_pass_augmentated_3**: link to the third augmentated image
# USA Passeport Photos might be generated in accordance with your requirements.
## [**TrainingData**](https://trainingdata.pro/datasets/synthetic-data?utm_source=huggingface&utm_medium=cpc&utm_campaign=generated-usa-passeports-dataset) provides high-quality data annotation tailored to your needs
More datasets in TrainingData's Kaggle account: **https://www.kaggle.com/trainingdatapro/datasets**
TrainingData's GitHub: **https://github.com/Trainingdata-datamarket/TrainingData_All_datasets**
*keywords: image dataset, generated data, passports, passport designs, machine-readable zone, mrz, synthetic data, synthetic data generation, synthetic dataset , gdpr synthetic data, data augmentation, object detection, computer vision, documents, document security, cybersecurity, information security systems, augmentation dataset, data augmentation, augmented images, image augmentation*
提供机构:
TrainingDataPro
原始信息汇总
数据集概述
数据集名称
- GENERATED USA Passports Dataset
数据集内容
- original: 包含原始生成的美国护照图像。
- augmentation: 包含子文件夹,对应于原始照片,并包含3种不同照片编辑的黑白生成护照扫描。
数据集特征
- original: 数据类型为图像。
- us_pass_augmentated_1: 数据类型为图像。
- us_pass_augmentated_2: 数据类型为图像。
- us_pass_augmentated_3: 数据类型为图像。
数据集分割
- train: 包含23个示例,总字节数为224948826。
数据集大小
- 下载大小: 142865341字节
- 数据集大小: 224948826字节
许可证
- cc-by-nc-nd-4.0
语言
- 英语
数据集用途
- 该数据集仅用于信息或教育目的,不应用于任何欺诈或欺骗活动。



