TrainingDataPro/selfies_and_id
收藏Hugging Face2024-04-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/TrainingDataPro/selfies_and_id
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-nc-nd-4.0
task_categories:
- image-to-image
tags:
- code
dataset_info:
features:
- name: id_1
dtype: image
- name: id_2
dtype: image
- name: selfie_1
dtype: image
- name: selfie_2
dtype: image
- name: selfie_3
dtype: image
- name: selfie_4
dtype: image
- name: selfie_5
dtype: image
- name: selfie_6
dtype: image
- name: selfie_7
dtype: image
- name: selfie_8
dtype: image
- name: selfie_9
dtype: image
- name: selfie_10
dtype: image
- name: selfie_11
dtype: image
- name: selfie_12
dtype: image
- name: selfie_13
dtype: image
- name: user_id
dtype: string
- name: set_id
dtype: string
- name: user_race
dtype: string
- name: name
dtype: string
- name: age
dtype: int8
- name: country
dtype: string
- name: gender
dtype: string
splits:
- name: train
num_bytes: 376371811
num_examples: 10
download_size: 374658409
dataset_size: 376371811
---
# Selfies, ID Images dataset
**4083** sets, which includes *2 photos of a person from his documents and 13 selfies*. **571** sets of Hispanics and **3512** sets of Caucasians.
Photo documents contains only a photo of a person. All personal information from the document is hidden
## File with the extension .csv
includes the following information for each media file:
- **SetId**: a unique identifier of a set of 15 media files,
- **UserId**: the identifier of the person who provided the media file,
- **UserRace**: the ethnicity of the person
- **Country**: the country of origin of the person,
- **Age**: the age of the person,
- **Gender**: the gender of the person,
- **Name**: the name of the person
- **FName**: the type of the media file
- **URL**: the URL to access the media file
## Folder "img" with media files
- containg all the photos
- which correspond to the data in the .csv file
**How it works**: *go to the first folder and you will make sure that it contains media files taken by a person whose parameters are specified in the first 15 lines of the .csv file.*
# Get the dataset
### This is just an example of the data
Leave a request on [**https://trainingdata.pro/datasets**](https://trainingdata.pro/datasets/document-photos-and-selfies?utm_source=huggingface&utm_medium=cpc&utm_campaign=selfies_and_id) to discuss your requirements, learn about the price and buy the dataset.
## [**TrainingData**](https://trainingdata.pro/datasets/document-photos-and-selfies?utm_source=huggingface&utm_medium=cpc&utm_campaign=selfies_and_id) provides high-quality data annotation tailored to your needs
More datasets in TrainingData's Kaggle account: **https://www.kaggle.com/trainingdatapro/datasets**
TrainingData's GitHub: **https://github.com/Trainingdata-datamarket/TrainingData_All_datasets**
提供机构:
TrainingDataPro
原始信息汇总
数据集概述
数据集名称
- Selfies, ID Images dataset
数据集内容
- 包含内容: 每个数据集包含2张证件照片和13张自拍照,共计4083个集合。
- 种族分布: 571个集合为西班牙裔,3512个集合为高加索人。
数据集特征
- 图像特征:
- id_1: 图像
- id_2: 图像
- selfie_1 至 selfie_13: 图像
- 元数据特征:
- user_id: 字符串
- set_id: 字符串
- user_race: 字符串
- name: 字符串
- age: 整数(8位)
- country: 字符串
- gender: 字符串
数据集分割
- 训练集:
- 数据量: 376371811字节
- 示例数: 10
文件信息
- .csv文件:
- 包含每个媒体文件的详细信息,如SetId, UserId, UserRace, Country, Age, Gender, Name, FName, URL。
- img文件夹:
- 包含所有照片,与.csv文件中的数据相对应。
许可证
- 许可证: cc-by-nc-nd-4.0



