five

TrainingDataPro/selfies_and_id

收藏
Hugging Face2024-04-24 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/TrainingDataPro/selfies_and_id
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: cc-by-nc-nd-4.0 task_categories: - image-to-image tags: - code dataset_info: features: - name: id_1 dtype: image - name: id_2 dtype: image - name: selfie_1 dtype: image - name: selfie_2 dtype: image - name: selfie_3 dtype: image - name: selfie_4 dtype: image - name: selfie_5 dtype: image - name: selfie_6 dtype: image - name: selfie_7 dtype: image - name: selfie_8 dtype: image - name: selfie_9 dtype: image - name: selfie_10 dtype: image - name: selfie_11 dtype: image - name: selfie_12 dtype: image - name: selfie_13 dtype: image - name: user_id dtype: string - name: set_id dtype: string - name: user_race dtype: string - name: name dtype: string - name: age dtype: int8 - name: country dtype: string - name: gender dtype: string splits: - name: train num_bytes: 376371811 num_examples: 10 download_size: 374658409 dataset_size: 376371811 --- # Selfies, ID Images dataset **4083** sets, which includes *2 photos of a person from his documents and 13 selfies*. **571** sets of Hispanics and **3512** sets of Caucasians. Photo documents contains only a photo of a person. All personal information from the document is hidden ## File with the extension .csv includes the following information for each media file: - **SetId**: a unique identifier of a set of 15 media files, - **UserId**: the identifier of the person who provided the media file, - **UserRace**: the ethnicity of the person - **Country**: the country of origin of the person, - **Age**: the age of the person, - **Gender**: the gender of the person, - **Name**: the name of the person - **FName**: the type of the media file - **URL**: the URL to access the media file ## Folder "img" with media files - containg all the photos - which correspond to the data in the .csv file **How it works**: *go to the first folder and you will make sure that it contains media files taken by a person whose parameters are specified in the first 15 lines of the .csv file.* # Get the dataset ### This is just an example of the data Leave a request on [**https://trainingdata.pro/datasets**](https://trainingdata.pro/datasets/document-photos-and-selfies?utm_source=huggingface&utm_medium=cpc&utm_campaign=selfies_and_id) to discuss your requirements, learn about the price and buy the dataset. ## [**TrainingData**](https://trainingdata.pro/datasets/document-photos-and-selfies?utm_source=huggingface&utm_medium=cpc&utm_campaign=selfies_and_id) provides high-quality data annotation tailored to your needs More datasets in TrainingData's Kaggle account: **https://www.kaggle.com/trainingdatapro/datasets** TrainingData's GitHub: **https://github.com/Trainingdata-datamarket/TrainingData_All_datasets**
提供机构:
TrainingDataPro
原始信息汇总

数据集概述

数据集名称

  • Selfies, ID Images dataset

数据集内容

  • 包含内容: 每个数据集包含2张证件照片和13张自拍照,共计4083个集合。
  • 种族分布: 571个集合为西班牙裔,3512个集合为高加索人。

数据集特征

  • 图像特征:
    • id_1: 图像
    • id_2: 图像
    • selfie_1 至 selfie_13: 图像
  • 元数据特征:
    • user_id: 字符串
    • set_id: 字符串
    • user_race: 字符串
    • name: 字符串
    • age: 整数(8位)
    • country: 字符串
    • gender: 字符串

数据集分割

  • 训练集:
    • 数据量: 376371811字节
    • 示例数: 10

文件信息

  • .csv文件:
    • 包含每个媒体文件的详细信息,如SetId, UserId, UserRace, Country, Age, Gender, Name, FName, URL。
  • img文件夹:
    • 包含所有照片,与.csv文件中的数据相对应。

许可证

  • 许可证: cc-by-nc-nd-4.0
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作