dpdp69/FairFace
Source: Hugging Face. Updated 2026-04-09; indexed 2026-04-12.
Download link:
https://hf-mirror.com/datasets/dpdp69/FairFace
Description:
---
license: cc-by-4.0
dataset_info:
- config_name: '0.25'
  features:
  - name: image
    dtype: image
  - name: age
    dtype:
      class_label:
        names:
          '0': 0-2
          '1': 3-9
          '2': 10-19
          '3': 20-29
          '4': 30-39
          '5': 40-49
          '6': 50-59
          '7': 60-69
          '8': more than 70
  - name: gender
    dtype:
      class_label:
        names:
          '0': Male
          '1': Female
  - name: race
    dtype:
      class_label:
        names:
          '0': East Asian
          '1': Indian
          '2': Black
          '3': White
          '4': Middle Eastern
          '5': Latino_Hispanic
          '6': Southeast Asian
  - name: service_test
    dtype: bool
  splits:
  - name: train
    num_bytes: 512915534.352
    num_examples: 86744
  - name: validation
    num_bytes: 64453996.096
    num_examples: 10954
  download_size: 563437634
  dataset_size: 577369530.448
- config_name: '1.25'
  features:
  - name: image
    dtype: image
  - name: age
    dtype:
      class_label:
        names:
          '0': 0-2
          '1': 3-9
          '2': 10-19
          '3': 20-29
          '4': 30-39
          '5': 40-49
          '6': 50-59
          '7': 60-69
          '8': more than 70
  - name: gender
    dtype:
      class_label:
        names:
          '0': Male
          '1': Female
  - name: race
    dtype:
      class_label:
        names:
          '0': East Asian
          '1': Indian
          '2': Black
          '3': White
          '4': Middle Eastern
          '5': Latino_Hispanic
          '6': Southeast Asian
  - name: service_test
    dtype: bool
  splits:
  - name: train
    num_bytes: 1860154641.104
    num_examples: 86744
  - name: validation
    num_bytes: 236712623.794
    num_examples: 10954
  download_size: 2104494732
  dataset_size: 2096867264.898
configs:
- config_name: '0.25'
  data_files:
  - split: train
    path: 0.25/train-*
  - split: validation
    path: 0.25/validation-*
- config_name: '1.25'
  data_files:
  - split: train
    path: 1.25/train-*
  - split: validation
    path: 1.25/validation-*
---
# Dataset Card for FairFace
## Table of Contents
- [Table of Contents](#table-of-contents)
- [Dataset Description](#dataset-description)
  - [Dataset Summary](#dataset-summary)
  - [Supported Tasks and Leaderboards](#supported-tasks-and-leaderboards)
  - [Languages](#languages)
- [Dataset Structure](#dataset-structure)
  - [Data Instances](#data-instances)
  - [Data Fields](#data-fields)
  - [Data Splits](#data-splits)
- [Dataset Creation](#dataset-creation)
  - [Curation Rationale](#curation-rationale)
  - [Source Data](#source-data)
  - [Annotations](#annotations)
  - [Personal and Sensitive Information](#personal-and-sensitive-information)
- [Considerations for Using the Data](#considerations-for-using-the-data)
  - [Social Impact of Dataset](#social-impact-of-dataset)
  - [Discussion of Biases](#discussion-of-biases)
  - [Other Known Limitations](#other-known-limitations)
- [Additional Information](#additional-information)
  - [Dataset Curators](#dataset-curators)
  - [Licensing Information](#licensing-information)
  - [Citation Information](#citation-information)
  - [Contributions](#contributions)
## Dataset Description
- **Homepage:** [https://github.com/joojs/fairface](https://github.com/joojs/fairface)
- **Repository:** [https://github.com/joojs/fairface](https://github.com/joojs/fairface)
- **Paper:** [https://openaccess.thecvf.com/content/WACV2021/papers/Karkkainen_FairFace_Face_Attribute_Dataset_for_Balanced_Race_Gender_and_Age_WACV_2021_paper.pdf](https://openaccess.thecvf.com/content/WACV2021/papers/Karkkainen_FairFace_Face_Attribute_Dataset_for_Balanced_Race_Gender_and_Age_WACV_2021_paper.pdf)
- **Leaderboard:**
- **Point of Contact:**
### Dataset Summary
FairFace is a race-balanced face image dataset. It contains 108,501 images from 7 race groups: White, Black, Indian, East Asian, Southeast Asian, Middle Eastern, and Latino (labeled `Latino_Hispanic` in the data).
Images were collected from the YFCC-100M Flickr dataset and labeled with race, gender, and age group.
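One way to check the "race-balanced" claim on any subset is to compute each group's share of the examples. The sketch below assumes the `race` field is available as a list of integer class ids; `race_shares` and `imbalance_ratio` are hypothetical helper names, and the input ids are toy values, not real dataset statistics.

```python
from collections import Counter

# Race names in class-id order, copied from the card metadata.
RACE_NAMES = [
    "East Asian", "Indian", "Black", "White",
    "Middle Eastern", "Latino_Hispanic", "Southeast Asian",
]


def race_shares(race_ids):
    """Map each race name to its share of `race_ids` (integer class ids)."""
    if not race_ids:
        raise ValueError("race_ids must not be empty")
    counts = Counter(race_ids)
    total = len(race_ids)
    return {name: counts.get(i, 0) / total for i, name in enumerate(RACE_NAMES)}


def imbalance_ratio(shares):
    """Largest share divided by smallest; values near 1.0 mean balanced."""
    smallest = min(shares.values())
    if smallest == 0:
        return float("inf")  # at least one group is absent
    return max(shares.values()) / smallest


# Toy ids for illustration only; not real dataset statistics.
toy_ids = [0, 1, 2, 3, 4, 5, 6, 3, 3, 0]
shares = race_shares(toy_ids)
print(shares)
print(imbalance_ratio(shares))
```

Running the same functions over the `race` column of a loaded split would give the actual distribution.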
### Supported Tasks and Leaderboards
[More Information Needed]
### Languages
[More Information Needed]
## Dataset Structure
### Data Instances
Each instance has the following structure:
```
{
'image': <PIL.JpegImagePlugin.JpegImageFile image mode=RGB size=448x448 at 0x7FCABA221FA0>,
'age': 6,
'gender': 0,
'race': 0,
'service_test': True
}
```
### Data Fields
- `image`: The face image, decoded as a PIL image
- `age`: Age class among `["0-2", "3-9", "10-19", "20-29", "30-39", "40-49", "50-59", "60-69", "more than 70"]`
- `gender`: Gender class among `["Male", "Female"]`
- `race`: Race class among `["East Asian", "Indian", "Black", "White", "Middle Eastern", "Latino_Hispanic", "Southeast Asian"]`
- `service_test`: Boolean flag whose meaning is not documented in this card. See the upstream [issue](https://github.com/joojs/fairface/issues/9).
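The `age`, `gender`, and `race` fields are stored as integer class ids, so an instance usually needs decoding before it is readable. A minimal sketch, using the label names listed in this card; `decode_labels` is a hypothetical helper, not part of any library:

```python
# Label names copied from the card's class_label metadata, in index order.
AGE_NAMES = ["0-2", "3-9", "10-19", "20-29", "30-39",
             "40-49", "50-59", "60-69", "more than 70"]
GENDER_NAMES = ["Male", "Female"]
RACE_NAMES = ["East Asian", "Indian", "Black", "White",
              "Middle Eastern", "Latino_Hispanic", "Southeast Asian"]

LABEL_NAMES = {"age": AGE_NAMES, "gender": GENDER_NAMES, "race": RACE_NAMES}


def decode_labels(example):
    """Return a copy of `example` with integer class ids replaced by names.

    Fields without a name table (image, service_test) pass through unchanged.
    """
    decoded = dict(example)
    for field, names in LABEL_NAMES.items():
        if field in decoded:
            decoded[field] = names[decoded[field]]
    return decoded


# Labels from the example instance in this card (image omitted).
sample = {"age": 6, "gender": 0, "race": 0, "service_test": True}
print(decode_labels(sample))
# {'age': '50-59', 'gender': 'Male', 'race': 'East Asian', 'service_test': True}
```

When the data is loaded through the `datasets` library, the same mapping should also be available from the feature objects, for example `ds.features["age"].int2str(6)`.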
### Data Splits
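According to the card metadata, both configs (`0.25` and `1.25`) provide the same two splits:
- `train`: 86,744 examples
- `validation`: 10,954 examples

Together these total 97,698 examples, fewer than the 108,501 images cited in the summary; the card does not explain the difference.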
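The split sizes declared in the card metadata are enough to work out the train/validation proportions and the average stored size per example. The sketch below copies those numbers verbatim; `split_summary` and `load_fairface` are hypothetical helper names. The loader assumes the `datasets` package is installed and the repository id `dpdp69/FairFace` is reachable, so it is defined but not called here.

```python
# (num_bytes, num_examples) per split, copied from the dataset_info metadata.
SPLITS = {
    "0.25": {"train": (512915534.352, 86744), "validation": (64453996.096, 10954)},
    "1.25": {"train": (1860154641.104, 86744), "validation": (236712623.794, 10954)},
}


def split_summary(config):
    """Return total examples, train fraction, and mean bytes per example."""
    splits = SPLITS[config]
    total = sum(n for _, n in splits.values())
    total_bytes = sum(b for b, _ in splits.values())
    return {
        "total_examples": total,
        "train_fraction": splits["train"][1] / total,
        "mean_bytes_per_example": total_bytes / total,
    }


def load_fairface(config="0.25", split="validation"):
    """Load one split; needs the `datasets` package and network access."""
    from datasets import load_dataset  # lazy import so the summary runs offline

    return load_dataset("dpdp69/FairFace", config, split=split)


for name in SPLITS:
    print(name, split_summary(name))
```

The two configs differ only in stored image size, which shows up as a larger mean bytes per example for `1.25`.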
## Dataset Creation
### Curation Rationale
[More Information Needed]
### Source Data
#### Initial Data Collection and Normalization
[More Information Needed]
#### Who are the source language producers?
[More Information Needed]
### Annotations
#### Annotation process
[More Information Needed]
#### Who are the annotators?
[More Information Needed]
### Personal and Sensitive Information
[More Information Needed]
## Considerations for Using the Data
### Social Impact of Dataset
[More Information Needed]
### Discussion of Biases
[More Information Needed]
### Other Known Limitations
[More Information Needed]
## Additional Information
### Dataset Curators
[More Information Needed]
### Licensing Information
The card metadata declares the license as `cc-by-4.0` (CC BY 4.0).
### Citation Information
[More Information Needed]
### Contributions
Thanks to [@VictorSanh](https://github.com/VictorSanh) for adding this dataset.
Provider: dpdp69



