visual-layer/vl-celeba-hq
收藏Hugging Face2023-07-04 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/visual-layer/vl-celeba-hq
下载链接
链接失效反馈官方服务:
资源简介:
---
license: other
dataset_info:
features:
- name: image
dtype: image
- name: label
dtype:
class_label:
names:
'0': female
'1': male
splits:
- name: train
num_bytes: 2674924572.386
num_examples: 27412
- name: validation
num_bytes: 192330079.038
num_examples: 1959
download_size: 2704339198
dataset_size: 2867254651.4240003
---
[](https://www.visual-layer.com)
# Description
The `vl-celeba-hq` is a sanitized version of the original CelebA-HQ dataset.
The following are issues found in the original dataset and removed in this dataset:
<table>
<thead>
<tr>
<th style="text-align: left;">Category</th>
<th style="text-align: left;">Percentage</th>
<th style="text-align: left;">Count</th>
</tr>
</thead>
<tbody>
<tr>
<td style="text-align: left;">Duplicates</td>
<td style="text-align: left;"><div>1.67%</div></td>
<td style="text-align: left;"><div>3,389</div></td>
</tr>
<tr>
<td style="text-align: left;">Outliers</td>
<td style="text-align: left;"><div>0.08%</div></td>
<td style="text-align: left;"><div>157</div></td>
</tr>
<tr>
<td style="text-align: left;">Blur</td>
<td style="text-align: left;"><div>0.51%</div></td>
<td style="text-align: left;"><div>1,037</div></td>
</tr>
<tr>
<td style="text-align: left;">Dark</td>
<td style="text-align: left;"><div>0.001%</div></td>
<td style="text-align: left;"><div>2</div></td>
</tr>
<tr>
<td style="text-align: left;">Mislabels</td>
<td style="text-align: left;"><div>0.01%</div></td>
<td style="text-align: left;"><div>13</div></td>
</tr>
<tr>
<td style="text-align: left;">Leakage</td>
<td style="text-align: left;"><div>0.09%</div></td>
<td style="text-align: left;"><div>188</div></td>
</tr>
<tr>
<td style="text-align: left; font-weight: bold;">Total</td>
<td style="text-align: left; font-weight: bold;"><div>2.362%</div></td>
<td style="text-align: left; font-weight: bold;"><div>4,786</div></td>
</tr>
</tbody>
</table>
Learn more - https://docs.visual-layer.com/docs/available-datasets#vl-celeba-hq
# About Visual-Layer
<div align="center">
<a href="https://www.visual-layer.com">
<img alt="Visual Layer Logo" src="https://github.com/visual-layer/visuallayer/blob/main/imgs/vl_horizontal_logo.png?raw=true" alt="Logo" width="400">
</a>
</div>
Visual Layer is founded by the authors of [XGBoost](https://github.com/apache/tvm), [Apache TVM](https://github.com/apache/tvm) & [Turi Create](https://github.com/apple/turicreate) - [Danny Bickson](https://www.linkedin.com/in/dr-danny-bickson-835b32), [Carlos Guestrin](https://www.linkedin.com/in/carlos-guestrin-5352a869) and [Amir Alush](https://www.linkedin.com/in/amiralush).
Learn more about Visual Layer [here](https://visual-layer.com).
提供机构:
visual-layer
原始信息汇总
数据集概述
数据集名称
vl-celeba-hq
数据集描述
vl-celeba-hq 是原始 CelebA-HQ 数据集的净化版本,移除了以下问题:
- 重复数据:占比 1.67%,共计 3,389 项
- 异常值:占比 0.08%,共计 157 项
- 模糊图像:占比 0.51%,共计 1,037 项
- 过暗图像:占比 0.001%,共计 2 项
- 标签错误:占比 0.01%,共计 13 项
- 数据泄露:占比 0.09%,共计 188 项
- 总计问题占比:2.362%,共计 4,786 项
数据集特征
image:图像数据类型label:分类标签数据类型,包含两个类别:- 0: female
- 1: male
数据集划分
train:包含 27412 个样本,总大小为 2674924572.386 字节validation:包含 1959 个样本,总大小为 192330079.038 字节
数据集大小
- 下载大小:2704339198 字节
- 数据集总大小:2867254651.4240003 字节
许可证
other



