opendiffusionai/cc12m-a_woman

Source: Hugging Face. Updated 2024-12-06; indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/opendiffusionai/cc12m-a_woman
Description:
This dataset is a convenience subset of opendiffusionai/cc12m-cleaned, built by searching for "A woman" and hand-curating the results. The curation involved removing watermarks, site branding, and other elements that could interfere with ML training, and keeping only images with clear, sharp camera focus on the main subject. The dataset currently contains a few thousand images, with a goal of reaching 100,000. To acquire the images, download both the .gz file and the crawl.sh script, install the img2dataset utility, and run the script. The dataset is also available in parquet format.
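
The download step described above can be sketched roughly as follows. This is not the contents of crawl.sh, which is not shown here; it is a minimal illustration of how an img2dataset command line might be assembled for a gzipped metadata file. The file name metadata.jsonl.gz, the jsonl.gz input format, the column names url and caption, and the output folder name are all assumptions made for the example, so check the actual .gz file and crawl.sh before relying on any of them.

```python
import shlex


def build_img2dataset_command(
    url_list,
    output_folder,
    input_format="jsonl.gz",  # assumption: the .gz file is gzipped JSON Lines
    url_col="url",            # assumption: column holding the image URL
    caption_col="caption",    # assumption: column holding the caption text
):
    """Return an img2dataset CLI invocation as a list of arguments.

    The command is only built here, not executed, so it can be
    inspected before any download is started.
    """
    return [
        "img2dataset",
        f"--url_list={url_list}",
        f"--input_format={input_format}",
        f"--url_col={url_col}",
        f"--caption_col={caption_col}",
        f"--output_folder={output_folder}",
        "--output_format=files",  # write plain image files to disk
    ]


# Hypothetical file and folder names, used for illustration only.
cmd = build_img2dataset_command("metadata.jsonl.gz", "cc12m-a_woman-images")
print(shlex.join(cmd))
```

Once img2dataset is installed (for example with pip install img2dataset), the printed command can be pasted into a shell, or the list can be passed to subprocess.run. Running the crawl.sh script shipped with the dataset remains the documented route.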
Provided by:
opendiffusionai