opencapybara/CapyWiki-34M-raw
收藏Hugging Face2024-09-06 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/opencapybara/CapyWiki-34M-raw
下载链接
链接失效反馈官方服务:
资源简介:
CapyWiki-34M是一个包含公开许可和公共领域图片的数据集,源自Wikimedia。数据集分为三个部分:public_domain(公共领域,16.5M链接)、cc_by(商业使用许可,3.4M链接)和cc_by_sa(商业使用许可,14.2M链接)。数据集包含照片、插图、扫描件、地图等各类图片,格式为*.parquet。数据集可用于训练和评估神经网络,也可用于其他下游任务。数据格式包括图片URL、描述、作者、许可证信息等。
CapyWiki-34M is a collection of openly licensed and public domain image datasets from Wikimedia. The dataset contains three splits: public_domain, cc_by, and cc_by_sa. Each split contains links to images with different license types, such as public domain, cc-by, and cc-by-sa licenses. The dataset includes various types of images, such as photos, illustrations, scans, maps, etc. Each entry in the dataset contains the image URL, description, author, license information, date, and credit information. The dataset is stored in parquet format and can be loaded using the Hugging Face Datasets library.
提供机构:
opencapybara



