five

SwayStar123/preprocessed_commoncatalog-cc-by_DCAE

收藏
Hugging Face2025-01-29 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/SwayStar123/preprocessed_commoncatalog-cc-by_DCAE
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是预处理的公共目录(CC-BY)DCAE,包含了通过DC-AE f32自动编码器编码的图片和经过moondream2生成、siglip和bert编码的文本描述。图片大小经过调整,确保所有边长可被32整除以适配DCAEf32编码器。文本描述部分,文本向量填充至64个标记,并提供了未填充的长度以便在批处理中剪裁以节省计算资源。

This dataset is the preprocessed Common Catalogue (CC-BY) DCAE, which includes images encoded with the DC-AE f32 autoencoder and text descriptions generated by moondream2 and encoded with siglip and bert. The images are resized to ensure that all side lengths are divisible by 32 to fit the DCAEf32 encoder. For the text descriptions, the text embeddings are padded to 64 tokens, and the unpadded length is provided for pruning to the maximum in the batch to save computational resources.
提供机构:
SwayStar123
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作