five

Recipe1M+

收藏
OpenXLab2026-04-18 收录
下载链接:
https://openxlab.org.cn/datasets/OpenDataLab/Recipe1M_plus
下载链接
链接失效反馈
官方服务:
资源简介:
在本文中,我们介绍了Recipe1M,这是一个新的大型结构化语料库,包含100万多个烹饪食谱和1300万的食物图像。作为最大的公开配方数据集合,Recipe1M具有在对齐的多模式数据上训练高容量模型的能力。使用这些数据,我们训练神经网络以学习配方和图像的联合嵌入,从而在图像配方检索任务中产生令人印象深刻的结果。此外,我们证明了通过添加高级分类目标进行正则化既可以提高检索性能,使其与人类的检索性能相媲美,又可以实现语义向量算法。我们假设这些嵌入将为进一步探索Recipe1M数据集以及一般的食物和烹饪提供基础。代码、数据和模型是公开的。

In this paper, we introduce Recipe1M, a novel large-scale structured corpus comprising over one million culinary recipes and 13 million food images. As the largest publicly available recipe dataset to date, Recipe1M empowers the training of high-capacity models on aligned multimodal data. Using this dataset, we train neural networks to learn joint embeddings of recipes and images, yielding impressive performance on the image-recipe retrieval task. Furthermore, we demonstrate that adding advanced classification objectives for regularization can both improve retrieval performance to match human-level results and enable semantic vector arithmetic. We hypothesize that these embeddings will serve as a foundational resource for further exploration of the Recipe1M dataset, as well as food and cooking in general. Code, data, and models are publicly available.
提供机构:
OpenDataLab
创建时间:
2022-11-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作