NutriGreen Image Dataset: A Collection of Annotated Nutrition, Organic, and Vegan Food Products

Mendeley Data2024-05-10 更新2024-06-29 收录

下载链接：

https://zenodo.org/records/10020545

下载链接

链接失效反馈

资源简介：

The generated dataset is an annotated collection, with each image carrying labels (NutriScore, V-label and Bio). The presence of annotated data is essential for developing a supervised machine-learning model capable of automatically identifying labels in new images. In our case, we utilize this data to train a model that can autonomously recognize labels on new images not present in the dataset, achieving a model accuracy of 94%. In the future, you have the option to train a new model using the dataset to achieve higher accuracy or employ the existing model to automatically identify bio and nutri labels in newly collected images, eliminating the need for manual review. We should emphasize that these resources should be utilized by a data science team. There is an opportunity for this model to be integrated with a mobile app, but this is a direction for future work, we included in the revised version. In this research, we introduce the NutriGreen dataset, which is a collection of images representing packaged food products. Each image in the dataset comes with three distinct labels: one indicating its nutritional value using the Nutri-Score, another denoting whether it's vegan or vegetarian with the V-label, and a third displaying the EU organic certification (BIO) logo. The dataset comprises a total of 10,472 images. Among these, the Nutri-Score label is distributed across five sub-labels: A with 1,250 images, B with 1,107 images, C with 867 images, D with 1,001 images, and E with 967 images. Additionally, there are 870 images featuring the V-Label, 2,328 images showcasing the BIO label, and 3201 images with no labels. Furthermore, we have fine-tuned the YOLOv5 model to demonstrate the practicality of using these annotated datasets, achieving an impressive accuracy of 94.0%. These promising results indicate that this dataset has significant potential for training innovative systems capable of detecting food labels. Moreover, it can serve as a valuable benchmark dataset for emerging computer vision systems.

创建时间：

2023-10-26

5,000+

优质数据集

54 个

任务类型

进入经典数据集