five

NutriGreen Image Dataset: A Collection of Annotated Nutrition, Organic, and Vegan Food Products

收藏
Mendeley Data2024-05-10 更新2024-06-29 收录
下载链接:
https://zenodo.org/records/10020545
下载链接
链接失效反馈
资源简介:
The generated dataset is an annotated collection, with each image carrying labels (NutriScore, V-label and Bio). The presence of annotated data is essential for developing a supervised machine-learning model capable of automatically identifying labels in new images. In our case, we utilize this data to train a model that can autonomously recognize labels on new images not present in the dataset, achieving a model accuracy of 94%. In the future, you have the option to train a new model using the dataset to achieve higher accuracy or employ the existing model to automatically identify bio and nutri labels in newly collected images, eliminating the need for manual review. We should emphasize that these resources should be utilized by a data science team. There is an opportunity for this model to be integrated with a mobile app, but this is a direction for future work, we included in the revised version. In this research, we introduce the NutriGreen dataset, which is a collection of images representing packaged food products. Each image in the dataset comes with three distinct labels: one indicating its nutritional value using the Nutri-Score, another denoting whether it's vegan or vegetarian with the V-label, and a third displaying the EU organic certification (BIO) logo. The dataset comprises a total of 10,472 images. Among these, the Nutri-Score label is distributed across five sub-labels: A with 1,250 images, B with 1,107 images, C with 867 images, D with 1,001 images, and E with 967 images. Additionally, there are 870 images featuring the V-Label, 2,328 images showcasing the BIO label, and 3201 images with no labels. Furthermore, we have fine-tuned the YOLOv5 model to demonstrate the practicality of using these annotated datasets, achieving an impressive accuracy of 94.0%. These promising results indicate that this dataset has significant potential for training innovative systems capable of detecting food labels. Moreover, it can serve as a valuable benchmark dataset for emerging computer vision systems.
创建时间:
2023-10-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作