five

Egoshots

收藏
arXiv2020-03-27 更新2024-06-21 收录
下载链接:
https://github.com/NataliaDiaz/Egoshots
下载链接
链接失效反馈
官方服务:
资源简介:
Egoshots数据集是由法国国家信息与自动化研究所创建的,包含978张真实生活场景的图像,旨在评估图像字幕模型的多样性和鲁棒性。该数据集通过Autographer相机随机拍摄,涵盖室内外多种场景,用于解决现有图像字幕模型在日常生活中的应用限制问题。数据集的创建过程涉及使用预训练的图像字幕和对象识别网络进行图像标注,并提出了一种新的图像字幕评估指标——基于对象的语义保真度(SF),以评估无标注图像的字幕质量。Egoshots数据集的应用领域包括支持视觉障碍人士和推荐系统等,旨在提高这些系统的可靠性和实用性。

The Egoshots dataset was developed by the Institut National de Recherche en Informatique et en Automatique (INRIA), consisting of 978 images depicting real-life scenarios. Its core objective is to evaluate the diversity and robustness of image captioning models. Captured randomly via Autographer cameras, the dataset covers a wide range of indoor and outdoor scenes, aiming to address the application limitations of existing image captioning models in daily life. The construction of this dataset involves using pre-trained image captioning and object recognition networks for image annotation, and proposes a novel evaluation metric for image captioning: Object-based Semantic Fidelity (SF), which is used to assess the caption quality of unannotated images. Application scenarios of the Egoshots dataset include assisting visually impaired individuals and recommendation systems, with the goal of improving the reliability and practicality of these systems.
提供机构:
法国国家信息与自动化研究所 (INRIA)
创建时间:
2020-03-26
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作