M-VAD Names

Source: arXiv. Updated 2019-03-05. Indexed 2024-06-21.
Download link:
https://github.com/aimagelab/mvad-names-dataset
Description:
The M-VAD Names dataset was created at the Enzo Ferrari School of Engineering, University of Modena and Reggio Emilia, to support the development of naming capabilities in video captioning architectures. It contains 63,442 visual tracks and 34,388 textual mentions, each associated with a character identity. A semi-automatic annotation procedure was used to label the visual appearances of characters, and errors in the original M-VAD annotations were corrected manually. Beyond video captioning, the dataset can also be used for action recognition and for training visual-semantic embedding spaces for video. It addresses the inability of existing video captioning models to refer to characters by their correct names.
Provided by:
Enzo Ferrari School of Engineering, University of Modena and Reggio Emilia
Created:
2019-03-05