VISIOCITY
Source: arXiv | Updated: 2021-01-26 | Indexed: 2024-06-21
Download link:
https://visiocity.github.io/
Overview:
VISIOCITY is a dataset of 67 long videos spanning six categories, including TV shows, sports, educational content, and personal videos. The videos average about 55 minutes in length and carry dense concept annotations, which support a range of video summarization techniques as well as other computer vision tasks such as video captioning and action recognition. The dataset is intended to address two limitations of existing datasets, short video durations and narrow category coverage, and is suited to in-depth research on domain-specific video summarization.
Provided by:
Department of Computer Science, Indian Institute of Technology Bombay
Created:
2021-01-26



