Replication Data for: Summarizing First-Person Videos from Third Persons' Points of Views

Mendeley Data2024-03-27 更新2024-06-28 收录

下载链接：

https://dataverse.lib.nycu.edu.tw/citation?persistentId=doi:10.57770/WBH60V

下载链接

链接失效反馈

官方服务：

资源简介：

Video highlight or summarization is among interesting topics in computer vision, which benefits a variety of applications like viewing, searching, or storage. However, most existing studies rely on training data of third-person videos, which cannot easily generalize to highlight the first-person ones. With the goal of deriving an effective model to summarize first-person videos, we propose a novel deep neural network architecture for describing and discriminating vital spatiotemporal information across videos with different points of view. Our proposed model is realized in a semi-supervised setting, in which fully annotated third-person videos, unlabeled first-person videos, and a small number of annotated first-person ones are presented during training. In our experiments, qualitative and quantitative evaluations on both benchmarks and our collected first-person video datasets are presented.

创建时间：

2023-06-28

5,000+

优质数据集

54 个

任务类型

进入经典数据集