Chuntianli/CrossVid
Source: Hugging Face. Updated: 2025-11-13. Indexed: 2025-11-15.
Download link:
https://hf-mirror.com/datasets/Chuntianli/CrossVid
Description:
CrossVid is a large-scale multi-task dataset designed to advance the cross-video understanding capabilities of vision-language models. It covers 10 task types that require models to reason across multiple videos and to understand temporal dynamics, spatial relationships, and complex narrative structures.
Provided by:
Chuntianli



