Alibaba-NLP/UVRB

Name: Alibaba-NLP/UVRB
Creator: Alibaba-NLP
Published: 2025-11-06 06:14:25
License: 暂无描述

Hugging Face2025-11-06 更新2025-11-15 收录

下载链接：

https://hf-mirror.com/datasets/Alibaba-NLP/UVRB

下载链接

链接失效反馈

官方服务：

资源简介：

UVRB是一个用于全面评估视频检索模型泛化能力的基准数据集，包含16个子数据集，支持3种查询类型（文本到视频、组合查询、视觉查询），并从6个能力维度进行评估。它不仅关注准确性，还关注模型成功或失败的原因。UVRB能够揭示传统基准测试（如MSRVTT）所忽略的空间推理、时间动态、组合理解和长上下文检索等方面的关键差距。

UVRB is a comprehensive benchmark suite designed to evaluate the generalization ability of video embedding models, containing 16 sub-datasets supporting 3 query types (text-to-video, composed query, visual query), and assessed across 6 capability dimensions. It not only focuses on accuracy but also on why the model succeeds or fails. UVRB reveals critical gaps in spatial reasoning, temporal dynamics, compositional understanding, and long-context retrieval that traditional benchmarks like MSRVTT completely miss.

提供机构：

Alibaba-NLP

5,000+

优质数据集

54 个

任务类型

进入经典数据集