CompAct Dataset for Sequential Compositional Generalization in Multimodal Models
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/records/13683664
Description:
CompAct (Compositional Activities) is a benchmark for assessing the compositional generalization abilities of sequential multimodal models. It is a carefully constructed, perceptually grounded dataset built from egocentric kitchen activity videos. Each instance in our dataset combines raw video footage, naturally occurring sound, and crowd-sourced step-by-step descriptions. More importantly, our setup ensures that individual concepts are consistently distributed across the training and evaluation sets, while their compositions are novel in the evaluation set. We conduct a comprehensive assessment of several unimodal and multimodal models.
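
The split property in the description (every concept seen in training, but evaluation compositions unseen) can be illustrated with a minimal Python sketch. The verb-noun pairs, the function names, and the toy data below are assumptions made for illustration only; they are not the dataset's actual schema or tooling, and the check covers only concept coverage and composition novelty, not how evenly concepts are distributed.

```python
def atoms(pairs):
    """Collect every individual concept (verb or noun) used in the pairs."""
    return {concept for pair in pairs for concept in pair}


def is_compositional_split(train, evaluation):
    """Return True when every evaluation concept also occurs in training
    and no evaluation composition occurs in training."""
    train_set, eval_set = set(train), set(evaluation)
    concepts_covered = atoms(eval_set) <= atoms(train_set)
    compositions_novel = train_set.isdisjoint(eval_set)
    return concepts_covered and compositions_novel


# Hypothetical toy data: (verb, noun) compositions of kitchen activities.
train = [("cut", "onion"), ("wash", "tomato"), ("peel", "potato")]
evaluation = [("cut", "tomato"), ("wash", "potato")]

print(is_compositional_split(train, evaluation))
```

Here "cut", "wash", "tomato", and "potato" all occur in training, but the pairs ("cut", "tomato") and ("wash", "potato") do not, so the toy split satisfies both conditions.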
Created:
2024-09-07



