VideoCount
收藏CountVid: Open-World Object Counting in Videos 数据集概述
数据集基本信息
- 作者: Niki Amini-Naieni & Andrew Zisserman
- 论文: Open-World Object Counting in Videos
- 代码库: CountVid GitHub
- 实现框架: PyTorch
数据集组成
主要数据集
-
FSCD-147
- 下载链接: FSCD-147
- 配置文件:
config/datasets_fscd147_val.jsonconfig/datasets_fscd147_test.json
-
VideoCount
- 下载链接: VideoCount
- 包含子数据集:
- Crystals
- MOT20-Count
- Penguins
- TAO-Count
补充数据
- TAO验证集视频: 2-TAO_VAL.zip
- MOT20训练视频: MOT20.zip
数据集结构
VideoCount目录结构
VideoCount/ |Crystals/ |anno/ |crystals-count-gt.json |crystals-frame-level-counts-gt.json |exemplars/ |frames/ |MOT20-Count/ |anno/ |frames/ |MOT20-01/ |MOT20-02/ |MOT20-05/ |Penguins/ |TAO-Count/ |anno/ |frames/ |val/ |ArgoVerse/ |AVA/ |BDD/ |Charades/ |HACS/ |LaSOT/ |YFCC100M/
标注信息
- 全局计数文件:
[benchmark_name]-count-gt.json - 帧级计数文件:
[benchmark_name]-frame-level-counts-gt.json
特殊说明
-
Science-Count (Penguins)
- 使用文本提示"penguin"检测所有海鸟(企鹅和鸬鹚)
-
Science-Count (Crystals)
- 高密度帧的帧级计数可能存在5%误差
预训练模型
- CountGD-Box模型: countgd_box.pth
- SAM 2.1权重: sam2.1_hiera_large.pt
引用
bibtex @article{AminiNaieni25, title={Open-World Object Counting in Videos}, author={Amini-Naieni, N. and Zisserman, A.}, journal={arXiv preprint arXiv:2506.15368}, year={2025} }
@InProceedings{AminiNaieni24, title = {CountGD: Multi-Modal Open-World Counting}, author = {Amini-Naieni, N. and Han, T. and Zisserman, A.}, booktitle = {Advances in Neural Information Processing Systems (NeurIPS)}, year = {2024}, }
致谢
- 使用了以下代码库:
- 数据来源:
- TAO: arXiv:2005.10356
- MOT20: arXiv:2003.09003
- 资金支持: UKRI Grant VisualAI



