
Video Training Dataset

Source: arXiv (indexed 2025-09-30)
Download link:
https://github.com/NVIDIA/Cosmos
Description:
This curated video training set targets physical AI applications, covering categories such as driving, human motion, and natural dynamics. The videos vary in length and range in resolution from 720p to 4K, and they pass through a multi-step curation process intended to improve data quality for model training. The collection comprises approximately 20 million hours of raw video, with about 100 million clips used for pre-training and 10 million clips used for fine-tuning. The dataset is intended for training physical AI models.
Provider:
Cosmos World Foundation Model Platform