BAAI-DataCube_花视频数据集
收藏魔搭社区2025-12-06 更新2025-09-06 收录
下载链接:
https://modelscope.cn/datasets/BAAI/BAAI-DataCube_Flower-Dataset
下载链接
链接失效反馈官方服务:
资源简介:
## 数据集简介
* 智源数据魔方( https://datacube.baai.ac.cn )系列数据集由智源数据魔方产品根据不同场景和需求自动构建,欢迎加入用户沟通群

* 涵盖7大方向,40个高质量视频数据,详情如下:
| 系列 | 子方向 | 数据集中文名称 | 数据集英文名称 | 应用场景 |
| ------- | ------- | ----------- | -------------------------------- | --------------------------------------------------- |
| 人物动作与交互 | 手部动作 | 手部动作视频数据集 | Gesture-Recognition | 用于手势识别模型、人机交互模型,VR/AR 手势控制、智能家居指令识别等场景 |
| | | 手工/DIY视频数据集 | DIY | 用于操作步骤识别、手部动作跟踪等场景,适合训练机器人 学习装配/制作任务,也可用于动作分解和模仿学习。 |
| | | 烹饪视频数据集 | Cooking | 适合动作分解学习(切菜、翻炒等),任务规划(烹饪步骤预测)等场景 |
| | 身体动作 | 健身运动视频数据集 | Fitness-Exercise | 用于动作识别与纠错模型,支持健身指导、姿态矫正、智能健身教练等场景 |
| | | 室外运动视频数据集 | Outdoor-Sports | 适合人体关键点识别、动作分析等场景,支持体育赛事分析、智能裁判和运动辅助机器人等应用 |
| | | 儿童活动视频数据集 | Children’s-Activities | 用于行为检测、儿童安全监控等场景,适合教育机器人、陪伴机器人等应用 |
| | 表情与情绪 | 情绪视频数据集 | Facial-Expression | 用于训练情绪识别模型,支持人机交互情感计算、客服机器人、心理健康监测等场景 |
| | | 表情视频数据集 | Fine-grained-Facial Expression | 细粒度表情识别数据集,支持微表情分析、多模态对话系统等应用,提升 LLM 在具身交互中的情感理解能力 |
| | 社会交互 | 双人互动视频数据集 | Two-person-Interaction | 训练社交互动识别等模型,支持对话场景检测、机器人社交技能模拟等场景 |
| | | 人群聚集视频数据集 | Crowd | 用于人群检测、密度估计、城市安防、公共场所监控中的异常检测等场景 |
| | | 校园视频数据集 | Campus | 训练群体交互识别模型,支持校园安全、课堂场景分析、教育 AI 系统建设等应用 |
| 第一人称与导航 | 驾驶与第一人称 | 第一人称驾驶视频数据集 | First-person-Driving | 自动驾驶和第一人称导航任务,支持 驾驶行为预测、路径规划等应用 |
| | 导航与镜头移动 | 室内场景视频数据集 | Indoor-Scene-Dataset | 支持训练室内导航等模型,支持服务机器人在 室内环境中导航、避障等任务 |
| | | 公路场景视频数据集 | Road-trip-Dataset | 用于道路场景感知、自动驾驶场景模拟等任务和场景 |
| 表演与舞台 | 舞蹈与表演 | 舞蹈表演视频数据集 | Dance-Performance Dataset | 用于人体动作生成、舞蹈动作模仿等场景 |
| | | 歌曲表演视频数据集 | Song-Performance Dataset | 用于歌唱动作识别、舞台表演动作预测等应用 |
| | 舞台/演讲 | 演讲视频数据集 | Speech-Dataset | 训练姿态-语音多模态模型,支持 演讲者手势分析、虚拟主持人等场景 |
| | | 表演视频数据集 | Performing-Arts-Dataset | 支持虚拟表演生成、数字人训练、舞台动作理解等场景 |
| | | 新闻播报视频数据集 | Breaking-News-Dataset | 支持虚拟新闻主播生成等应用 |
| 热点与节日 | 情人节 | 情人节日视频数据集 | Valentine’s-Day-Dataset | 训适合广告推荐、节日事件检测、情感交互等应用 |
| 安全与安防 | 火灾 | 火灾视频数据集 | Fire-Incident-Dataset | 训练火灾检测等模型,支持智能监控、机器人应急响应 |
| | 洪水 | 洪水视频数据集 | Flood-Dataset | 训练自然灾害识别等模型,支持防灾预警等应用 |
| | 加油站 | 加油站视频数据集 | Gas-Station-Dataset | 训练特定场景监控等安全模型,支持安全生产和异常行为检测等应用 |
| | 劳动安全 | 工人穿戴视频数据集 | Worker-Wearing-Detection-Dataset | 训练安全检测模型,支持工业安全与合规监控等应用 |
| 生活与日常 | 家庭生活 | 家庭生活视频数据集 | Living-Room-Dataset | 训练 日常行为识别等模型,适合家庭服务机器人、智能安防等应用 |
| | 儿童教育 | 儿童学习视频数据集 | Children-Classroom Dataset | 训练 教育场景行为识别等模型,支持课堂互动检测、教育机器人等应用 |
| 动物与自然 | 动物 | 鸟视频数据集 | Bird-Dataset | 训练动物识别等模型 |
| | | 猫视频数据集 | Cat-Dataset | 支持动物识别与分类、宠物行为分析、视频监控中小动物检测等应用 |
| | | 狗视频数据集 | Dog-Dataset | 犬类动作识别、支持导盲犬/工作犬行为建模、宠物表情识别、多模态视频问答等应用 |
| | | 鱼视频数据集 | Fishes-Dataset | 支持水下检测与识别、海洋生物监测等应用 |
| | | 昆虫视频数据集 | Insect-Dataset | 支持生态环境监测、害虫检测与农业智能化、微小目标检测与跟踪研究等应用 |
| | 自然 | 雨林视频数据集 | Rainforest-Dataset | 支持复杂场景识别、环境变化监测、生态保护研究等应用 |
| | | 沙漠视频数据集 | Desert-Dataset | 支持极端环境识别、自动驾驶在稀疏地貌中的视觉感知训练、气候变化研究 |
| | | 山川视频数据集 | Mountain-Dataset | 户外场景识别、无人机导航、AR/VR 场景生成、地貌分类 |
| | | 河流视频数据集 | River -Dataset | 支持水域检测与监控、水文环境建模、洪涝灾害监测等应用 |
| | | 海洋视频数据集 | Ocean-Dataset | 支持海洋场景分割、水下机器人感知、海洋生物识别等模型训练和应用 |
| | | 草原视频数据集 | Grassland-Dataset | 应用于大面积生态环境识别、无人机遥感任务、野生动物栖息地分析等模型 |
| | | 天空视频数据集 | Sky-Dataset | 应用于天气识别、航拍与卫星视觉、自动驾驶视觉背景建模等场景 |
| | | 森林视频数据集 | Forest-Dataset | 森林防火监控、无人机林业巡检、复杂植被环境建模等场景 |
| | | 花视频数据集 | Flower-Dataset | 植物识别与分类、园艺/农业智能化、细粒度图像识别、多模态美学生成等场景 |
| | | 雨视频数据集 | Rain-Dataset | 天气感知与预测、视觉去雨任务训练、自动驾驶在恶劣天气下的测试等场景 |
## 数据魔方产品介绍
* 智源研究院研发的数据魔方(DataCube)可实现从“数据集Level”到“数据样本Level”的精准检索,满足用户个性化的数据需求,只需自然语言输入数据需求,平台便能快速构建专属的个性化数据集,彻底打破以往操作流程的桎梏。
* 在核心技术架构层面,数据魔方依托智源数据平台支持的100+数据处理算子,实现 PB 级数据的自动化处理Pipeline;深度融合 CLIP Understanding 引擎,对多模态样本从本体、行为、视角与风格等多维细进行细粒度语义解析,逐条构建精细数据画像;辅以 Hybrid Retrieval 体系,实现跨模态特征的毫秒级精准召回;最终通过 Data Evaluation 评估数据集构建效果,实现个性化高质量数据集生成。数据魔方具有以下特点:
*  响应速度“快”,自然语言输入需求后,最快能秒级反馈并生成数据集
* 成本“省”,免去繁琐的数据筛选过滤过程,极大降低了人力与时间成本
* 数据量“多”,目前已汇聚5000W+数据样本基础,且数量仍在持续增长
* 数据质量“好”,内置深度语义理解算法,确保检索到的数据相关性极高
* 数据魔方使用演示:
[数据魔方演示-DIY-8月31日.mov](https://resources.ks3-cn-beijing.ksyuncs.com/projects/DataCube/demo.mov)
## 许可与使用
* 本数据集仅限学术研究与非商业用途
* 使用即视为同意智源研究院数据魔方产品相关协议和使用条款
* 意见反馈与业务合作,欢迎联系:<BAAI_data@baai.ac.cn> 
## Dataset Introduction
* The DataCube series datasets (https://datacube.baai.ac.cn) are automatically constructed by the BAAI DataCube product according to different scenarios and requirements. You are welcome to join the user communication group

* Covering 7 categories and 40 high-quality video datasets, details are as follows:
| Series | Sub-direction | Chinese Dataset Name | English Dataset Name | Application Scenarios |
|-----------------------|---------------------|-------------------------------|-----------------------------------------------|---------------------------------------------------------------------------------------|
| Human Motion & Interaction | Hand Motions | Hand Motion Video Dataset | Gesture-Recognition | For scenarios including gesture recognition model training, human-computer interaction model training, VR/AR gesture control, smart home instruction recognition, etc. |
| | | Handcraft/DIY Video Dataset | DIY | For operation step recognition, hand motion tracking and other scenarios, suitable for training robots to learn assembly/production tasks, as well as action decomposition and imitation learning. |
| | | Cooking Video Dataset | Cooking | Suitable for action decomposition learning (chopping, stir-frying, etc.), task planning (cooking step prediction) and other scenarios |
| | Body Motions | Fitness Exercise Video Dataset | Fitness-Exercise | For motion recognition and error correction models, supporting scenarios such as fitness guidance, posture correction, and intelligent fitness coaches |
| | | Outdoor Sports Video Dataset | Outdoor-Sports | Suitable for human keypoint recognition, motion analysis and other scenarios, supporting applications such as sports event analysis, intelligent referees, and sports-assisted robots |
| | | Children's Activities Video Dataset | Children’s-Activities | For behavior detection, child safety monitoring and other scenarios, suitable for educational robots, companion robots and other applications |
| | Facial Expression & Emotion | Emotion Video Dataset | Facial-Expression | For training emotion recognition models, supporting human-computer interaction affective computing, customer service robots, mental health monitoring and other scenarios |
| | | Fine-grained Facial Expression Video Dataset | Fine-grained-Facial Expression | Fine-grained facial expression recognition dataset, supporting micro-expression analysis, multimodal dialogue systems and other applications, to enhance the emotional understanding ability of LLMs in embodied interaction |
| | Social Interaction | Two-person Interaction Video Dataset | Two-person-Interaction | Training social interaction recognition and other models, supporting scenarios such as dialogue scene detection, robot social skill simulation |
| | | Crowd Gathering Video Dataset | Crowd | For crowd detection, density estimation, urban security, anomaly detection in public place monitoring and other scenarios |
| | | Campus Video Dataset | Campus | Training group interaction recognition models, supporting applications such as campus security, classroom scene analysis, and educational AI system construction |
| First-person View & Navigation | Driving & First-person | First-person Driving Video Dataset | First-person-Driving | Autonomous driving and first-person navigation tasks, supporting driving behavior prediction, path planning and other applications |
| | Navigation & Camera Movement | Indoor Scene Video Dataset | Indoor-Scene-Dataset | Supporting training indoor navigation and other models, supporting service robots in indoor environment navigation, obstacle avoidance and other tasks |
| | | Road Trip Video Dataset | Road-trip-Dataset | For road scene perception, autonomous driving scene simulation and other tasks and scenarios |
| Performance & Stage | Dance & Performance | Dance Performance Video Dataset | Dance-Performance Dataset | For human motion generation, dance motion imitation and other scenarios |
| | | Song Performance Video Dataset | Song-Performance Dataset | For singing action recognition, stage performance action prediction and other applications |
| | Stage/Speech | Speech Video Dataset | Speech-Dataset | Training posture-speech multimodal models, supporting speaker gesture analysis, virtual host and other scenarios |
| | | Performing Arts Video Dataset | Performing-Arts-Dataset | Supporting virtual performance generation, digital human training, stage motion understanding and other scenarios |
| | | News Broadcast Video Dataset | Breaking-News-Dataset | Supporting applications such as virtual news anchor generation |
| Hot Topics & Festivals | Valentine's Day | Valentine's Day Video Dataset | Valentine’s-Day-Dataset | Suitable for advertising recommendation, festival event detection, emotional interaction and other applications |
| Security & Public Safety | Fire | Fire Incident Video Dataset | Fire-Incident-Dataset | Training fire detection and other models, supporting intelligent monitoring, robot emergency response |
| | Flood | Flood Video Dataset | Flood-Dataset | Training natural disaster recognition and other models, supporting disaster prevention and early warning and other applications |
| | Gas Station | Gas Station Video Dataset | Gas-Station-Dataset | Training specific scene monitoring and other security models, supporting safe production and abnormal behavior detection and other applications |
| | Labor Safety | Worker PPE Video Dataset | Worker-Wearing-Detection-Dataset | Training safety detection models, supporting industrial safety and compliance monitoring and other applications |
| Daily Life & Household | Household Life | Household Life Video Dataset | Living-Room-Dataset | Training daily behavior recognition and other models, suitable for household service robots, intelligent security and other applications |
| | Children's Education | Children's Learning Video Dataset | Children-Classroom Dataset | Training education scene behavior recognition and other models, supporting classroom interaction detection, educational robots and other applications |
| Animals & Nature | Animals | Bird Video Dataset | Bird-Dataset | Training animal recognition and other models |
| | | Cat Video Dataset | Cat-Dataset | Supporting animal recognition and classification, pet behavior analysis, small animal detection in video monitoring and other applications |
| | | Dog Video Dataset | Dog-Dataset | Canine motion recognition, supporting guide dog/working dog behavior modeling, pet expression recognition, multimodal video QA and other applications |
| | | Fish Video Dataset | Fishes-Dataset | Supporting underwater detection and recognition, marine life monitoring and other applications |
| | | Insect Video Dataset | Insect-Dataset | Supporting ecological environment monitoring, pest detection and agricultural intelligence, tiny target detection and tracking research and other applications |
| | Nature | Rainforest Video Dataset | Rainforest-Dataset | Supporting complex scene recognition, environmental change monitoring, ecological protection research and other applications |
| | | Desert Video Dataset | Desert-Dataset | Supporting extreme environment recognition, visual perception training of autonomous driving in sparse terrain, climate change research |
| | | Mountain Video Dataset | Mountain-Dataset | Outdoor scene recognition, UAV navigation, AR/VR scene generation, landform classification |
| | | River Video Dataset | River -Dataset | Supporting water area detection and monitoring, hydrological environment modeling, flood disaster monitoring and other applications |
| | | Ocean Video Dataset | Ocean-Dataset | Supporting marine scene segmentation, underwater robot perception, marine organism recognition and other model training and applications |
| | | Grassland Video Dataset | Grassland-Dataset | Applied to large-area ecological environment recognition, UAV remote sensing tasks, wildlife habitat analysis and other models |
| | | Sky Video Dataset | Sky-Dataset | Applied to weather recognition, aerial and satellite vision, autonomous driving vision background modeling and other scenarios |
| | | Forest Video Dataset | Forest-Dataset | Forest fire prevention monitoring, UAV forest patrol, complex vegetation environment modeling and other scenarios |
| | | Flower Video Dataset | Flower-Dataset | Plant recognition and classification, horticultural/agricultural intelligence, fine-grained image recognition, multimodal aesthetic generation and other scenarios |
| | | Rain Video Dataset | Rain-Dataset | Weather perception and prediction, visual deraining task training, autonomous driving testing in severe weather and other scenarios |
## DataCube Product Introduction
* DataCube, developed by Beijing Academy of Artificial Intelligence (BAAI), enables precise retrieval from "Dataset Level" to "Data Sample Level" to meet users' personalized data needs. By simply inputting data requirements in natural language, the platform can quickly build exclusive personalized datasets, completely breaking through the shackles of previous operating procedures.
* In terms of core technical architecture, DataCube relies on more than 100 data processing operators supported by the BAAI data platform to realize an automated processing pipeline for petabyte-scale data. It deeply integrates the CLIP Understanding engine to perform fine-grained semantic parsing of multimodal samples from multiple dimensions including ontology, behavior, perspective and style, and constructs detailed data profiles for each sample. Supplemented by the Hybrid Retrieval system, it achieves millisecond-level precise recall of cross-modal features. Finally, it evaluates the dataset construction effect through Data Evaluation to realize the generation of personalized high-quality datasets. DataCube has the following characteristics:
* Fast response speed: After inputting requirements in natural language, it can generate datasets with feedback as fast as within seconds
* Low cost: Eliminating tedious data screening and filtering processes, greatly reducing labor and time costs
* Large data volume: Currently with a base of more than 50 million data samples, and the number is still growing continuously
* High data quality: Built with deep semantic understanding algorithms to ensure extremely high relevance of retrieved data
* DataCube Demo:
[DataCube Demo - DIY - August 31.mov](https://resources.ks3-cn-beijing.ksyuncs.com/projects/DataCube/demo.mov)
## License and Usage
* This dataset is only for academic research and non-commercial use
* By using this dataset, you are deemed to have agreed to the relevant agreements and terms of service of the BAAI DataCube product
* For feedback and business cooperation, please contact: <BAAI_data@baai.ac.cn>
提供机构:
maas
创建时间:
2025-09-03
搜集汇总
数据集介绍

以上内容由遇见数据集搜集并总结生成



