Download link:

https://modelscope.cn/datasets/BAAI/BAAI-DataCube_Forest-Dataset
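
A minimal sketch of fetching the dataset programmatically, assuming the ModelScope Python SDK is installed (`pip install modelscope`) and that the installed version exposes `dataset_snapshot_download` with a `local_dir` parameter. The helper names `dataset_id_from_url` and `download_dataset` and the target directory `./forest_dataset` are hypothetical and chosen only for illustration.

```python
from urllib.parse import urlparse

# URL copied from the download link above.
DATASET_URL = "https://modelscope.cn/datasets/BAAI/BAAI-DataCube_Forest-Dataset"


def dataset_id_from_url(url: str) -> str:
    """Extract the '<namespace>/<name>' dataset ID from a ModelScope dataset URL."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    if len(parts) < 3 or parts[0] != "datasets":
        raise ValueError(f"not a ModelScope dataset URL: {url}")
    return f"{parts[1]}/{parts[2]}"


def download_dataset(url: str, local_dir: str = "./forest_dataset") -> str:
    """Download the dataset files into local_dir and return the local path."""
    # Imported lazily so the URL helper works without the SDK installed.
    # Assumption: a recent ModelScope SDK that provides dataset_snapshot_download.
    from modelscope.hub.snapshot_download import dataset_snapshot_download

    return dataset_snapshot_download(dataset_id_from_url(url), local_dir=local_dir)


if __name__ == "__main__":
    # Only parses the URL; no network access happens here.
    print(dataset_id_from_url(DATASET_URL))  # BAAI/BAAI-DataCube_Forest-Dataset
```

Calling `download_dataset(DATASET_URL)` would then start the actual download. Check the SDK documentation for the exact function signature in your installed version before relying on it.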

Resource description:

## Dataset Introduction

* The DataCube series of datasets (https://datacube.baai.ac.cn) is automatically constructed by the DataCube product of the Beijing Academy of Artificial Intelligence (BAAI) for different scenarios and requirements. You are welcome to join the user communication group.

  ![QR Code](https://resources.ks3-cn-beijing.ksyuncs.com/projects/DataCube/QR.png)

* The series covers 7 major categories and 40 high-quality video datasets, detailed below:

| Series | Sub-direction | Dataset Name (translated) | English Dataset Name | Application Scenarios |
| --- | --- | --- | --- | --- |
| Human Motion and Interaction | Hand Motions | Hand Motion Video Dataset | Gesture-Recognition | For gesture recognition models, human-computer interaction models, VR/AR gesture control, smart home command recognition, etc. |
| | | Handmade/DIY Video Dataset | DIY | For operation step recognition, hand motion tracking, etc.; suitable for training robots to learn assembly/manufacturing tasks, as well as for action decomposition and imitation learning |
| | | Cooking Video Dataset | Cooking | Suitable for action decomposition learning (chopping, stir-frying, etc.), task planning (cooking step prediction), and other scenarios |
| | Body Motions | Fitness Exercise Video Dataset | Fitness-Exercise | For motion recognition and error correction models, supporting fitness guidance, posture correction, intelligent fitness coaching, and other scenarios |
| | | Outdoor Sports Video Dataset | Outdoor-Sports | Suitable for human keypoint recognition, motion analysis, etc., supporting sports event analysis, intelligent refereeing, sports-assistance robots, and other applications |
| | | Children's Activities Video Dataset | Children’s-Activities | For behavior detection, child safety monitoring, etc.; suitable for educational robots, companion robots, and other applications |
| | Facial Expression and Emotion | Emotion Video Dataset | Facial-Expression | For training emotion recognition models, supporting affective computing in human-computer interaction, customer service robots, mental health monitoring, and other scenarios |
| | | Fine-grained Facial Expression Video Dataset | Fine-grained-Facial Expression | A fine-grained facial expression recognition dataset, supporting micro-expression analysis, multimodal dialogue systems, and other applications, to improve the emotional understanding of LLMs in embodied interaction |
| | Social Interaction | Two-person Interaction Video Dataset | Two-person-Interaction | For training social interaction recognition models, supporting dialogue scenario detection, robot social skill simulation, and other scenarios |
| | | Crowd Gathering Video Dataset | Crowd | For crowd detection, density estimation, urban security, anomaly detection in public-space monitoring, and other scenarios |
| | | Campus Video Dataset | Campus | For training group interaction recognition models, supporting campus security, classroom scenario analysis, educational AI system construction, and other applications |
| First-person View and Navigation | Driving and First-person View | First-person Driving Video Dataset | First-person-Driving | For autonomous driving and first-person navigation tasks, supporting driving behavior prediction, path planning, and other applications |
| | Navigation and Camera Movement | Indoor Scene Video Dataset | Indoor-Scene-Dataset | For training indoor navigation models, supporting service robots in indoor navigation, obstacle avoidance, and other tasks |
| | | Road Scene Video Dataset | Road-trip-Dataset | For road scene perception, autonomous driving scenario simulation, and other tasks and scenarios |
| Performance and Stage | Dance and Performance | Dance Performance Video Dataset | Dance-Performance Dataset | For human motion generation, dance motion imitation, and other scenarios |
| | | Song Performance Video Dataset | Song-Performance Dataset | For singing motion recognition, stage performance motion prediction, and other applications |
| | Stage/Speech | Speech Video Dataset | Speech-Dataset | For training posture-speech multimodal models, supporting speaker gesture analysis, virtual hosts, and other scenarios |
| | | Performing Arts Video Dataset | Performing-Arts-Dataset | For virtual performance generation, digital human training, stage motion understanding, and other scenarios |
| | | News Broadcast Video Dataset | Breaking-News-Dataset | For virtual news anchor generation and other applications |
| Hot Topics and Festivals | Valentine's Day | Valentine's Day Video Dataset | Valentine’s-Day-Dataset | Suitable for advertising recommendation, festival event detection, emotional interaction, and other applications |
| Security and Public Safety | Fire | Fire Incident Video Dataset | Fire-Incident-Dataset | For training fire detection models, supporting intelligent monitoring, robot emergency response, and other applications |
| | Flood | Flood Video Dataset | Flood-Dataset | For training natural disaster recognition models, supporting disaster prevention, early warning, and other applications |
| | Gas Station | Gas Station Video Dataset | Gas-Station-Dataset | For training scenario-specific security monitoring models, supporting production safety, abnormal behavior detection, and other applications |
| | Labor Safety | Worker Wear Detection Video Dataset | Worker-Wearing-Detection-Dataset | For training safety detection models, supporting industrial safety, compliance monitoring, and other applications |
| Daily Life | Family Life | Family Life Video Dataset | Living-Room-Dataset | For training daily behavior recognition models; suitable for home service robots, smart security, and other applications |
| | Children's Education | Children's Learning Video Dataset | Children-Classroom Dataset | For training behavior recognition models for educational scenarios, supporting classroom interaction detection, educational robots, and other applications |
| Animals and Nature | Animals | Bird Video Dataset | Bird-Dataset | For training animal recognition models, etc. |
| | | Cat Video Dataset | Cat-Dataset | Supports animal recognition and classification, pet behavior analysis, small animal detection in video monitoring, and other applications |
| | | Dog Video Dataset | Dog-Dataset | Canine motion recognition; supports guide dog/working dog behavior modeling, pet expression recognition, multimodal video question answering, and other applications |
| | | Fish Video Dataset | Fishes-Dataset | Supports underwater detection and recognition, marine organism monitoring, and other applications |
| | | Insect Video Dataset | Insect-Dataset | Supports ecological environment monitoring, pest detection and smart agriculture, tiny target detection and tracking research, and other applications |
| | Nature | Rainforest Video Dataset | Rainforest-Dataset | Supports complex scene recognition, environmental change monitoring, ecological protection research, and other applications |
| | | Desert Video Dataset | Desert-Dataset | Supports extreme environment recognition, visual perception training for autonomous driving in sparse terrain, climate change research, etc. |
| | | Mountain Video Dataset | Mountain-Dataset | Outdoor scene recognition, drone navigation, AR/VR scene generation, landform classification, etc. |
| | | River Video Dataset | River -Dataset | Supports water area detection and monitoring, hydrological environment modeling, flood disaster monitoring, and other applications |
| | | Ocean Video Dataset | Ocean-Dataset | Supports marine scene segmentation, underwater robot perception, marine organism recognition, and other model training and applications |
| | | Grassland Video Dataset | Grassland-Dataset | For models of large-area ecological environment recognition, drone remote sensing, wildlife habitat analysis, etc. |
| | | Sky Video Dataset | Sky-Dataset | For weather recognition, aerial and satellite vision, background modeling for autonomous driving vision, and other scenarios |
| | | Forest Video Dataset | Forest-Dataset | Forest fire prevention monitoring, drone forestry patrol, complex vegetation environment modeling, and other scenarios |
| | | Flower Video Dataset | Flower-Dataset | Plant recognition and classification, smart horticulture/agriculture, fine-grained image recognition, multimodal aesthetic generation, and other scenarios |
| | | Rain Video Dataset | Rain-Dataset | Weather perception and prediction, visual deraining task training, autonomous driving testing in severe weather, and other scenarios |

## DataCube Product Introduction

* DataCube, developed by the Beijing Academy of Artificial Intelligence (BAAI), enables precise retrieval from the "dataset level" down to the "data sample level" to meet users' individual data needs. Users describe their data requirements in natural language, and the platform quickly builds a customized dataset, removing the constraints of earlier manual workflows.
* In its core architecture, DataCube relies on the more than 100 data processing operators supported by the BAAI data platform to provide an automated processing pipeline for petabyte-scale data. It integrates the CLIP Understanding engine to perform fine-grained semantic parsing of multimodal samples along dimensions such as subject, behavior, viewpoint, and style, building a detailed data profile for each sample. The Hybrid Retrieval system then provides millisecond-level, accurate recall of cross-modal features. Finally, Data Evaluation assesses the quality of the constructed dataset, yielding a personalized, high-quality dataset. DataCube has the following characteristics:
  * Fast response: after requirements are entered in natural language, it can respond and generate a dataset within seconds at the fastest
  * Low cost: it eliminates tedious data screening and filtering, greatly reducing labor and time costs
  * Large data volume: it currently holds more than 50 million data samples, and the number is still growing
  * High data quality: built-in deep semantic understanding algorithms ensure that the retrieved data is highly relevant
* DataCube usage demo: [DataCube Demo - DIY - August 31.mov](https://resources.ks3-cn-beijing.ksyuncs.com/projects/DataCube/demo.mov)

## License and Usage

* This dataset is for academic research and non-commercial use only
* Use of this dataset constitutes acceptance of the relevant agreements and terms of use of the BAAI DataCube product
* For feedback and business cooperation, please contact: <BAAI_data@baai.ac.cn>

Application scenarios:

BAAI-DataCube_Forest Video Dataset