DoMSEV (Dataset of Multimodal Semantic Egocentric Video)

Name: DoMSEV (Dataset of Multimodal Semantic Egocentric Video)
Creator: OpenDataLab
Published: 2026-05-24 10:30:26
License: No description available

OpenDataLab, updated 2026-05-24, indexed 2024-05-09

Download link:

https://opendatalab.org.cn/OpenDataLab/DoMSEV

Resource overview:

We present DoMSEV, an 80-hour dataset of multimodal semantic egocentric video covering a wide range of activities. The videos were recorded using either a GoPro Hero camera or a custom-built setup consisting of a 3D inertial measurement unit (IMU) attached to an Intel RealSense R200 RGB-D camera. Different people recorded the videos under varying lighting and weather conditions. The recorders labeled the videos to indicate the scene in which segments were filmed (e.g., indoor, urban, crowded environment, or nature), the activity being performed (walking, running, standing, browsing, driving, cycling, eating, cooking, observing, conversing, playing, or shopping), whether something caught their attention, and when they interacted with an object. In addition, we created a profile for each recorder that represents their preferences over a set of objects and visual concepts.
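The label types described above can be modeled as a small data structure. The following is only an illustrative sketch: the page does not specify the dataset's actual annotation file format, so every class, field, and function name here (`SegmentLabel`, `RecorderProfile`, `seconds_per_activity`, `start_s`, `end_s`, and so on) is a hypothetical choice, and the use of per-segment start and end times in seconds is an assumption.

```python
from dataclasses import dataclass, field
from enum import Enum


class Scene(Enum):
    # Scene categories named in the description (listed there as examples).
    INDOOR = "indoor"
    URBAN = "urban"
    CROWDED = "crowded environment"
    NATURE = "nature"


class Activity(Enum):
    # Activity categories named in the description.
    WALKING = "walking"
    RUNNING = "running"
    STANDING = "standing"
    BROWSING = "browsing"
    DRIVING = "driving"
    CYCLING = "cycling"
    EATING = "eating"
    COOKING = "cooking"
    OBSERVING = "observing"
    CONVERSING = "conversing"
    PLAYING = "playing"
    SHOPPING = "shopping"


@dataclass
class SegmentLabel:
    """Hypothetical record for one labeled video segment."""
    start_s: float             # assumed: segment start, in seconds
    end_s: float               # assumed: segment end, in seconds
    scene: Scene
    activity: Activity
    attention: bool = False    # something caught the recorder's attention
    interaction: bool = False  # the recorder interacted with an object

    def __post_init__(self) -> None:
        if self.end_s <= self.start_s:
            raise ValueError("end_s must be greater than start_s")


@dataclass
class RecorderProfile:
    """Hypothetical per-recorder profile: preference score per concept."""
    recorder_id: str
    preferences: dict[str, float] = field(default_factory=dict)


def seconds_per_activity(labels: list[SegmentLabel]) -> dict[Activity, float]:
    """Sum the labeled duration of each activity across segments."""
    totals: dict[Activity, float] = {}
    for label in labels:
        duration = label.end_s - label.start_s
        totals[label.activity] = totals.get(label.activity, 0.0) + duration
    return totals
```

A structure like this would make it straightforward to filter segments by scene or activity, or to check how labeled time is distributed across activities, once the real annotation files are parsed into it.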

Provider:

OpenDataLab

Created:

2022-08-19

Collection Summary

Dataset Introduction