MM-Office Dataset: multi-view and multi-modal dataset in an office environment
收藏Mendeley Data2024-05-10 更新2024-06-27 收录
下载链接:
https://zenodo.org/records/6088572
下载链接
链接失效反馈官方服务:
资源简介:
MM-office is a multi-view and multi-modal dataset in an office environment (MM-Office) that records events, e.g., 'enter' to the office room, 'sit down' on the chair, and 'take out' something from a shelf, in the room assuming the daily work. These events are recorded simultaneously using eight non-directional microphones and four cameras. The audio and video clips are divided into scenes, each about 30 to 90 seconds. The amount of data was 880 clips per point and sensor. The labels available for training are given as multi-labels that indicate which each clip contains what event. Only the test data is annotated with a strong label containing the onset/offset time of each event. License: see the file named LICENSE.pdf Further information is available at [1] and Github: https://github.com/nttrd-mdlab/mm-office [1] Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito, Noboru Harada “Multi-view and Multi-modal Event Detection Utilizing Transformer-based Multi-sensor fusion,” in IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2022.
创建时间:
2023-06-28
搜集汇总
数据集介绍

背景与挑战
背景概述
MM-Office数据集是一个多视角和多模态的办公室环境数据集,用于记录和分析日常办公事件。数据集包含音频和视频数据,通过多个传感器采集,并提供详细的标签信息,适用于事件检测和多传感器融合研究。
以上内容由遇见数据集搜集并总结生成



