MM-Office Dataset: multi-view and multi-modal dataset in an office environment
Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/6088571
Description:
MM-Office is a multi-view and multi-modal dataset recorded in an office environment. It captures events that occur during simulated daily work in the room, e.g., 'enter' the office room, 'sit down' on a chair, and 'take out' something from a shelf. These events are recorded simultaneously using eight non-directional microphones and four cameras. The audio and video recordings are divided into scene clips, each about 30 to 90 seconds long. There are 880 clips per point and sensor. The labels available for training are multi-labels that indicate which events each clip contains. Only the test data is annotated with strong labels containing the onset/offset times of each event.
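The difference between the two label types can be illustrated with a minimal Python sketch. It is not the dataset's actual file format or loading code: the class list uses only the three example events named above, and `StrongEvent`, `to_weak_label`, and the annotation times are hypothetical, made up for illustration. The sketch shows how strong annotations (event, onset, offset) collapse into a clip-level multi-label vector of the kind described for the training data.

```python
from typing import NamedTuple

# Hypothetical subset of event classes: only the examples named in the description.
EVENT_CLASSES = ["enter", "sit down", "take out"]


class StrongEvent(NamedTuple):
    """One strongly labeled event (hypothetical structure, not the dataset schema)."""
    label: str
    onset: float   # seconds from the start of the clip
    offset: float  # seconds from the start of the clip


def to_weak_label(events: list[StrongEvent]) -> list[int]:
    """Collapse strong annotations into a clip-level multi-hot vector."""
    present = {event.label for event in events}
    return [1 if name in present else 0 for name in EVENT_CLASSES]


# Made-up annotations for a single clip; the times are illustrative only.
clip_events = [
    StrongEvent("enter", 2.0, 4.5),
    StrongEvent("sit down", 10.0, 12.0),
]

weak = to_weak_label(clip_events)
print(weak)  # [1, 1, 0]
```

A multi-label of this kind records only which events occur in a clip, while the strong labels also say when they occur, which is why onset/offset evaluation is possible only on the test data.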
License: see the file named LICENSE.pdf
Further information is available at [1] and on GitHub: https://github.com/nttrd-mdlab/mm-office
[1] Masahiro Yasuda, Yasunori Ohishi, Shoichiro Saito, and Noboru Harada, "Multi-view and Multi-modal Event Detection Utilizing Transformer-based Multi-sensor fusion," in IEEE Int. Conf. Acoust. Speech Signal Process. (ICASSP), 2022.
Created:
2024-07-17



