INSAIT-Institute/Articulate3D
收藏Hugging Face2025-12-02 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/INSAIT-Institute/Articulate3D
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-4.0
task_categories:
- robotics
tags:
- 3D
- scene-understanding
- segmentation
- articulation
---
<p align="right">
<img src="https://cdn-uploads.huggingface.co/production/uploads/637e1f8cf7e01589cc17bf7e/p6d0YFHjWCQ3S12jWqO1m.png" width="700">
</p>
# Dataset Card for Articulate3D
<!-- Provide a quick summary of the dataset. -->
Articulate3D is a dataset providing part segmentation and articulation annotations for 3D indoor scene scans. It supplies structured labels describing object parts, hierarchical relationships, and articulation mechanisms (motion parameters).
The annotations are provided on 280 high-quality ScanNet++ V1 scans, enabling utilization of the ScanNet++ object segmentations.
Articulate3D was created to address significant gaps in existing 3D indoor scene datasets,
which often lack the articulation, connectivity, and fine-grained part-level detail required for holistic scene understanding,
interaction modeling, and embodied AI. Prior datasets typically include only object-level semantics or provide
limited part or articulation information, restricting their usefulness for simulation, robotics, or interaction-oriented tasks.
Articulate3D provides richly annotated, real-world, high-resolution 3D scenes with complete articulation metadata - including
motion types, axes, origins, ranges, interactable parts, and fixed attachments - enabling realistic physical simulation and
advanced reasoning over scene hierarchies, object structure, and part mobility.
It is also designed for compatibility with the USD (Universal Scene Description) format,
which supports scalable 3D content creation, physics-aware simulation, and seamless integration into robotics and simulation frameworks.
Before use, please obtain the ScanNet++ scenes separately. Our segmentations follow the ScanNet++ `.ply` meshes, but we do NOT provide those files within the dataset.

## Dataset Details
### Dataset Description
<!-- Provide a longer summary of what this dataset is. -->
Articulate3D provides high-quality part segmentation and articulation annotations for scenes from the ScanNet++ dataset. It includes hierarchical part labels, per-face and per-point segmentation, and detailed articulation metadata describing how movable parts relate to parent objects. The dataset is released as JSON files corresponding to ScanNet++ scene IDs and is intended to support research in 3D scene understanding, part reasoning, articulation modeling, and simulation.
- **Curated by:** INSAIT (Articulate3D authors; see citation)
- **Funded by:** Ministry of Education and Science of Bulgaria (support for INSAIT, part of the Bulgarian National Roadmap for Research Infrastructure)
- **Shared by:** INSΑIT
- **License:** CC-BY 4.0
### Dataset Sources
<!-- Provide the basic links for the dataset. -->
- **Repository:** https://github.com/insait-institute/Articulate3D
- **Paper:** [*Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description* (Halacheva et al., ICCV 2025) ](https://openaccess.thecvf.com/content/ICCV2025/papers/Halacheva_Articulate3D_Holistic_Understanding_of_3D_Scenes_as_Universal_Scene_Description_ICCV_2025_paper.pdf)
- **Demo / Challenge Website:** https://insait-institute.github.io/articulate3d.github.io/challenge.html
## Uses
<!-- Address questions around how the dataset is intended to be used. -->
### Direct Use
<!-- This section describes suitable use cases for the dataset. -->
- 3D part segmentation research
- Articulated object understanding
- Scene-level structure and hierarchy modeling
- Robotics, simulation, or digital twin applications requiring articulated components
- Benchmarks for OpenSUN3D Workshop Challenge, Track 3
### Out-of-Scope Use
<!-- This section addresses misuse, malicious use, and uses that the dataset will not work well for. -->
- Use without acquiring the original ScanNet++ scans
- Applications requiring raw sensor data (not included)
## Dataset Structure
<!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. -->
The dataset contains JSON annotation files following the naming scheme: `{scannetpp_scan_id}_{parts|artic}.json`
- **parts.json** — Part segmentation annotations
- `triIndices`: face-level segmentation. The face indices follow the indexing with the mesh `.ply` files in the corresponding ScanNet++ scan.
- `vertIndices`: derived per-vertex segmentation using a voting mechanism. The vertex indices follow the indexing with the point cloud `.ply` files in the corresponding ScanNet++ scan.
- Hierarchy encoded via label indices, e.g.:
- `3.1.cabinet`
- `3.1.2_1.door`
- `3.1.2_1.3_1.handle`
- **artic.json** — Articulation annotations
- `pid`: ID of the articulated part (refers to part segmentation’s `partId`)
- `base`: base part for articulation
- Base can be inferred as the parent in the hierarchy
## Dataset Creation
### Curation Rationale
<!-- Motivation for the creation of this dataset. -->
Enable holistic scene understanding targeting functionality understanding within indoor scenes.
### Source Data
<!-- This section describes the source data (e.g. news text and headlines, social media posts, translated sentences, ...). -->
#### Data Collection and Processing
<!-- This section describes the data collection and processing process such as data selection criteria, filtering and normalization methods, tools and libraries used, etc. -->
Articulate3D is built on top of the 280 publicly available training scenes from ScanNet++.
Articulate3D does not modify the raw scans; instead, it adds several new layers of annotation:
- **Part segmentation**
- **Connectivity graphs**
- **Articulation roles** (movable, interactable, fixed)
- **Motion parameters** (motion type, axis, origin, range)
- **Mass annotations** for simulation
#### Who are the source data producers?
<!-- This section describes the people or systems who originally created the data. It should also include self-reported demographic or identity information for the source data creators if this information is available. -->
Expert-annotated data.
Five expert annotators conducted primary annotations, with a sixth expert performing review and refinement.
<!-- If the dataset contains annotations which are not part of the initial data collection, use this section to describe them. -->
#### Who are the annotators?
<!-- This section describes the people or systems who created the annotations. -->
Annotations were produced by **five expert annotators**, with a **sixth expert reviewer** performing quality checks, corrections, and validation. Tools extended from MultiScan’s annotation suite—with added support for connectivity and articulation—were used to ensure consistency and scalability.
## Citation
<!-- If there is a paper or blog post introducing the dataset, the APA and Bibtex information for that should go in this section. -->
**BibTeX:**
```
@InProceedings{Halacheva_2025_ICCV,
author = {Halacheva, Anna-Maria and Miao, Yang and Zaech, Jan-Nico and Wang, Xi and Van Gool, Luc and Paudel, Danda Pani},
title = {Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description},
booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
month = {October},
year = {2025},
pages = {5633-5644}
}
```
## Dataset Card Authors
Anna-Maria Halacheva, INSAIT
anna-maria.halacheva@insait.ai
## Dataset Card Contact
Anna-Maria Halacheva, INSAIT
anna-maria.halacheva@insait.ai
提供机构:
INSAIT-Institute



