INSAIT-Institute/Articulate3D

Source: Hugging Face. Updated 2025-12-02. Indexed 2026-01-03.
Download link:
https://hf-mirror.com/datasets/INSAIT-Institute/Articulate3D
Description:
---
license: cc-by-4.0
task_categories:
- robotics
tags:
- 3D
- scene-understanding
- segmentation
- articulation
---

<p align="right">
<img src="https://cdn-uploads.huggingface.co/production/uploads/637e1f8cf7e01589cc17bf7e/p6d0YFHjWCQ3S12jWqO1m.png" width="700">
</p>

# Dataset Card for Articulate3D

<!-- Provide a quick summary of the dataset. -->

Articulate3D is a dataset providing part segmentation and articulation annotations for 3D indoor scene scans. It supplies structured labels describing object parts, hierarchical relationships, and articulation mechanisms (motion parameters). The annotations cover 280 high-quality ScanNet++ V1 scans, so they can be used together with the ScanNet++ object segmentations.

Articulate3D was created to address significant gaps in existing 3D indoor scene datasets, which often lack the articulation, connectivity, and fine-grained part-level detail required for holistic scene understanding, interaction modeling, and embodied AI. Prior datasets typically include only object-level semantics or provide limited part or articulation information, restricting their usefulness for simulation, robotics, or interaction-oriented tasks.

Articulate3D provides richly annotated, real-world, high-resolution 3D scenes with complete articulation metadata (motion types, axes, origins, ranges, interactable parts, and fixed attachments), enabling realistic physical simulation and advanced reasoning over scene hierarchies, object structure, and part mobility. It is also designed for compatibility with the USD (Universal Scene Description) format, which supports scalable 3D content creation, physics-aware simulation, and seamless integration into robotics and simulation frameworks.

Before use, please obtain the ScanNet++ scenes separately. Our segmentations follow the ScanNet++ `.ply` meshes, but we do NOT provide those files within the dataset.
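
As an illustration of how the articulation metadata mentioned above (motion type, axis, origin, range) might be consumed, the following is a minimal sketch that moves a single 3D point according to one motion description. The dictionary keys (`type`, `axis`, `origin`, `range`), the type names `rotation` and `translation`, and the use of radians and scene units are all assumptions made for this example; they are not taken from the dataset's actual schema, which should be checked in the repository. The rotation uses the standard Rodrigues formula.

```python
import math


def _unit(axis):
    """Return `axis` scaled to unit length."""
    norm = math.sqrt(sum(a * a for a in axis))
    if norm == 0.0:
        raise ValueError("axis must be non-zero")
    return [a / norm for a in axis]


def rotate_about_axis(point, origin, axis, angle_rad):
    """Rotate `point` by `angle_rad` around the line through `origin` along `axis`."""
    k = _unit(axis)
    v = [p - o for p, o in zip(point, origin)]
    cos_t, sin_t = math.cos(angle_rad), math.sin(angle_rad)
    k_cross_v = [
        k[1] * v[2] - k[2] * v[1],
        k[2] * v[0] - k[0] * v[2],
        k[0] * v[1] - k[1] * v[0],
    ]
    k_dot_v = sum(a * b for a, b in zip(k, v))
    # Rodrigues: v*cos + (k x v)*sin + k*(k . v)*(1 - cos)
    rotated = [
        v[i] * cos_t + k_cross_v[i] * sin_t + k[i] * k_dot_v * (1.0 - cos_t)
        for i in range(3)
    ]
    return [r + o for r, o in zip(rotated, origin)]


def translate_along_axis(point, axis, distance):
    """Shift `point` by `distance` along the direction of `axis`."""
    k = _unit(axis)
    return [p + distance * a for p, a in zip(point, k)]


def apply_motion(point, motion, amount):
    """Apply a motion to `point`, clamping `amount` to the motion's range.

    `motion` is a hypothetical dict with keys `type`, `axis`, `origin`, `range`.
    """
    low, high = motion["range"]
    amount = max(low, min(high, amount))
    if motion["type"] == "rotation":
        return rotate_about_axis(point, motion["origin"], motion["axis"], amount)
    if motion["type"] == "translation":
        return translate_along_axis(point, motion["axis"], amount)
    raise ValueError(f"unknown motion type: {motion['type']!r}")


# Hypothetical door hinge: vertical axis through (1, 1, 0), opening up to 90 degrees.
hinge = {
    "type": "rotation",
    "axis": [0.0, 0.0, 1.0],
    "origin": [1.0, 1.0, 0.0],
    "range": [0.0, math.pi / 2],
}
opened = apply_motion([2.0, 1.0, 0.0], hinge, math.pi / 2)
print([round(c, 6) for c in opened])  # [1.0, 2.0, 0.0]
```

Clamping to the range before applying the motion keeps a requested state inside the annotated limits; whether the real annotations express ranges in radians or degrees is not stated here and would need to be confirmed.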
![Teaser](https://github.com/amhalacheva/Hf_resources/blob/6caf023ce6656924a0db9b0c197977771058be39/teaser_iccv.drawio.png?raw=true)

## Dataset Details

### Dataset Description

<!-- Provide a longer summary of what this dataset is. -->

Articulate3D provides high-quality part segmentation and articulation annotations for scenes from the ScanNet++ dataset. It includes hierarchical part labels, per-face and per-point segmentation, and detailed articulation metadata describing how movable parts relate to parent objects. The dataset is released as JSON files corresponding to ScanNet++ scene IDs and is intended to support research in 3D scene understanding, part reasoning, articulation modeling, and simulation.

- **Curated by:** INSAIT (Articulate3D authors; see citation)
- **Funded by:** Ministry of Education and Science of Bulgaria (support for INSAIT, part of the Bulgarian National Roadmap for Research Infrastructure)
- **Shared by:** INSAIT
- **License:** CC-BY 4.0

### Dataset Sources

<!-- Provide the basic links for the dataset. -->

- **Repository:** https://github.com/insait-institute/Articulate3D
- **Paper:** [*Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description* (Halacheva et al., ICCV 2025)](https://openaccess.thecvf.com/content/ICCV2025/papers/Halacheva_Articulate3D_Holistic_Understanding_of_3D_Scenes_as_Universal_Scene_Description_ICCV_2025_paper.pdf)
- **Demo / Challenge Website:** https://insait-institute.github.io/articulate3d.github.io/challenge.html

## Uses

<!-- Address questions around how the dataset is intended to be used. -->

### Direct Use

<!-- This section describes suitable use cases for the dataset.
-->

- 3D part segmentation research
- Articulated object understanding
- Scene-level structure and hierarchy modeling
- Robotics, simulation, or digital twin applications requiring articulated components
- Benchmarks for OpenSUN3D Workshop Challenge, Track 3

### Out-of-Scope Use

<!-- This section addresses misuse, malicious use, and uses that the dataset will not work well for. -->

- Use without acquiring the original ScanNet++ scans
- Applications requiring raw sensor data (not included)

## Dataset Structure

<!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. -->

The dataset contains JSON annotation files following the naming scheme:
`{scannetpp_scan_id}_{parts|artic}.json`

- **parts.json**: Part segmentation annotations
  - `triIndices`: face-level segmentation. The face indices follow the face indexing of the mesh `.ply` files in the corresponding ScanNet++ scan.
  - `vertIndices`: per-vertex segmentation derived from the face-level labels using a voting mechanism. The vertex indices follow the vertex indexing of the point cloud `.ply` files in the corresponding ScanNet++ scan.
  - Hierarchy encoded via label indices, e.g.:
    - `3.1.cabinet`
    - `3.1.2_1.door`
    - `3.1.2_1.3_1.handle`
- **artic.json**: Articulation annotations
  - `pid`: ID of the articulated part (refers to the `partId` in the part segmentation)
  - `base`: base part for the articulation
    - The base can be inferred as the parent in the hierarchy

## Dataset Creation

### Curation Rationale

<!-- Motivation for the creation of this dataset. -->

To enable holistic scene understanding, with a focus on understanding functionality within indoor scenes.

### Source Data

<!-- This section describes the source data (e.g. news text and headlines, social media posts, translated sentences, ...).
-->

#### Data Collection and Processing

<!-- This section describes the data collection and processing process such as data selection criteria, filtering and normalization methods, tools and libraries used, etc. -->

Articulate3D is built on top of the 280 publicly available training scenes from ScanNet++. Articulate3D does not modify the raw scans; instead, it adds several new layers of annotation:

- **Part segmentation**
- **Connectivity graphs**
- **Articulation roles** (movable, interactable, fixed)
- **Motion parameters** (motion type, axis, origin, range)
- **Mass annotations** for simulation

#### Who are the source data producers?

<!-- This section describes the people or systems who originally created the data. It should also include self-reported demographic or identity information for the source data creators if this information is available. -->

Expert-annotated data. Five expert annotators conducted primary annotations, with a sixth expert performing review and refinement.

<!-- If the dataset contains annotations which are not part of the initial data collection, use this section to describe them. -->

#### Who are the annotators?

<!-- This section describes the people or systems who created the annotations. -->

Annotations were produced by **five expert annotators**, with a **sixth expert reviewer** performing quality checks, corrections, and validation. The annotation tools were extended from MultiScan's annotation suite, with added support for connectivity and articulation, to ensure consistency and scalability.

## Citation

<!-- If there is a paper or blog post introducing the dataset, the APA and Bibtex information for that should go in this section.
-->

**BibTeX:**

```
@InProceedings{Halacheva_2025_ICCV,
    author    = {Halacheva, Anna-Maria and Miao, Yang and Zaech, Jan-Nico and Wang, Xi and Van Gool, Luc and Paudel, Danda Pani},
    title     = {Articulate3D: Holistic Understanding of 3D Scenes as Universal Scene Description},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2025},
    pages     = {5633-5644}
}
```

## Dataset Card Authors

Anna-Maria Halacheva, INSAIT
anna-maria.halacheva@insait.ai

## Dataset Card Contact

Anna-Maria Halacheva, INSAIT
anna-maria.halacheva@insait.ai
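
The file naming scheme and hierarchical labels described under Dataset Structure can be handled with a few lines of string processing. The sketch below is illustrative only: it assumes that the last dot-separated token of a label is the part name and that everything before it is a hierarchy path, and it finds a part's parent as the known label whose path is the longest proper prefix of the part's own path. This reading is inferred from the three example labels in this card, not from a documented specification, and the function names and the scan ID `abc123` are hypothetical.

```python
from pathlib import Path


def split_annotation_name(filename):
    """Split '{scan_id}_{parts|artic}.json' into (scan_id, kind)."""
    stem = Path(filename).stem
    scan_id, sep, kind = stem.rpartition("_")
    if not sep or not scan_id or kind not in ("parts", "artic"):
        raise ValueError(f"unexpected annotation file name: {filename!r}")
    return scan_id, kind


def split_label(label):
    """Split a label such as '3.1.2_1.door' into (path tokens, part name)."""
    path, sep, name = label.rpartition(".")
    if not sep or not path:
        raise ValueError(f"label has no hierarchy prefix: {label!r}")
    return tuple(path.split(".")), name


def build_parent_map(labels):
    """Map each label to its parent label, or None for top-level objects.

    The parent is the known label whose path is the longest proper prefix
    of this label's path (an assumption about the encoding).
    """
    by_path = {split_label(label)[0]: label for label in labels}
    parents = {}
    for path, label in by_path.items():
        parent = None
        for cut in range(len(path) - 1, 0, -1):
            if path[:cut] in by_path:
                parent = by_path[path[:cut]]
                break
        parents[label] = parent
    return parents


labels = ["3.1.cabinet", "3.1.2_1.door", "3.1.2_1.3_1.handle"]
parents = build_parent_map(labels)
for label in labels:
    print(label, "->", parents[label])

print(split_annotation_name("abc123_parts.json"))
```

Under these assumptions the handle resolves to the door and the door to the cabinet, which matches the card's note that the articulation `base` can be inferred as the parent in the hierarchy.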
Provider:
INSAIT-Institute