thomas-yanxin/multi3drefer-mirror

Name: thomas-yanxin/multi3drefer-mirror
Creator: thomas-yanxin
Published: 2026-03-30 09:51:48
License: 暂无描述

Hugging Face2026-03-30 更新2026-04-12 收录

下载链接：

https://hf-mirror.com/datasets/thomas-yanxin/multi3drefer-mirror

下载链接

链接失效反馈

官方服务：

资源简介：

--- language: - en size_categories: - 10K<n<100K license: mit --- # Multi3DRefer Dataset This repository contains the Multi3DRefer dataset, introduced in [Multi3DRefer: Grounding Text Description to Multiple 3D Objects](https://3dlg-hcvc.github.io/multi3drefer/). [![GitHub](https://img.shields.io/badge/github-%23121011.svg?style=for-the-badge&logo=github&logoColor=white)](https://github.com/3dlg-hcvc/multi3drefer) [![PyPI](https://img.shields.io/badge/pypi-3775A9?style=for-the-badge&logo=pypi&logoColor=white)](https://pypi.org/project/torchmetrics-ext/) [![arXiv](https://img.shields.io/badge/arXiv-2309.05251-b31b1b.svg?style=for-the-badge)](https://arxiv.org/abs/2309.05251) ## Dataset Attributes **scene_id: (str)** the scene identifier as defined in [ScanNetv2](http://www.scan-net.org/). \ **object_name: (str)** the name of the target object(s). \ **ann_id: (int)** the local annotation ID within a scene. \ **description: (str)** the language description of the target object(s) in the scene. \ **object_ids: (List[int])** the target object ID(s) as defined in [ScanNetv2](http://www.scan-net.org/). \ **eval_type: (str)** the evaluation type of the data, possible values: "zt" (zero target), "mt" (multiple targets), "st_w_d" (single target with distractors) and "st_wo_d" (single target without distractors). \ **spatial: (bool)** indicates if the description mentions spatial information about the target object(s). \ **color: (bool)** indicates if the description mentions the color of the target object(s). \ **texture: (bool)** indicates if the description mentions texture information about the target object(s). \ **shape: (bool)** indicates if the description mentions shape information about the target object(s). ## Citation ```bibtex @inproceedings{zhang2023multi3drefer, author={Zhang, Yiming and Gong, ZeMing and Chang, Angel X}, title={Multi3DRefer: Grounding Text Description to Multiple 3D Objects}, booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month={October}, year={2023}, pages={15225-15236} } ```

--- 语言： - 英语规模类别： - 10K < n < 100K 许可证：MIT许可证 --- # Multi3DRefer 数据集本代码仓库包含Multi3DRefer数据集，该数据集由论文《Multi3DRefer: Grounding Text Description to Multiple 3D Objects》提出，论文链接：https://3dlg-hcvc.github.io/multi3drefer/。 [![GitHub 徽章](https://img.shields.io/badge/github-%23121011.svg?style=for-the-badge&logo=github&logoColor=white)](https://github.com/3dlg-hcvc/multi3drefer) [![PyPI 徽章](https://img.shields.io/badge/pypi-3775A9?style=for-the-badge&logo=pypi&logoColor=white)](https://pypi.org/project/torchmetrics-ext/) [![arXiv 徽章](https://img.shields.io/badge/arXiv-2309.05251-b31b1b.svg?style=for-the-badge)](https://arxiv.org/abs/2309.05251) ## 数据集属性 **scene_id：（字符串类型）** 即ScanNetv2（ScanNetv2）中定义的场景标识符。 **object_name：（字符串类型）** 目标对象的名称。 **ann_id：（整数类型）** 单一场景内的本地标注ID。 **description：（字符串类型）** 场景中目标对象的自然语言描述文本。 **object_ids：（整数列表类型）** 即ScanNetv2（ScanNetv2）中定义的目标对象ID列表。 **eval_type：（字符串类型）** 数据的评估类型，可选取值为："zt"（零目标（zero target））、"mt"（多目标（multiple targets））、"st_w_d"（带干扰物的单目标（single target with distractors））以及"st_wo_d"（无干扰物的单目标（single target without distractors））。 **spatial：（布尔类型）** 用于标记描述是否提及目标对象空间信息的属性。 **color：（布尔类型）** 用于标记描述是否提及目标对象颜色信息的属性。 **texture：（布尔类型）** 用于标记描述是否提及目标对象纹理信息的属性。 **shape：（布尔类型）** 用于标记描述是否提及目标对象形状信息的属性。 ## 引用格式 bibtex @inproceedings{zhang2023multi3drefer, author={Zhang, Yiming and Gong, ZeMing and Chang, Angel X}, title={Multi3DRefer: Grounding Text Description to Multiple 3D Objects}, booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)}, month={October}, year={2023}, pages={15225-15236} }

提供机构：

thomas-yanxin

5,000+

优质数据集

54 个

任务类型

进入经典数据集