Articulate-Anything Dataset
收藏Articulate Anything 数据集概述
数据集简介
Articulate Anything 是一个用于通过多种输入模态(文本、图像、视频)来描述和生成3D对象的强大视觉语言模型(VLM)系统。
数据集特点
- 文本输入:通过文本描述生成3D对象。
- 图像输入:通过图像生成3D对象。
- 视频输入:通过视频生成3D对象。
数据集下载
预处理后的 PartNet-Mobility 数据集可通过以下链接下载:
数据集使用
安装与环境设置
-
克隆仓库: bash git clone https://github.com/vlongle/articulate-anything.git cd articulate-anything
-
设置Python环境: bash conda create -n articulate-anything python=3.9 conda activate articulate-anything pip install -e .
-
下载并解压 PartNet-Mobility 数据集: bash mkdir datasets mv partnet-mobility-v0.zip datasets/partnet-mobility-v0.zip cd datasets mkdir partnet-mobility-v0 unzip partnet-mobility-v0 -d partnet-mobility-v0
数据集预处理
-
文本模态: bash python articulate_anything/preprocess/preprocess_partnet.py parallel={int} modality=text
-
图像模态: bash python articulate_anything/preprocess/preprocess_partnet.py parallel={int} modality=image
-
视频模态: bash python articulate_anything/preprocess/preprocess_partnet.py parallel={int} modality=video
数据集应用
-
文本生成3D对象: bash python articulate.py modality=text prompt="suitcase with a retractable handle" out_dir=results/text/suitcase
-
图像生成3D对象: bash python articulate.py modality=image prompt="datasets/in-the-wild-dataset/images/suitcase.jpg" out_dir=results/image/suitcase
-
视频生成3D对象: bash python articulate.py modality=video prompt="datasets/in-the-wild-dataset/videos/suitcase.mp4" out_dir=results/video/suitcase
数据集引用
如果使用该数据集,请引用以下论文: bibtex @article{le2024articulate, title={Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model}, author={Le, Long and Xie, Jason and Liang, William and Wang, Hung-Ju and Yang, Yue and Ma, Yecheng Jason and Vedder, Kyle and Krishna, Arjun and Jayaraman, Dinesh and Eaton, Eric}, journal={arXiv preprint arXiv:2410.13882}, year={2024} }




