3D Multimodal Grounding Reasoning Dataset

Name: 3D Multimodal Grounding Reasoning Dataset
Creator: Harvard Dataverse
Published: 2025-05-14 02:43:37
License: No description available

Source: DataCite Commons (updated 2025-05-14, indexed 2025-05-17)

Download link:

https://dataverse.harvard.edu/citation?persistentId=doi:10.7910/DVN/JNGYBC

Description:

Our dataset is the first 3D grounded medical image reasoning dataset, designed to challenge vision-language models to perceive fine-grained visual details and to carry out diagnostic reasoning analogous to that of expert clinicians. Focusing on the MRI Osteoarthritis Knee Score (MOAKS), it explicitly models the localized clinical diagnostic workflow as a structured chain of thought: anatomical localization, abnormality detection, severity estimation, and final score assignment. We curated 560,000 high-quality quintuples, each pairing (1) a 3D knee MRI volume, drawn from 2,652 patients, (2) a region-specific clinical question addressing a specific MOAKS dimension, (3) 3D bounding boxes of the relevant anatomical structures, (4) expert-crafted diagnostic reasoning steps, and (5) a structured MOAKS-based assessment.
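The quintuple structure described above can be pictured as a single record type. The following is only a minimal sketch: the listing does not state the dataset's file format or schema, so every class name, field name, the box convention, and the placeholder values below are assumptions made for illustration, not the dataset's actual layout.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

# The four reasoning stages named in the description, in workflow order.
# The identifier spellings are hypothetical.
REASONING_STAGES = (
    "anatomical_localization",
    "abnormality_detection",
    "severity_estimation",
    "score_assignment",
)


@dataclass
class BoundingBox3D:
    # Assumed convention: voxel indices, min corner inclusive, max exclusive.
    min_corner: Tuple[int, int, int]
    max_corner: Tuple[int, int, int]

    def is_valid(self) -> bool:
        # A box is valid when it has positive extent along every axis.
        return all(lo < hi for lo, hi in zip(self.min_corner, self.max_corner))


@dataclass
class GroundedSample:
    volume_path: str                   # (1) 3D knee MRI volume
    question: str                      # (2) region-specific clinical question
    boxes: List[BoundingBox3D]         # (3) 3D boxes of relevant structures
    reasoning: List[Tuple[str, str]]   # (4) ordered (stage, text) steps
    assessment: Dict[str, object]      # (5) structured MOAKS-based assessment

    def follows_workflow(self) -> bool:
        # True when the reasoning steps appear in the documented stage order.
        return tuple(stage for stage, _ in self.reasoning) == REASONING_STAGES


# Placeholder record; none of these values come from the dataset.
sample = GroundedSample(
    volume_path="placeholder_volume.nii.gz",
    question="<question about one MOAKS dimension>",
    boxes=[BoundingBox3D((10, 20, 30), (40, 60, 50))],
    reasoning=[(stage, "<expert reasoning text>") for stage in REASONING_STAGES],
    assessment={"dimension": "<MOAKS dimension>", "score": None},
)
```

Keeping the reasoning steps as an ordered list of (stage, text) pairs makes it straightforward to check that a record, or a model's output, follows the localization, detection, severity, score sequence before any scoring is attempted.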

Provider:

Harvard Dataverse

Created:

2025-05-14
