TriView2CAD

Source: ModelScope (魔搭社区). Updated: 2026-05-16. Indexed: 2025-06-07.
Download link:
https://modelscope.cn/datasets/zhuofanChen/TriView2CAD
Resource description:
# TriView2CAD Dataset

**TriView2CAD** is the first large-scale, open-source benchmark specifically designed for orthographic projection reasoning in Computer-Aided Design (CAD). It supports tasks ranging from engineering drawing interpretation to 3D geometry reconstruction, and serves as a foundation for training and evaluating multi-modal models in industrial scenarios.

## 📦 Dataset Overview

TriView2CAD consists of:

- **200,000 synthetic samples** for training and in-domain evaluation.

Each sample is annotated with **precise dimension labels** and includes **six interoperable data modalities**:

1. **Parameter Tables** (JSON)
2. **Vector CAD Drawings** (DXF)
3. **Raster Images** (PNG; front, top, side views)
4. **Executable CAD Scripts**
5. **STEP Files** (3D models)
6. **B-Rep Files** (3D models)

This rich format enables seamless support for tasks across design, simulation, and manufacturing workflows.

## ⚙️ Dataset Construction

We focus on **prefabricated bridge piers**, a modular structure commonly found in civil engineering. A constraint-guided pipeline ensures the geometric and structural validity of each sample:

### Step 1: Constraint-Guided Parameter Sampling

- A 15-dimensional parameter space is defined.
- **Intra-view constraints** ensure physical validity and topology closure.
- **Inter-view constraints** guarantee consistency across orthographic views (height, width, depth).

### Step 2: 2D Drawing & Raster Image Generation

- The `ezdxf` library converts sampled parameters into vectorized DXF drawings.
- Geometric primitives (lines, circles, arcs) are organized into semantic layers.
- High-resolution raster images (front, top, side) are rendered in FreeCAD.

### Step 3: 3D Model Generation

- Using FreeCAD's scripting interface, 3D models are programmatically reconstructed.
- Both **STEP** and **B-Rep** files are generated to support CAD simulation and interoperability.
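
To make the construction steps more concrete, here is a minimal Python sketch of constraint-guided sampling followed by a DXF export. It is an illustration under stated assumptions, not the dataset's actual pipeline: the parameter names in `PierParams`, the sampling ranges, the layer names, and the use of rejection sampling are all hypothetical, since the card does not list the 15 parameters or say how the constraints are enforced. The `ezdxf` calls (`ezdxf.new`, `doc.layers.add`, `add_lwpolyline`, `saveas`) are real library functions, imported lazily so the sampler runs without the package installed.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class PierParams:
    """Hypothetical subset of the parameter space (names are not from the dataset)."""
    cap_width: float
    cap_depth: float
    cap_height: float
    column_count: int
    column_diameter: float
    column_gap: float
    column_height: float


def column_span(p: PierParams) -> float:
    """Horizontal extent of all columns plus the gaps between them."""
    return p.column_count * p.column_diameter + (p.column_count - 1) * p.column_gap


def intra_view_ok(p: PierParams) -> bool:
    """Columns must fit under the cap (front view) and within its depth (top view)."""
    return column_span(p) <= p.cap_width and p.column_diameter <= p.cap_depth


def view_extents(p: PierParams) -> dict:
    """Overall extents that each orthographic view would display."""
    total_height = p.cap_height + p.column_height
    return {
        "front": {"width": p.cap_width, "height": total_height},
        "top": {"width": p.cap_width, "depth": p.cap_depth},
        "side": {"depth": p.cap_depth, "height": total_height},
    }


def inter_view_ok(p: PierParams) -> bool:
    """Shared extents must agree across views.

    Trivially true here because all views derive from one parameter set;
    the check matters when views are produced by separate code paths.
    """
    v = view_extents(p)
    return (
        v["front"]["width"] == v["top"]["width"]
        and v["front"]["height"] == v["side"]["height"]
        and v["top"]["depth"] == v["side"]["depth"]
    )


def sample_valid(rng: random.Random, max_tries: int = 1000) -> PierParams:
    """Rejection sampling: draw candidates until both constraint sets hold."""
    for _ in range(max_tries):
        candidate = PierParams(
            cap_width=rng.uniform(4.0, 12.0),
            cap_depth=rng.uniform(1.5, 4.0),
            cap_height=rng.uniform(0.8, 2.0),
            column_count=rng.randint(1, 4),
            column_diameter=rng.uniform(0.8, 2.0),
            column_gap=rng.uniform(0.5, 3.0),
            column_height=rng.uniform(3.0, 15.0),
        )
        if intra_view_ok(candidate) and inter_view_ok(candidate):
            return candidate
    raise RuntimeError("no valid sample found; constraints may be too tight")


def write_front_view_dxf(p: PierParams, path: str) -> None:
    """Draw the front view as closed polylines on two semantic layers."""
    import ezdxf  # third-party; imported lazily so the sampler runs without it

    doc = ezdxf.new("R2010")
    doc.layers.add("CAP", color=1)      # hypothetical layer names
    doc.layers.add("COLUMNS", color=3)
    msp = doc.modelspace()

    def rect(x0, y0, x1, y1, layer):
        msp.add_lwpolyline(
            [(x0, y0), (x1, y0), (x1, y1), (x0, y1)],
            close=True,
            dxfattribs={"layer": layer},
        )

    rect(0.0, p.column_height, p.cap_width, p.column_height + p.cap_height, "CAP")
    left = (p.cap_width - column_span(p)) / 2  # centre the columns under the cap
    for i in range(p.column_count):
        x0 = left + i * (p.column_diameter + p.column_gap)
        rect(x0, 0.0, x0 + p.column_diameter, p.column_height, "COLUMNS")
    doc.saveas(path)


params = sample_valid(random.Random(0))
```

With `ezdxf` installed, `write_front_view_dxf(params, "front.dxf")` would write one drawing. Seeding the generator keeps samples reproducible, which is convenient when the same parameters must later drive the raster and 3D steps.
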
## 🧠 Supported Reasoning Tasks

TriView2CAD enables evaluation of complex reasoning capabilities via three key task categories:

1. **Dimension Recognition and Mapping**
   Identify each annotated dimension and match it to its geometric feature.
2. **Primitive Counting**
   Count the number of specified components (e.g., pier columns, pile bases).
3. **Composite Parameter Computation**
   Derive engineering-critical values based on multiple parameters (e.g., spacing = width + gap).

A total of **15 parameters** are evaluated per sample, including:

- 6 recognition attributes
- 3 counting attributes
- 6 computed attributes

Evaluation accuracy is computed per parameter and averaged across the test set.

## 📊 Benchmarking & Generalization

TriView2CAD has been used to benchmark seven leading vision-language models (VLMs). Our experiments demonstrate:

- Significant performance gaps between pretrained VLMs and reasoning-tuned models.
- Real-world samples introduce realistic noise and complexity (e.g., overlapping annotations, occlusions).
- Chain-of-Thought (CoT) and curriculum-based fine-tuning dramatically improve reasoning accuracy and out-of-distribution (OOD) robustness.
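
The per-parameter accuracy described under Supported Reasoning Tasks can be sketched roughly as follows. This is an assumption-laden illustration rather than the official scoring script: the parameter names, the dict-based record format, and the numeric tolerance `rel_tol` are all hypothetical, as the card only states that accuracy is computed per parameter and averaged across the test set.

```python
import math
from collections import defaultdict


def per_parameter_accuracy(predictions, references, rel_tol=1e-6):
    """Return {parameter name: fraction of samples predicted correctly}.

    A prediction counts as correct when it is numeric and matches the
    reference within rel_tol (the tolerance is an assumption, not a
    documented rule). Missing or non-numeric predictions count as wrong.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for pred, ref in zip(predictions, references):
        for name, truth in ref.items():
            total[name] += 1
            value = pred.get(name)
            if isinstance(value, (int, float)) and math.isclose(
                value, truth, rel_tol=rel_tol
            ):
                correct[name] += 1
    return {name: correct[name] / total[name] for name in total}


# Hypothetical records: one recognition, one counting, one computed attribute.
references = [
    {"column_width": 1.2, "column_count": 3, "spacing": 3.0},
    {"column_width": 1.5, "column_count": 2, "spacing": 3.5},
]
predictions = [
    {"column_width": 1.2, "column_count": 3, "spacing": 2.8},
    {"column_width": 1.5, "column_count": 4, "spacing": 3.5},
]

accuracy = per_parameter_accuracy(predictions, references)
mean_accuracy = sum(accuracy.values()) / len(accuracy)
```

Reporting the per-parameter dictionary alongside the mean makes it easier to see whether errors concentrate in recognition, counting, or computed attributes.
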

Provider:
maas
Created:
2025-06-04
Collected summary
Dataset introduction
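Background and challenges
Background overview
TriView2CAD is the first large-scale, open-source benchmark designed for orthographic projection reasoning in CAD. It contains 200,000 synthetic samples, each providing six interoperable data modalities (such as parameter tables, vector drawings, and 3D models), and focuses on prefabricated bridge pier structures. It supports tasks such as dimension recognition, primitive counting, and composite parameter computation, is intended for training and evaluating multi-modal models in industrial scenarios, and has been used to benchmark several leading models.
The summary above was collected and generated by 遇见数据集.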