MMSVG-Illustration
收藏魔搭社区2026-01-07 更新2026-01-03 收录
下载链接:
https://modelscope.cn/datasets/AI-ModelScope/MMSVG-Illustration
下载链接
链接失效反馈官方服务:
资源简介:
<h1>OmniSVG: A Unified Scalable Vector Graphics Generation Model</h1>
[![Project Page]](https://omnisvg.github.io/)
# Dataset Card for MMSVG-Illustration
## Dataset Description
This dataset contains SVG illustration examples for training and evaluating SVG models for text-to-SVG and image-to-SVG task.
## Dataset Structure
### Features
The dataset contains the following fields:
| Field Name | Description |
| :--------- | :---------- |
| `id` | Unique ID for each SVG |
| `svg` | SVG code (resized to 200×200, simplified with picosvg) |
| `description` | Description of the SVG |
| `keywords` | Keywords associated with the SVG |
| `detail` | Detailed description of the SVG |
| `image` | PNG image preview (resized to 448×448) |
| `token len` | Token length (OmniSVG tokenizer) |
## Changelog
### v2.0 (2025-12-22)
- **Data Volume**: Increased from 65,751 to 255,412 samples
- **Deduplication**: Removed duplicate SVGs based on MD5 hash (keeping only one instance per unique SVG)
- **Added PNG Previews**: Each SVG now includes a corresponding PNG image
- **Enhanced Captions**: Increased caption diversity with `description`, `keywords`, and `detail` fields
- **SVG Normalization**: All SVGs resized to 200×200 and simplified using [picosvg](https://github.com/nicothin/picosvg)
- **PNG Standardization**: All PNG images resized to 448×448 for uniform input size
## Citation
```bibtex
@article{yang2025omnisvg,
title={OmniSVG: A Unified Scalable Vector Graphics Generation Model},
author={Yiying Yang and Wei Cheng and Sijin Chen and Xianfang Zeng and Jiaxu Zhang and Liao Wang and Gang Yu and Xinjun Ma and Yu-Gang Jiang},
journal={arXiv preprint arxiv:2504.06263},
year={2025}
}
```
## Tags
- scalable vector graphics (SVG)
- vision language models
- multimodal
- Illustration
# OmniSVG:统一可缩放矢量图形(Scalable Vector Graphics,SVG)生成模型
[![项目主页]](https://omnisvg.github.io/)
# MMSVG-插画数据集卡片
## 数据集概述
本数据集收录SVG插画样本,用于训练与评估面向文本转SVG、图像转SVG任务的SVG生成模型。
## 数据集结构
### 字段说明
本数据集包含以下字段:
| 字段名 | 字段说明 |
| :--------- | :---------- |
| `id` | 每个SVG的唯一标识符 |
| `svg` | SVG代码(已调整至200×200分辨率,经picosvg工具简化) |
| `description` | SVG插画的描述文本 |
| `keywords` | 与该SVG插画关联的关键词 |
| `detail` | SVG插画的详细描述文本 |
| `image` | PNG格式预览图(已调整至448×448分辨率) |
| `token len` | Token长度(采用OmniSVG分词器) |
## 更新日志
### 版本2.0(2025年12月22日)
- **数据规模**:样本量从65751条增至255412条
- **去重处理**:基于MD5哈希值移除重复SVG样本(每个唯一SVG仅保留一条实例)
- **新增PNG预览图**:为每个SVG添加对应的PNG格式预览图
- **增强标注多样性**:新增`description`、`keywords`与`detail`三个字段,提升标注丰富度
- **SVG标准化**:所有SVG均调整至200×200分辨率,并通过[picosvg](https://github.com/nicothin/picosvg)工具进行简化
- **PNG标准化**:所有PNG预览图均调整至448×448分辨率,以保证输入尺寸统一
## 引用格式
bibtex
@article{yang2025omnisvg,
title={OmniSVG: A Unified Scalable Vector Graphics Generation Model},
author={Yiying Yang and Wei Cheng and Sijin Chen and Xianfang Zeng and Jiaxu Zhang and Liao Wang and Gang Yu and Xinjun Ma and Yu-Gang Jiang},
journal={arXiv preprint arxiv:2504.06263},
year={2025}
}
## 标签
- 可缩放矢量图形(Scalable Vector Graphics,SVG)
- 视觉语言模型
- 多模态
- 插画
提供机构:
maas
创建时间:
2025-04-11



