CGFormer
收藏魔搭社区2025-11-16 更新2025-01-04 收录
下载链接:
https://modelscope.cn/datasets/Toneyaya/CGFormer
下载链接
链接失效反馈官方服务:
资源简介:
Datasets for open-vocabulary settings, including the RefCOCO series, were utilized for **CGFormer (CVPR 2023): Contrastive Grouping with Transformer for Referring Image Segmentation.**
This paper first introduces learnable query tokens to represent objects and then alternately queries linguistic features and groups visual features into the query tokens for object-aware cross-modal reasoning. CGFormer achieves cross-level interaction by jointly updating the query tokens and decoding masks in every two consecutive layers. In addition, we introduce new splits on datasets for evaluating generalization for referring image segmentation models.
### Download
from modelscope.msdatasets
import MsDataset ds = MsDataset.load('Toneyaya/CGFormer')
or
modelscope download --dataset Toneyaya/CGFormer
本研究将适用于开放词汇场景(open-vocabulary settings)的数据集(包含RefCOCO系列数据集)应用于**CGFormer(CVPR 2023:面向指代表象分割的Transformer对比分组方法)**。
该论文首先提出可学习查询Token(learnable query tokens)以表征目标对象,随后交替查询语言特征并将视觉特征聚合至查询Token中,实现面向目标感知的跨模态推理。CGFormer通过在每连续两层中同步更新查询Token与解码掩码,实现跨层级交互。此外,本研究还构建了数据集的全新划分方案,用于评估指代表象分割模型的泛化性能。
### 下载
python
from modelscope.msdatasets
import MsDataset
ds = MsDataset.load('Toneyaya/CGFormer')
或者
bash
modelscope download --dataset Toneyaya/CGFormer
提供机构:
maas
创建时间:
2024-12-28



