"Tigr100k: A Benchmark for Graded Relevance and Fine-Grained Text-to-Image Retrieval in Compositional Queries"

Name: "Tigr100k: A Benchmark for Graded Relevance and Fine-Grained Text-to-Image Retrieval in Compositional Queries"
Creator: IEEE DataPort
Published: 2026-04-05 13:34:23
License: 暂无描述

DataCite Commons2026-04-05 更新2026-05-03 收录

下载链接：

https://ieee-dataport.org/documents/tigr100k-benchmark-graded-relevance-and-fine-grained-text-image-retrieval-compositional

下载链接

链接失效反馈

官方服务：

资源简介：

"Despite the remarkable success of CLIP models in text-to-image retrieval, their performance remains suboptimal when confronted with complex queries encompassing multiple subjects, attributes, or relations. Existing benchmarks and prevailing metrics, like R@k, are fundamentally insufficient, as they treat relevance as a binary state and fail to capture the nuanced ranking quality required for complex search results.To address this critical evaluative gap, we introduce Tigr100k, a novel and robust benchmark dataset specifically designed for rigorous complex query evaluation. Tigr100k surpasses existing resources by comprising 1,044 bilingual complex query pairs and images annotated with fine-grained, multi-level relevance grading (highly, moderately, and irrelevant). Crucially, we couple Tigr100k with hierarchical evaluation metrics that enable a comprehensive assessment of both result coverage and ranking quality.Extensive experiments across multiple CLIP variants using CsdCLIP not only demonstrate significant performance gains, but also confirm the critical value of Tigr100k and its metrics in discerning and accurately measuring true compositional capabilities."

提供机构：

IEEE DataPort

创建时间：

2026-04-05

5,000+

优质数据集

54 个

任务类型

进入经典数据集