Ideogram-V2_t2i_human_preference
ModelScope community. Updated 2025-11-27; indexed 2025-04-26.
Download link:
https://modelscope.cn/datasets/Rapidata/Ideogram-V2_t2i_human_preference
Description:
<style>
.vertical-container {
display: flex;
flex-direction: column;
gap: 60px;
}
.image-container img {
max-height: 250px; /* Set the desired height */
margin:0;
object-fit: contain; /* Ensures the aspect ratio is maintained */
width: auto; /* Adjust width automatically based on height */
box-sizing: content-box;
}
.image-container {
display: flex; /* Aligns images side by side */
justify-content: space-around; /* Space them evenly */
align-items: center; /* Align them vertically */
gap: .5rem
}
.container {
width: 90%;
margin: 0 auto;
}
.text-center {
text-align: center;
}
.score-amount {
margin: 0;
margin-top: 10px;
}
.score-percentage {
font-size: 12px;
font-weight: 600; /* semi-bold; "semi-bold" is not a valid CSS keyword */
}
</style>
# Rapidata Ideogram-V2 Preference
<a href="https://www.rapidata.ai">
<img src="https://cdn-uploads.huggingface.co/production/uploads/66f5624c42b853e73e0738eb/jfxR79bOztqaC6_yNNnGU.jpeg" width="400" alt="Dataset visualization">
</a>
This text-to-image (T2I) dataset contains over 195k human responses from over 42k individual annotators, collected in about one day using the [Rapidata Python API](https://docs.rapidata.ai), which is accessible to anyone and well suited to large-scale evaluation.
The responses evaluate Ideogram V2 across three categories: preference, coherence, and alignment.
Explore our latest model rankings on our [website](https://www.rapidata.ai/benchmark).
If you get value from this dataset and would like to see more in the future, please consider liking it.
## Overview
This T2I dataset contains over 195k human responses from over 42k individual annotators, collected in about one day.
It evaluates Ideogram V2 across three categories: preference, coherence, and alignment.
The evaluation consists of 1v1 comparisons between Ideogram V2 and 11 other models: Recraft V2, Lumina-15-2-25, Frames-23-1-25, Imagen-3, Flux-1.1-pro, Flux-1-pro, DALL-E 3, Midjourney-5.2, Stable Diffusion 3, Aurora and Janus-7b.
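As a rough illustration of how 1v1 results like these can be summarised, the sketch below computes a per-opponent win rate for Ideogram V2 from a list of comparison outcomes. The record layout (`opponent`, `winner`) and the sample rows are hypothetical; they are not the dataset's actual schema or results.

```python
from collections import defaultdict


def win_rates(comparisons, model="Ideogram V2"):
    """Return {opponent: share of comparisons won by `model`}."""
    wins = defaultdict(int)
    totals = defaultdict(int)
    for row in comparisons:
        totals[row["opponent"]] += 1
        if row["winner"] == model:
            wins[row["opponent"]] += 1
    return {opp: wins[opp] / totals[opp] for opp in totals}


# Hypothetical outcomes, for illustration only (not taken from the dataset).
sample = [
    {"opponent": "Janus-7b", "winner": "Ideogram V2"},
    {"opponent": "Janus-7b", "winner": "Ideogram V2"},
    {"opponent": "DALL-E 3", "winner": "DALL-E 3"},
    {"opponent": "DALL-E 3", "winner": "Ideogram V2"},
]
rates = win_rates(sample)
print(rates)
```

Keeping the rates separate per opponent, rather than pooling all comparisons, avoids letting an opponent with many comparisons dominate the summary.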
## Alignment
The alignment score quantifies how well an image matches its prompt. Users were asked: "Which image matches the description better?".
<div class="vertical-container">
<div class="container">
<div class="text-center">
<q>A green banana and a yellow chair.</q>
</div>
<div class="image-container">
<div>
<h3 class="score-amount">Ideogram V2 </h3>
<div class="score-percentage">Score: 100%</div>
<img style="border: 3px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/vdcTP7xWesLU3_Nz5shJU.jpeg" width=500>
</div>
<div>
<h3 class="score-amount">Janus-7B </h3>
<div class="score-percentage">Score: 0%</div>
<img src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/MXJAFPlbHuCeQpZnutWpT.jpeg" width=500>
</div>
</div>
</div>
<div class="container">
<div class="text-center">
<q>A chair on a cat.</q>
</div>
<div class="image-container">
<div>
<h3 class="score-amount">Ideogram V2</h3>
<div class="score-percentage">Score: 0%</div>
<img src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/pUWGJaIZrtRuXgw5STgw6.jpeg" width=500>
</div>
<div>
<h3 class="score-amount">DALL-E 3</h3>
<div class="score-percentage">Score: 100%</div>
<img style="border: 3px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/ZFPgbaJtgPA_JNXG_uQxY.jpeg" width=500>
</div>
</div>
</div>
</div>
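
The percentages in the examples above can be read as each image's share of the responses in one pairwise comparison. A minimal sketch of that arithmetic follows, assuming the score is simply the votes for an image divided by the total votes for the pair. The function name and vote counts are illustrative, and the dataset itself may weight responses differently.

```python
def pair_scores(votes_a, votes_b):
    """Return each image's share of the votes in one 1v1 comparison, in percent."""
    total = votes_a + votes_b
    if total == 0:
        raise ValueError("no votes recorded for this pair")
    return 100.0 * votes_a / total, 100.0 * votes_b / total


# Illustrative vote counts (assumed, not from the dataset).
score_a, score_b = pair_scores(votes_a=12, votes_b=0)
print(f"A: {score_a:.0f}%  B: {score_b:.0f}%")
```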
## Coherence
The coherence score measures whether the generated image is logically consistent and free from artifacts or visual glitches. Without seeing the original prompt, users were asked: "Which image has **more** glitches and is **more** likely to be AI generated?"
<div class="vertical-container">
<div class="container">
<div class="image-container">
<div>
<h3 class="score-amount">Ideogram V2 </h3>
<div class="score-percentage">Glitch Rating: 0%</div>
<img style="border: 3px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/nrc6AGTwgr57G4fD1z-Vv.png" width=500>
</div>
<div>
<h3 class="score-amount">Aurora </h3>
<div class="score-percentage">Glitch Rating: 100%</div>
<img src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/GYumGz6rEv8bMQ8oJfWzA.png" width=500>
</div>
</div>
</div>
<div class="container">
<div class="image-container">
<div>
<h3 class="score-amount">Ideogram V2 </h3>
<div class="score-percentage">Glitch Rating: 100%</div>
<img src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/Hho_uJWFAjz6jB553DMaG.jpeg" width=500>
</div>
<div>
<h3 class="score-amount">Imagen-3</h3>
<div class="score-percentage">Glitch Rating: 0%</div>
<img style="border: 3px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/4iQGA4nWYlM6npaw5-vqW.jpeg" width=500>
</div>
</div>
</div>
</div>
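
Because the question asks which image has more glitches, a lower glitch rating is better, and the image with the lower rating is the one highlighted in each pair above. A small sketch of selecting the more coherent image, assuming glitch ratings are percentages between 0 and 100 (the function name and input layout are hypothetical):

```python
def more_coherent(ratings):
    """Given {model: glitch rating in percent}, return the model with the lowest rating."""
    for model, rating in ratings.items():
        if not 0 <= rating <= 100:
            raise ValueError(f"glitch rating for {model} out of range: {rating}")
    return min(ratings, key=ratings.get)


# Ratings from the first example pair above.
print(more_coherent({"Ideogram V2": 0, "Aurora": 100}))
```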
## Preference
The preference score reflects how visually appealing participants found each image, independent of the prompt. Users were asked: "Which image do you prefer?"
<div class="vertical-container">
<div class="container">
<div class="image-container">
<div>
<h3 class="score-amount">Ideogram V2</h3>
<div class="score-percentage">Score: 100%</div>
<img style="border: 3px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/X54GwmAaSk8WaTgkuuSBo.png" width=500>
</div>
<div>
<h3 class="score-amount">Janus-7b</h3>
<div class="score-percentage">Score: 0%</div>
<img src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/WwMLXX1_mb-OwYVPFZLJm.jpeg" width=500>
</div>
</div>
</div>
<div class="container">
<div class="image-container">
<div>
<h3 class="score-amount">Ideogram V2 </h3>
<div class="score-percentage">Score: 0%</div>
<img src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/75dbUVp42_wQN7ZqrJhI6.png" width=500>
</div>
<div>
<h3 class="score-amount">Flux-1.1 Pro </h3>
<div class="score-percentage">Score: 100%</div>
<img style="border: 3px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/664dcc6296d813a7e15e170e/88MyPfxR5As8XIyujL035.jpeg" width=500>
</div>
</div>
</div>
</div>
## About Rapidata
Rapidata's technology makes collecting human feedback at scale faster and more accessible than ever before. Visit [rapidata.ai](https://www.rapidata.ai/) to learn more about how we're revolutionizing human feedback collection for AI development.
Provider:
maas
Created:
2025-04-22



