five

HunyuanImage-2.1_t2i_human_preference

收藏
魔搭社区2025-12-05 更新2025-12-06 收录
下载链接:
https://modelscope.cn/datasets/Rapidata/HunyuanImage-2.1_t2i_human_preference
下载链接
链接失效反馈
官方服务:
资源简介:
<style> .vertical-container { display: flex; flex-direction: column; gap: 60px; } .horizontal-container { display: flex; flex-direction: row; justify-content: center; gap: 60px; } .image-container img { max-height: 250px; /* Set the desired height */ margin:0; object-fit: contain; /* Ensures the aspect ratio is maintained */ width: auto; /* Adjust width automatically based on height */ box-sizing: content-box; } .image-container img.big { max-height: 350px; /* Set the desired height */ } .image-container { display: flex; /* Aligns images side by side */ justify-content: space-around; /* Space them evenly */ align-items: center; /* Align them vertically */ gap: .5rem } .container { width: 90%; margin: 0 auto; } .text-center { text-align: center; } .score-amount { margin: 0; margin-top: 10px; } .score-percentage {Score: font-size: 12px; font-weight: semi-bold; } </style> # Rapidata Hunyuan Image 2.1 Preference <a href="https://www.rapidata.ai"> <img src="https://cdn-uploads.huggingface.co/production/uploads/66f5624c42b853e73e0738eb/jfxR79bOztqaC6_yNNnGU.jpeg" width="400" alt="Dataset visualization"> </a> This T2I dataset contains over ~400'000 human responses from over ~50'000 individual annotators, collected in less than 7h using the [Rapidata Python API](https://docs.rapidata.ai), accessible to anyone and ideal for large scale evaluation. Evaluating Hunyuan Image 2.1 (version from 19.9.2025) across three categories: preference, coherence, and alignment. Explore our latest model rankings on our [website](https://www.rapidata.ai/benchmark). If you get value from this dataset and would like to see more in the future, please consider liking it ❤️ To add your own model to the benchmark send us an e-mail at: jason@rapidata.ai ## Overview The evaluation consists of 1v1 comparisons between Hunyuan Image 2.1 (version from 24.7.2025) and 18 other models: - 4o - Flux-1-pro - Flux-1.1-pro - imagen 4 ultra - Aurora - Imagen-3 - DALL-E 3 - Midjourney-5.2 - Frames-23-1-25 - Stable Diffusion 3 - Janus-7b. - hidream-l1-full - Recraft V2 - Ideogram V2 - halfmoon-4-4-25 - Lumina-15-2-25 - Imagen 4 Ultra 20.5.25 - Imagen 4 Ultra 24.7.25 - Recraft v3 ## Alignment The alignment score quantifies how well an video matches its prompt. Users were asked: "Which image matches the description better?". <div class="vertical-container"> <div class="container"> <div class="text-center"> <q>there is a white street sign with a orange sign underneath</q> </div> <div class="image-container"> <div> <h3 class="score-amount">Hunyuan Image 2.1 </h3> <div class="score-percentage">Score: 100%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/Ns-v3kEIxr1knlHXNNAmf.jpeg" width=500> </div> <div> <h3 class="score-amount">Lumina-15-2-25 </h3> <div class="score-percentage">Score: 0%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/mKR6eFEbA7s2tKnRSrauD.jpeg" width=500> </div> </div> </div> <div class="container"> <div class="text-center"> <q>A horse riding an astronaut.</q> </div> <div class="image-container"> <div> <h3 class="score-amount">Hunyuan Image 2.1 </h3> <div class="score-percentage">Score: 0%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/8Hk-BHX9xQtx0reEhY2hq.jpeg" width=500> </div> <div> <h3 class="score-amount">4o </h3> <div class="score-percentage">Score: 100%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/TwE5O1J4wdeXZq26TWnaG.jpeg" width=500> </div> </div> </div> </div> ## Coherence The coherence score measures whether the generated video is logically consistent and free from artifacts or visual glitches. Without seeing the original prompt, users were asked: "Which image has **more** glitches and is **more** likely to be AI generated?" <div class="vertical-container"> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">Hunyuan Image 2.1 </h3> <div class="score-percentage">Glitch Rating: 0%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/S0Y9q4V0k6SyBdTnd1W2-.jpeg" width=500> </div> <div> <h3 class="score-amount">Janus-7b </h3> <div class="score-percentage">Glitch Rating: 100%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/51q5_YQISW663JVGn9OeS.jpeg" width=500> </div> </div> </div> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">Hunyuan Image 2.1 </h3> <div class="score-percentage">Glitch Rating: 100%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/hAG_XDMRsdp8ibSpxqxSq.jpeg" width=500> </div> <div> <h3 class="score-amount">Recraft v3 </h3> <div class="score-percentage">Glitch Rating: 0%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/P14uGcfVoyr2TRv4z08Sh.jpeg" width=500> </div> </div> </div> </div> ## Preference The preference score reflects how visually appealing participants found each image, independent of the prompt. Users were asked: "Which image do you prefer?" <div class="vertical-container"> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">Hunyuan Image 2.1 </h3> <div class="score-percentage">Score: 100%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/raQvlbetTWw4aMufCsu0t.jpeg" width=500> </div> <div> <h3 class="score-amount">Midjourney-5.2 </h3> <div class="score-percentage">Score: 0%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/dTyNypeJDsxKyVxrsQPAf.jpeg" width=500> </div> </div> </div> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">Hunyuan Image 2.1 </h3> <div class="score-percentage">Score: 0%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/FJdySp53uTqgY8bNxWVt9.jpeg" width=500> </div> <div> <h3 class="score-amount">Frames-23-1-25 </h3> <div class="score-percentage">Score: 100%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/7APGfVhAC-A0ZYI92lwEZ.jpeg" width=500> </div> </div> </div> </div> ## About Rapidata Rapidata's technology makes collecting human feedback at scale faster and more accessible than ever before. Visit [rapidata.ai](https://www.rapidata.ai/) to learn more about how we're revolutionizing human feedback collection for AI development.

<style> .vertical-container { display: flex; flex-direction: column; gap: 60px; } .horizontal-container { display: flex; flex-direction: row; justify-content: center; gap: 60px; } .image-container img { max-height: 250px; /* Set the desired height */ margin:0; object-fit: contain; /* Ensures the aspect ratio is maintained */ width: auto; /* Adjust width automatically based on height */ box-sizing: content-box; } .image-container img.big { max-height: 350px; /* Set the desired height */ } .image-container { display: flex; /* Aligns images side by side */ justify-content: space-around; /* Space them evenly */ align-items: center; /* Align them vertically */ gap: .5rem } .container { width: 90%; margin: 0 auto; } .text-center { text-align: center; } .score-amount { margin: 0; margin-top: 10px; } .score-percentage { font-size: 12px; font-weight: semi-bold; } </style> # Rapidata 混元图像2.1 偏好数据集 <a href="https://www.rapidata.ai"> <img src="https://cdn-uploads.huggingface.co/production/uploads/66f5624c42b853e73e0738eb/jfxR79bOztqaC6_yNNnGU.jpeg" width="400" alt="数据集可视化"> </a> 本文本到图像生成(Text-to-Image, T2I)数据集包含超40万条人类标注反馈,由超5万名独立标注员参与,通过[Rapidata Python API](https://docs.rapidata.ai)在7小时内完成采集,面向所有用户开放,是大规模模型评估的理想选择。 本次评估针对混元图像2.1(2025年9月19日版本)的三大维度展开:偏好性、一致性与对齐性。 可在我们的[官网](https://www.rapidata.ai)查看最新的模型排名。 如果您从本数据集获益并希望未来看到更多相关内容,请点赞支持❤️ 若您希望将自己的模型加入基准测试,请发送邮件至:jason@rapidata.ai ## 概览 本次评估包含混元图像2.1(2025年7月24日版本)与其他18款模型的1对1对比: - 4o - Flux-1-pro - Flux-1.1-pro - Imagen 4 Ultra - Aurora - Imagen-3 - DALL-E 3 - Midjourney-5.2 - Frames-23-1-25 - Stable Diffusion 3 - Janus-7b - hidream-l1-full - Recraft V2 - Ideogram V2 - halfmoon-4-4-25 - Lumina-15-2-25 - Imagen 4 Ultra 20.5.25 - Imagen 4 Ultra 24.7.25 - Recraft v3 ## 对齐性 对齐性评分量化了生成图像与提示词的匹配程度。标注员被问及:“哪张图像更贴合描述?” <div class="vertical-container"> <div class="container"> <div class="text-center"> “存在一块白色路牌,下方带有橙色标识” </div> <div class="image-container"> <div> <h3 class="score-amount">混元图像2.1 </h3> <div class="score-percentage">得分:100%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/Ns-v3kEIxr1knlHXNNAmf.jpeg" width=500> </div> <div> <h3 class="score-amount">Lumina-15-2-25 </h3> <div class="score-percentage">得分:0%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/mKR6eFEbA7s2tKnRSrauD.jpeg" width=500> </div> </div> </div> <div class="container"> <div class="text-center"> “马骑宇航员” </div> <div class="image-container"> <div> <h3 class="score-amount">混元图像2.1 </h3> <div class="score-percentage">得分:0%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/8Hk-BHX9xQtx0reEhY2hq.jpeg" width=500> </div> <div> <h3 class="score-amount">4o </h3> <div class="score-percentage">得分:100%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/TwE5O1J4wdeXZq26TWnaG.jpeg" width=500> </div> </div> </div> </div> ## 一致性 一致性评分衡量生成图像的逻辑自洽性,以及是否存在伪影或视觉瑕疵。在不查看原始提示词的前提下,标注员被问及:“哪张图像的瑕疵更多,且更可能是AI生成的?” <div class="vertical-container"> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">混元图像2.1 </h3> <div class="score-percentage">瑕疵评级:0%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/S0Y9q4V0k6SyBdTnd1W2-.jpeg" width=500> </div> <div> <h3 class="score-amount">Janus-7b </h3> <div class="score-percentage">瑕疵评级:100%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/51q5_YQISW663JVGn9OeS.jpeg" width=500> </div> </div> </div> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">混元图像2.1 </h3> <div class="score-percentage">瑕疵评级:100%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/hAG_XDMRsdp8ibSpxqxSq.jpeg" width=500> </div> <div> <h3 class="score-amount">Recraft v3 </h3> <div class="score-percentage">瑕疵评级:0%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/P14uGcfVoyr2TRv4z08Sh.jpeg" width=500> </div> </div> </div> </div> ## 偏好性 偏好性评分反映了参与者对每张图像的视觉好感度,与提示词无关。标注员被问及:“你更偏好哪张图像?” <div class="vertical-container"> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">混元图像2.1 </h3> <div class="score-percentage">得分:100%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/raQvlbetTWw4aMufCsu0t.jpeg" width=500> </div> <div> <h3 class="score-amount">Midjourney-5.2 </h3> <div class="score-percentage">得分:0%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/dTyNypeJDsxKyVxrsQPAf.jpeg" width=500> </div> </div> </div> <div class="container"> <div class="image-container"> <div> <h3 class="score-amount">混元图像2.1 </h3> <div class="score-percentage">得分:0%</div> <img src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/FJdySp53uTqgY8bNxWVt9.jpeg" width=500> </div> <div> <h3 class="score-amount">Frames-23-1-25 </h3> <div class="score-percentage">得分:100%</div> <img style="border: 5px solid #18c54f;" src="https://cdn-uploads.huggingface.co/production/uploads/672b7d79fd1e92e3c3567435/7APGfVhAC-A0ZYI92lwEZ.jpeg" width=500> </div> </div> </div> </div> ## 关于Rapidata Rapidata的技术使大规模人类反馈收集比以往任何时候都更快捷、更易获取。访问[rapidata.ai](https://www.rapidata.ai/)了解我们如何革新AI开发中的人类反馈收集流程。
提供机构:
maas
创建时间:
2025-10-15
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作