tinnel123/defacto_1.5k_benchmark
收藏Hugging Face2026-04-22 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/tinnel123/defacto_1.5k_benchmark
下载链接
链接失效反馈官方服务:
资源简介:
该目录包含一个紧凑的子集,用于评估多模态任务中的证据对齐和忠实推理。数据格式遵循了DeFacto论文的思想,强调答案的正确性和证据区域的对齐。
This directory contains a compact subset for evaluating evidence alignment and faithful reasoning in multimodal tasks. The data format follows the ideas in DeFacto, emphasizing both answer correctness and evidence-region alignment.
提供机构:
tinnel123



