vtllms/sealqa
收藏Hugging Face2025-12-10 更新2025-10-18 收录
下载链接:
https://hf-mirror.com/datasets/vtllms/sealqa
下载链接
链接失效反馈官方服务:
资源简介:
SealQA是一个用于评估搜索增强语言模型在事实寻找问题上的挑战基准。该数据集特别适用于网络搜索结果存在冲突、噪声或不帮助的情况。SealQA包含了多个配置,包括seal_0、seal_hard和longseal,用于测试模型在不同难度级别上的表现。
SealQA is a challenge benchmark for evaluating Search-Augmented Language models on fact-seeking questions where web search yields conflicting, noisy, or unhelpful results. It includes multiple configurations such as seal_0, seal_hard, and longseal for testing the performance of models at different difficulty levels.
提供机构:
vtllms



