five

ServiceNow-AI/Abstain-QA

收藏
Hugging Face2025-01-03 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/ServiceNow-AI/Abstain-QA
下载链接
链接失效反馈
官方服务:
资源简介:
Abstain-QA是一个评估大型语言模型拒绝回答能力的数据集,包含2900个多选问题样本,涵盖从简单的事实性问题到复杂的逻辑和概念推理挑战。样本来源于Pop-QA、MMLU和专门针对卡纳提克音乐知识盲区设计的CQA数据集。每个样本都包含一个明确的“我不知道/以上都不是”选项,用于衡量LLM的拒绝回答。

Abstain-QA is a dataset designed to evaluate the Abstention Ability of Large Language Models (LLMs), consisting of 2900 multiple-choice question answering (MCQA) samples. It covers a range from straightforward factual inquiries to complex logical and conceptual reasoning challenges. The samples are sourced from Pop-QA, MMLU, and the CQA dataset, specifically created for this work to address the gap in under-represented knowledge domains related to Carnatic Music. Each sample includes an explicit I Dont Know/None of the above option to measure LLMs abstentions.
提供机构:
ServiceNow-AI
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作