five

FailSafeQA

收藏
魔搭社区2026-05-09 更新2025-03-15 收录
下载链接:
https://modelscope.cn/datasets/LLM-Research/FailSafeQA
下载链接
链接失效反馈
官方服务:
资源简介:
Benchmark data introduced in the paper: \ **Expect the Unexpected: FailSafeQA Long Context for Finance** ([https://arxiv.org/abs/2502.06329](https://arxiv.org/abs/2502.06329)) Dataset count: 220 <img src="FailSafeQA.png" width="700" title="FailSafeQA"> ```json { "idx": int, "tokens": int, "context": string, "ocr_context": string, "answer": string, "query": string, "incomplete_query": string, "out-of-domain_query": string, "error_query": string, "out-of-scope_query": string, "citations": string, "citations_tokens": int } ``` Tasks variety: <img src="verb_dobj_base_new.png" width="400" title="FailSafeQA"> Root verbs and their direct objects from the first sentence of each normalized query, the top 20 verbs and their top five direct object. Tasks types: - 83.0% question answering (QA) - 17.0% involving text generation (TG) To cite FailSafeQA, please use: ``` @misc{kiran2024failsafeqa, title={Expect the Unexpected: FailSafeQA Long Context for Finance}, author={Kiran Kamble and Melisa Russak and Dmytro Mozolevskyi and Muayad Ali and Mateusz Russak and Waseem AlShikh}, year={2024}, eprint={2502.06329}, archivePrefix={arXiv}, primaryClass={cs.CL} } ```

本论文提出的基准数据集:**《意料之外:面向金融领域的长上下文FailSafeQA数据集》**(论文链接:https://arxiv.org/abs/2502.06329) 数据集规模:220条 <img src="FailSafeQA.png" width="700" title="FailSafeQA"> 单条数据的字段结构如下: json { "索引(idx)": 整数, "Token数量(tokens)": 整数, "上下文(context)": 字符串, "OCR上下文(ocr_context)": 字符串, "答案(answer)": 字符串, "查询(query)": 字符串, "不完整查询(incomplete_query)": 字符串, "域外查询(out-of-domain_query)": 字符串, "错误查询(error_query)": 字符串, "越域查询(out-of-scope_query)": 字符串, "引用文献(citations)": 字符串, "引用文献Token数(citations_tokens)": 整数 } 任务多样性:<img src="verb_dobj_base_new.png" width="400" title="FailSafeQA"> 该统计基于每条标准化查询的首句提取核心动词及其直接宾语,展示了排名前20的动词及其对应的前5个直接宾语。 任务类型: - 83.0% 为问答(QA)任务 - 17.0% 涉及文本生成(TG)任务 若需引用FailSafeQA数据集,请使用以下格式: @misc{kiran2024failsafeqa, title={Expect the Unexpected: FailSafeQA Long Context for Finance}, author={Kiran Kamble and Melisa Russak and Dmytro Mozolevskyi and Muayad Ali and Mateusz Russak and Waseem AlShikh}, year={2024}, eprint={2502.06329}, archivePrefix={arXiv}, primaryClass={cs.CL} }
提供机构:
maas
创建时间:
2025-03-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作