sumukshashidhar-archive/fantasiq
Source: Hugging Face | Updated: 2025-06-02 | Indexed: 2025-11-01
Download link:
https://hf-mirror.com/datasets/sumukshashidhar-archive/fantasiq
Description:
FantastiQ is a fictional reasoning benchmark dataset designed to evaluate the reasoning and logical abilities of language models beyond simple memorization. It comprises several subsets: basic factual Q&A, Q&A that requires reasoning, chain-of-thought reasoning Q&A, and a combined reasoning set. The data consists of fictional but internally consistent facts and scenarios, stored in JSON Lines format.
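Since the data is stored as JSON Lines, each line of a file should hold one standalone JSON object. The following is a minimal sketch of reading such a file with the Python standard library. The field names `question` and `answer` and the sample records are hypothetical and used only for illustration; the actual schema should be checked against the dataset files.

```python
import io
import json


def read_jsonl(stream):
    """Yield one parsed JSON object per non-empty line of a text stream."""
    for line_number, line in enumerate(stream, start=1):
        line = line.strip()
        if not line:
            continue  # tolerate blank lines between records
        try:
            yield json.loads(line)
        except json.JSONDecodeError as exc:
            raise ValueError(f"invalid JSON on line {line_number}: {exc}") from exc


# Hypothetical records: the field names and contents are assumptions,
# not taken from the actual dataset.
sample = io.StringIO(
    '{"question": "What color is the sky in the invented land?", "answer": "green"}\n'
    '\n'
    '{"question": "Who rules the invented land?", "answer": "a council of owls"}\n'
)

records = list(read_jsonl(sample))
for record in records:
    print(record["question"], "->", record["answer"])
```

To read a downloaded file instead of the in-memory sample, pass an open file handle, for example `open(path, encoding="utf-8")`, where `path` points to one of the dataset's `.jsonl` files.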
Provider:
sumukshashidhar-archive



