FLUB
收藏arXiv2024-02-17 更新2024-06-21 收录
下载链接:
https://github.com/THUKElab/FLUB
下载链接
链接失效反馈官方服务:
资源简介:
FLUB是由清华大学创建的一个高质量数据集,专注于评估大型语言模型(LLMs)对谬误理解的能力。该数据集包含844个精心挑选的狡猾问题,这些问题在人类看来容易理解,但对模型来说极具挑战性。FLUB的数据来源于中国知名的在线论坛“弱智吧”,该论坛以其狡猾和不合理的发帖而闻名。数据集的创建过程包括数据清洗和标注,确保了数据的质量和适用性。FLUB的应用领域主要集中在推动LLMs对谬误的理解能力,从而提高它们处理复杂现实世界问题的能力。
FLUB is a high-quality dataset developed by Tsinghua University, focusing on evaluating the capacity of large language models (LLMs) to comprehend fallacies. The dataset contains 844 carefully selected tricky questions that appear easy for humans to understand but are extremely challenging for models. The data of FLUB is sourced from "Ruozhiba Bar", a well-known Chinese online forum famous for its tricky and illogical posts. The creation process of FLUB includes data cleaning and annotation, ensuring the quality and applicability of the dataset. The main application areas of FLUB center on promoting LLMs' understanding of fallacies, thereby improving their ability to handle complex real-world problems.
提供机构:
清华大学
创建时间:
2024-02-17



