opencompass/ReasonZoo
收藏Hugging Face2025-08-27 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/opencompass/ReasonZoo
下载链接
链接失效反馈官方服务:
资源简介:
ReasonZoo数据集是一个用于推理能力评估的基准数据集,包含了逻辑与谜题、数学、科学、编程和形式系统等多个领域的任务。数据集支持多模型,提供了可配置的并行处理架构,能够进行全面的评估,并允许灵活配置评估参数。此外,数据集还支持基于大型语言模型的判断功能。
The ReasonZoo dataset is a benchmark for evaluating the reasoning capabilities of large language models, covering tasks in logic and puzzles, mathematics, science, programming, and formal systems. The dataset supports multiple models, provides a configurable parallel processing architecture, allows for comprehensive evaluation, and enables flexible configuration of evaluation parameters. Additionally, the dataset supports LLM-based judging for complex reasoning tasks.
提供机构:
opencompass



