avemio/German-RAG-HARD-BENCHMARK-REASONING-EVAL-OPEN-SOURCE
收藏Hugging Face2025-01-28 更新2025-04-08 收录
下载链接:
https://hf-mirror.com/datasets/avemio/German-RAG-HARD-BENCHMARK-REASONING-EVAL-OPEN-SOURCE
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个经过评判和评估的困难推理测试数据集,支持德语和英语两种语言。数据集包含了不同模型和配置的测试数据,用于评估模型在困难推理任务上的表现。
The dataset includes multiple judged and evaluated hard reasoning test datasets, supporting both German and English languages. The datasets contain test data for different models and configurations, used for evaluating the performance of models on hard reasoning tasks.
提供机构:
avemio



