metabench - Paper Data
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/12819250
下载链接
链接失效反馈官方服务:
资源简介:
Item-wise accuracies in six benchmarks from Open LLM Leaderboard 1 scraped from huggingface.co and used for metabench analyses and construction. Datasets with RMSE's for random benchmark subsets are used as reference in the paper and are included here.
创建时间:
2024-07-25



