OALL/details_Qwen__Qwen2.5-Coder-14B
收藏Hugging Face2024-11-13 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_Qwen__Qwen2.5-Coder-14B
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是在模型Qwen/Qwen2.5-Coder-14B的评估运行期间自动创建的。它由136个配置组成,每个配置对应一个被评估的任务。数据集是从1次运行中生成的,每次运行在每个配置中表示为特定的分割,分割名称使用运行的时间戳。train分割始终指向最新的结果。此外,名为results的配置存储了运行的所有聚合结果。README还提供了如何使用`datasets`库中的`load_dataset`函数加载数据集的示例。
The dataset is automatically created during the evaluation run of the model Qwen/Qwen2.5-Coder-14B. It consists of 136 configurations, each corresponding to one of the evaluated tasks. The dataset is generated from one run, with each run represented as a specific split named by the timestamp of the run. The train split always points to the latest results. An additional configuration results stores all the aggregated results of the run. The dataset includes various metrics such as acc_norm, acc_norm_stderr, acc, and acc_stderr for different tasks and configurations.
提供机构:
OALL



