five

OALL/details_Qwen__Qwen3-4B-Instruct-2507_v2

收藏
Hugging Face2025-09-12 更新2025-09-13 收录
下载链接:
https://hf-mirror.com/datasets/OALL/details_Qwen__Qwen3-4B-Instruct-2507_v2
下载链接
链接失效反馈
官方服务:
资源简介:
这个数据集是在评估模型Qwen/Qwen3-4B-Instruct-2507期间自动创建的。数据集由116个配置组成,每个配置对应一个评估任务。它包括多个运行的成果,每个运行都存储为一个特定的分割,分割名称使用运行的日期时间戳。train分割始终指向最新的结果。还有一个额外的配置results存储所有聚合的结果。可以使用Python中的datasets库加载数据集,如提供的示例所示。README还包含了特定运行的最新结果的详细信息,展示了各种任务的准确性和标准误差。

Dataset automatically created during the evaluation run of model Qwen/Qwen3-4B-Instruct-2507. The dataset consists of 116 configurations, each corresponding to an evaluated task. It includes results from multiple runs, each stored as a specific split named by the timestamp of the run. The train split always points to the latest results. An additional configuration results stores all aggregated results. The dataset can be loaded using the datasets library in Python, as shown in the provided example. The README also includes details about the latest results from a specific run, showcasing the accuracy and standard error for various tasks.
提供机构:
OALL
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作