five

Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat

收藏
Hugging Face2025-04-03 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Lansechen/details_Lansechen__Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat
下载链接
链接失效反馈
官方服务:
资源简介:
在模型[Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat](https://huggingface.co/Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat)的评估运行期间自动创建的数据集。该数据集包含三个配置,每个配置对应于一个评估任务。数据集由9次运行的结果组成,每次运行都以其时间戳命名的分割形式存在于每个配置中。train分割始终指向最新结果。还有一个名为results的额外配置,用于存储所有运行的聚合结果。

Dataset automatically created during the evaluation run of model [Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat](https://huggingface.co/Lansechen/Qwen2.5-7B-Open-R1-GRPO-math-lighteval-noformat). The dataset consists of three configurations, each corresponding to one of the evaluation tasks. The dataset is composed of results from 9 runs, with each run found as a specific split within each configuration, named using the timestamp of the run. The train split always points to the latest results. An additional configuration results stores all aggregated results of the run.
提供机构:
Lansechen
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作