carbonteq/rg-countdown-instruct-100k
收藏Hugging Face2026-04-06 更新2026-04-12 收录
下载链接:
https://hf-mirror.com/datasets/carbonteq/rg-countdown-instruct-100k
下载链接
链接失效反馈官方服务:
资源简介:
---
license: mit
pretty_name: RLVR (verl) dataset
---
# RLVR generated dataset
Procedural rows from [reasoning-gym](https://github.com/open-thought/reasoning-gym), formatted for [verl](https://github.com/verl-project/verl) GRPO.
## Build metadata
```json
{
"config": "/home/owais/Projects/rlvr/rlvr/configs/datasets/countdown-instruct.yaml",
"template_type": "qwen-instruct",
"developer_prompt": null,
"data_source": "reasoning_gym",
"default_extract": "answer_tag",
"train_rows": 100000,
"test_rows": 4096,
"train_seed": 42,
"test_seed": 43,
"tasks": {
"countdown": {
"weight": 1,
"config": {}
}
},
"reasoning_gym_version": "0.1.19"
}
```
提供机构:
carbonteq



