
Gen-Verse/LiveCodeBench-ReasonFlux

Hugging Face | Updated 2026-02-01 | Indexed 2026-02-07
Download link:
https://hf-mirror.com/datasets/Gen-Verse/LiveCodeBench-ReasonFlux
Description:
---
license: mit
---

We use the Stdio input/output format here. For example, for a task that calculates the sum of a list, the input and output take the following form:

```python
input = "5\n1 2 3 4 5\n"
output = "15"
```

CodeContests and CodeForces use this format; however, MBPP and part of LiveCodeBench use a functional input/output format, such as:

```python
assert sum_function([1, 2, 3, 4, 5]) == 15
```

In this project, we have converted the functional format to the Stdio format for consistency.

[Paper](https://arxiv.org/abs/2506.03136) | [Code](https://github.com/Gen-Verse/CURE)

# Citation

```
@article{wang2025cure,
  title={Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning},
  author={Wang, Yinjie and Yang, Ling and Tian, Ye and Shen, Ke and Wang, Mengdi},
  journal={arXiv preprint arXiv:2506.03136},
  year={2025}
}

@article{jain2024livecodebench,
  title={Livecodebench: Holistic and contamination free evaluation of large language models for code},
  author={Jain, Naman and Han, King and Gu, Alex and Li, Wen-Ding and Yan, Fanjia and Zhang, Tianjun and Wang, Sida and Solar-Lezama, Armando and Sen, Koushik and Stoica, Ion},
  journal={arXiv preprint arXiv:2403.07974},
  year={2024}
}
```
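
The relationship between the functional and Stdio formats described in this card can be illustrated with a minimal sketch. This is not the project's actual conversion code; `stdio_main` and `run_with_stdio` are hypothetical helper names, and the two-line input layout (a count, then the values) is assumed from the sum-of-a-list example in the description.

```python
import io
import sys
from contextlib import redirect_stdout


def sum_function(values):
    """Functional-format solution: takes a list, returns its sum."""
    return sum(values)


def stdio_main():
    """Hypothetical Stdio wrapper around sum_function.

    Assumes the input layout from the card's example: the first line is
    the element count, the second line holds the space-separated values.
    """
    n = int(input())
    values = [int(token) for token in input().split()][:n]
    print(sum_function(values))


def run_with_stdio(program, stdin_text):
    """Run `program` with `stdin_text` as standard input and return
    whatever it printed, with surrounding whitespace stripped."""
    original_stdin = sys.stdin
    sys.stdin = io.StringIO(stdin_text)
    captured = io.StringIO()
    try:
        with redirect_stdout(captured):
            program()
    finally:
        # Always restore the real stdin, even if the program raises.
        sys.stdin = original_stdin
    return captured.getvalue().strip()


# The functional-format check and the Stdio-format check agree.
assert sum_function([1, 2, 3, 4, 5]) == 15
assert run_with_stdio(stdio_main, "5\n1 2 3 4 5\n") == "15"
```

Comparing stripped output strings is one simple way to check a Stdio test case; a real evaluation harness may normalize whitespace differently.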
Provider:
Gen-Verse