ODEX
收藏arXiv2025-09-30 收录
下载链接:
https://code-eval.github.io/
下载链接
链接失效反馈官方服务:
资源简介:
该数据集名为ODEX,是首个开放领域基于执行的自然语言到Python代码生成数据集,包含了来自Stack Overflow的945个自然语言与代码对,同时配备了1,707个人工编写的测试用例以供执行。ODEX支持四种自然语言:英语、西班牙语、日语和俄语。其规模包括945个自然语言与代码对以及1,707个测试用例,旨在支持开放领域的代码生成与执行评估任务。
This dataset, named ODEX, is the first open-domain execution-based natural language-to-Python code generation dataset. It contains 945 natural language-code pairs sourced from Stack Overflow, alongside 1,707 manually written test cases for execution. ODEX supports four natural languages: English, Spanish, Japanese, and Russian. With 945 natural language-code pairs and 1,707 test cases in total, ODEX is designed to support open-domain code generation and execution evaluation tasks.
提供机构:
Authors of the paper



