five

XUO/SWE-PolyBench

收藏
Hugging Face2025-12-19 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/XUO/SWE-PolyBench
下载链接
链接失效反馈
官方服务:
资源简介:
SWE-PolyBench是一个多语言仓库级软件工程基准测试数据集,目前包含四种语言:Python、Java、Javascript和Typescript。具体实例数量为:Javascript 1017个,Typescript 729个,Python 199个,Java 165个。数据集分为三个子集:完整数据集(AmazonScience/SWE-PolyBench)、分层抽样数据集(AmazonScience/SWE-PolyBench_500,包含500个实例)和已验证数据集(AmazonScience/SWE-PolyBench_Verified,包含394个实例)。数据集主要用于评估开源编码代理/模型,包含多个字段如instance_id、patch、repo、base_commit等,用于详细描述每个实例的信息。数据集的语言主要为英语。

SWE-PolyBench is a multi language repo level software engineering benchmark. Currently it includes 4 languages: Python, Java, Javascript, and Typescript. The number of instances in each language is: Javascript: 1017, Typescript: 729, Python: 199, Java: 165. There are total three datasets available under SWE-PolyBench. `AmazonScience/SWE-PolyBench` is the full dataset, `AmazonScience/SWE-PolyBench_500` is the stratified sampled dataset with 500 instances and `AmazonScience/SWE-PolyBench_Verified` is our verified dataset with 394 instances. The dataset is primarily in English and includes columns like instance_id, patch, repo, base_commit, etc., for detailed description of each instance.
提供机构:
XUO
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作