five

ManySStuBs4J Dataset

收藏
DataCite Commons2023-04-27 更新2025-04-17 收录
下载链接:
https://datashare.ed.ac.uk/handle/10283/3424
下载链接
链接失效反馈
官方服务:
资源简介:
The ManySStuBs4J corpus contains simple statement bugs mined from open-source Java projects hosted in GitHub. There are two variations of the dataset. One mined from the 100 Java Maven Projects and one mined from the top 1000 Java Projects. A project's popularity is determined by computing the sum of z-scores of its forks and watchers. See "README.txt" for further details.

ManySStuBs4J语料库(ManySStuBs4J corpus)收录了从GitHub托管的开源Java项目中挖掘得到的简单语句错误样本。该数据集包含两种变体:一种源自100个Java Maven项目,另一种源自排名前1000的Java项目。项目的流行度通过计算其分叉数(forks)与关注者数(watchers)的z分数(z-score)之和来确定。更多详细信息请参阅"README.txt"文件。
提供机构:
University of Edinburgh. College of Science & Engineering. School of Informatics. Institute for Language, Cognition and Computation (ILCC)
创建时间:
2019-09-30
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作