nyuuzyou/notabug-code
收藏Hugging Face2026-01-09 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/nyuuzyou/notabug-code
下载链接
链接失效反馈官方服务:
资源简介:
NotaBug.org代码数据集包含从19,449个在NotABug.org上托管的开源项目和私人仓库的分支中收集的源代码文件。这个数据集包含了多种编程语言,并代表了多样化的开源项目和私人仓库。数据集以JSONL格式压缩存储,并且没有验证集,所有示例都在训练集中。
The NotaBug.org Code Dataset consists of source code files collected from 19,449 repositories and branches hosted on NotABug.org, a diverse collection of open-source projects and personal repositories across various programming languages. The dataset is stored in a compressed JSONL format and includes only a train split, with no validation set.
提供机构:
nyuuzyou



