krytonguard/git-sources-2025
收藏Hugging Face2025-11-02 更新2025-11-15 收录
下载链接:
https://hf-mirror.com/datasets/krytonguard/git-sources-2025
下载链接
链接失效反馈官方服务:
资源简介:
GitHub Code 2025: The Clean Code Manifesto是一个精心策划的数据集,包含超过150万个代表2025年代码生态系统中质量和创新的仓库。数据集分为两个子集:超过2星级的仓库和低于2星级的仓库,分别代表经过验证的质量和模式以及新兴的趋势和创新。经过严格的筛选,去除了噪声、二进制文件和无效内容,确保了数据集的纯净性和可用性。
GitHub Code 2025: The Clean Code Manifesto is a meticulously curated dataset of over 1.5 million repositories representing both quality and innovation in 2025s code ecosystem. The dataset is split into two subsets: repositories with more than 2 stars and those with fewer than 2 stars, each representing proven quality & patterns and emerging trends & innovation respectively. It has been rigorously filtered to remove noise, binary files, and irrelevant content, ensuring the datasets cleanliness and usability.
提供机构:
krytonguard



