nick007x/github-code-2025
Source: Hugging Face | Updated: 2025-10-15 | Indexed: 2025-10-25
Download link:
https://hf-mirror.com/datasets/nick007x/github-code-2025
Description:
GitHub Code 2025 is a curated dataset of over 1.5 million repositories representing both quality and innovation in the 2025 code ecosystem. The dataset is divided into two parts: repositories with more than 2 stars, which represent proven quality and established patterns, and repositories with fewer than 2 stars, which represent emerging trends and innovation. The dataset has been filtered and cleaned to remove unnecessary noise and binary files, with the aim of keeping only high-quality code.
Provided by:
nick007x



