xszheng2020/the_stack_dedup_python_hits_5
收藏Hugging Face2025-10-05 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/xszheng2020/the_stack_dedup_python_hits_5
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个代码文件的特征信息,如文件哈希值、大小、扩展名、编程语言、最大星标仓库信息(包括路径、名称、星标数等)、最大问题数仓库信息、最大分支数仓库信息以及代码质量信号等。数据集分为训练集,共有336180个示例。
The dataset contains feature information of multiple code files, such as file hash, size, extension, programming language, information of repository with the maximum number of stars (including path, name, number of stars, etc.), repository with the maximum number of issues, repository with the maximum number of forks, and code quality signals. The dataset is split into a training set with a total of 336180 examples.
提供机构:
xszheng2020



