project-droid/Category-1-final
Source: Hugging Face. Updated: 2025-01-19. Indexed: 2025-04-12.
Download link:
https://hf-mirror.com/datasets/project-droid/Category-1-final
Resource summary:
This dataset contains deduplicated and cleaned code from StarCoder, The-Vault-inline, and Project_CodeNet. Cleaning removed metadata tags, and samples were filtered by AST parsability, depth, maximum line length, average line length, alphanumeric fraction, and number of lines.
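
The filtering criteria listed above can be illustrated with a minimal sketch. It is not the dataset authors' pipeline: the function names (`ast_depth`, `code_metrics`, `keep`) and every threshold are hypothetical, "depth" is assumed here to mean AST nesting depth, and Python's built-in `ast` module only handles Python source, whereas the real data may span other languages.

```python
import ast


def ast_depth(node: ast.AST) -> int:
    """Return the nesting depth of an AST (a single node has depth 1)."""
    return 1 + max((ast_depth(child) for child in ast.iter_child_nodes(node)), default=0)


def code_metrics(source: str) -> dict:
    """Compute the per-sample statistics named in the summary."""
    lines = source.splitlines()
    lengths = [len(line) for line in lines]
    try:
        tree = ast.parse(source)
        parsable, depth = True, ast_depth(tree)
    except (SyntaxError, ValueError):
        # Unparsable samples get no depth value.
        parsable, depth = False, None
    return {
        "parsable": parsable,
        "ast_depth": depth,
        "num_lines": len(lines),
        "max_line_length": max(lengths, default=0),
        "avg_line_length": sum(lengths) / len(lengths) if lengths else 0.0,
        "alnum_fraction": sum(c.isalnum() for c in source) / len(source) if source else 0.0,
    }


def keep(source: str, *, max_depth: int = 50, max_line_length: int = 1000,
         max_avg_line_length: float = 100.0, min_alnum_fraction: float = 0.25,
         min_lines: int = 1, max_lines: int = 10_000) -> bool:
    """Apply illustrative thresholds; all default values are assumptions."""
    m = code_metrics(source)
    return (
        m["parsable"]
        and m["ast_depth"] <= max_depth
        and m["max_line_length"] <= max_line_length
        and m["avg_line_length"] <= max_avg_line_length
        and m["alnum_fraction"] >= min_alnum_fraction
        and min_lines <= m["num_lines"] <= max_lines
    )
```

For example, `keep("def add(a, b):\n    return a + b\n")` passes these illustrative thresholds, while a sample with a syntax error such as `"def broken(:\n"` is rejected at the parsability check.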
Provider:
project-droid



