tingtang2/the_stack_v2_python_repos_pretraining_dataset_imported_context-dataset
收藏Hugging Face2025-09-20 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/tingtang2/the_stack_v2_python_repos_pretraining_dataset_imported_context-dataset
下载链接
链接失效反馈官方服务:
资源简介:
这是一个包含代码库信息的的数据集,具体包括代码仓库的名称、分支名称、文件路径、文件内容、文件上下文以及文件之间的导入关系。数据集被划分为训练集,提供了字节数和示例数。数据集的总大小为95344749687字节,下载大小为26283750896字节。
This dataset contains information about code repositories, including repository name, branch name, file path, file content, context of the file, and import relationships between files. The dataset is split into a training set, with the number of bytes and examples provided. The total size of the dataset is 95344749687 bytes, and the download size is 26283750896 bytes.
提供机构:
tingtang2



