tingtang2/the_stack_v2_200k_repos-dataset
收藏Hugging Face2025-09-14 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/tingtang2/the_stack_v2_200k_repos-dataset
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含四个字段:仓库名称(repo_name)、分支名称(branch_name)、文件路径(path)和文件内容(content)。数据集主要用于训练,包含了大约2.13亿个示例,总大小为7.15GB。
The dataset includes four fields: repository name (repo_name), branch name (branch_name), file path (path), and file content (content). It is primarily for training purposes, containing approximately 21.38 million examples with a total size of 7.15GB.
提供机构:
tingtang2



