Reset23/the-stack-v2-blamed
收藏Hugging Face2025-01-07 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/Reset23/the-stack-v2-blamed
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了多个编程语言(Python、Java、C++、C)的代码库元数据,涵盖了代码库的ID、路径、许可证信息、时间戳、GitHub事件统计(如star和fork事件)、代码内容等信息。数据集被分割为四个部分,分别对应不同的编程语言,每个部分都有对应的字节大小和样本数量。
This dataset contains metadata of code repositories for multiple programming languages (Python, Java, C++, C), including repository IDs, paths, license information, timestamps, GitHub event statistics (such as star and fork events), and code content. The dataset is divided into four parts, each corresponding to a different programming language, with each part having its own byte size and number of samples.
提供机构:
Reset23



