five

Reset23/the-stack-v2-python

收藏
Hugging Face2025-01-07 更新2024-12-21 收录
下载链接:
https://hf-mirror.com/datasets/Reset23/the-stack-v2-python
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集包含与代码仓库和GitHub事件相关的信息,涵盖了blob_id、directory_id、path、content_id、detected_licenses、license_type、repo_name等特征字段,以及与GitHub事件相关的信息,如star_events_count、fork_events_count、gha_license_id等。数据集仅包含一个训练集,共有100,000个样本,总大小为512,263,384字节。

This dataset contains information related to code repositories and GitHub events, including features such as blob_id, directory_id, path, content_id, detected_licenses, license_type, repo_name, and GitHub event-related information such as star_events_count, fork_events_count, gha_license_id, etc. The dataset includes only a training set with 100,000 samples and a total size of 512,263,384 bytes.
提供机构:
Reset23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作