wentingzhao/stack-v2-cpp-2011-windows
Source: Hugging Face. Updated 2024-10-21. Indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/wentingzhao/stack-v2-cpp-2011-windows
Description:
This dataset contains metadata about files in GitHub repositories: file paths, license information, repository names, timestamps, and GitHub event counts. Its features include blob_id, directory_id, path, content_id, detected_licenses, license_type, repo_name, snapshot_id, revision_id, branch_name, visit_date, revision_date, committer_date, github_id, star_events_count, fork_events_count, gha_license_id, gha_event_created_at, gha_created_at, gha_language, src_encoding, language, is_vendor, is_generated, length_bytes, and extension, among others. The dataset has a single training split of 224,156 samples with a total size of 97,886,359.74 bytes.
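As a rough illustration of how the listed columns might be used, the sketch below checks a record against the 26 feature names given in the description and filters out vendored or generated files. The column types are assumptions inferred from the names (is_vendor and is_generated as booleans, length_bytes as an integer). The helper names and the sample records are hypothetical and are not taken from the dataset.

```python
# Column names exactly as listed in the dataset description.
FEATURES = [
    "blob_id", "directory_id", "path", "content_id", "detected_licenses",
    "license_type", "repo_name", "snapshot_id", "revision_id", "branch_name",
    "visit_date", "revision_date", "committer_date", "github_id",
    "star_events_count", "fork_events_count", "gha_license_id",
    "gha_event_created_at", "gha_created_at", "gha_language", "src_encoding",
    "language", "is_vendor", "is_generated", "length_bytes", "extension",
]


def missing_features(record):
    """Return the listed column names that are absent from a record."""
    return [name for name in FEATURES if name not in record]


def keep_handwritten(records, max_bytes):
    """Keep records not flagged as vendored or generated, at most max_bytes long.

    Assumes is_vendor/is_generated are booleans and length_bytes is an integer.
    """
    return [
        r for r in records
        if not r["is_vendor"]
        and not r["is_generated"]
        and r["length_bytes"] <= max_bytes
    ]


def make_record(**overrides):
    """Build a hypothetical record with every listed column present."""
    record = {name: None for name in FEATURES}
    record.update(overrides)
    return record


# Made-up rows for illustration only; none of these values come from the dataset.
sample = [
    make_record(path="/src/main.cpp", is_vendor=False,
                is_generated=False, length_bytes=1200),
    make_record(path="/third_party/zlib.c", is_vendor=True,
                is_generated=False, length_bytes=900),
    make_record(path="/gen/parser.cpp", is_vendor=False,
                is_generated=True, length_bytes=50000),
]

kept = keep_handwritten(sample, max_bytes=10_000)
print([r["path"] for r in kept])  # prints ['/src/main.cpp']
```

The same functions should work on rows obtained from the Hugging Face `datasets` library, provided each row is exposed as a dict keyed by column name.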
Provided by:
wentingzhao



