ZorraZabb/codePuPu
收藏Hugging Face2024-09-13 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/ZorraZabb/codePuPu
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征字段,包括文本内容(text)、目录(dir)、语言(lang)、创建日期(created_date)、更新日期(updated_date)、仓库名称(repo_name)、完整仓库名称(repo_full_name)、星标数(star)和文本长度(len_tokens)。数据集分为训练集(train),包含2031643个样本,总大小为21486181891字节。
This dataset includes multiple feature fields such as text content (text), directory (dir), language (lang), creation date (created_date), update date (updated_date), repository name (repo_name), full repository name (repo_full_name), star count (star), and text length (len_tokens). The dataset is divided into a training set (train) containing 2031643 samples, with a total size of 21486181891 bytes.
提供机构:
ZorraZabb



