nicher92/the_stack_formal_v1_all_repos_with_acsl

Source: Hugging Face. Updated 2025-01-27; indexed 2025-02-15.
Download link:
https://hf-mirror.com/datasets/nicher92/the_stack_formal_v1_all_repos_with_acsl
Description:
The dataset contains features describing files from code repositories: the file hash (hexsha), size (size), extension (ext), and programming language (lang), along with information on the repository with the most stars, the repository with the most issues, and the repository with the most forks. It also includes the file content (content), the average line length (avg_line_length), the maximum line length (max_line_length), and the fraction of alphanumeric characters (alphanum_fraction). The dataset has a single training split (train) containing 177,980 examples, with a total size of approximately 2.23 GB and a download size of about 1 GB.
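
The per-file statistics named in the description can be illustrated with a minimal sketch. The function name `line_stats` is hypothetical, and the exact definitions the dataset authors used (for example, whether newlines count toward the alphanumeric fraction, or whether size is measured in bytes) are assumptions here, not something the listing confirms.

```python
def line_stats(content: str) -> dict:
    """Compute per-file statistics similar to the dataset's columns.

    Assumed definitions (not confirmed by the dataset listing):
    - size: length of the content in UTF-8 bytes
    - avg_line_length / max_line_length: measured in characters per line,
      excluding the line terminator
    - alphanum_fraction: alphanumeric characters divided by all characters,
      newlines included
    """
    lines = content.splitlines()
    # Fall back to a single zero-length line so empty files do not divide by zero.
    lengths = [len(line) for line in lines] or [0]
    alnum = sum(ch.isalnum() for ch in content)
    return {
        "size": len(content.encode("utf-8")),
        "avg_line_length": sum(lengths) / len(lengths),
        "max_line_length": max(lengths),
        "alphanum_fraction": alnum / len(content) if content else 0.0,
    }


if __name__ == "__main__":
    # A tiny C snippet with an ACSL-style annotation, used only as sample input.
    sample = "int x;\n/*@ assert x; */\n"
    print(line_stats(sample))
```

Comparing such recomputed values against the stored columns of a few records would be one way to check which definitions the dataset actually follows.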
Provider:
nicher92