iNeil77/vault-class
收藏Hugging Face2025-10-23 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/iNeil77/vault-class
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个包含代码片段及其相关信息的集合,其中包括代码的SHA值、所属仓库、路径、许可证、使用的编程语言、唯一标识符、原始和简化的文档字符串、代码文本、文档和代码的词法标记、参数及其类型、异常和返回值信息等。数据集还包括代码是否有错误、代码的复杂度(抽象语法树深度)、代码长度和文档字符串长度等指标。数据集提供了一个训练集,以便进行相关研究和分析。
This dataset is a collection of code snippets and their associated information, including SHA values, repository information, file paths, licenses, programming languages, unique identifiers, original and simplified docstrings, code text, lexical tokens of docstrings and code, parameter information with types, exceptions, and return values. The dataset also includes metrics such as whether the code has errors, the complexity of the code (abstract syntax tree depth), code length, and the length of the docstrings. The dataset provides a training set for research and analysis.
提供机构:
iNeil77



