Shuu12121/java-codesearch-dataset-open
收藏Hugging Face2025-03-13 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/Shuu12121/java-codesearch-dataset-open
下载链接
链接失效反馈官方服务:
资源简介:
Java代码数据集,包含从GitHub仓库中提取的Java函数及其文档注释。数据集包括函数代码、文档注释、函数名称、编程语言(始终为Java)、仓库名称、文件路径、GitHub源文件URL和源代码许可证等信息。数据集分为训练集、验证集和测试集,分别包含764,189、125,016和75,350个示例。
Java Code Dataset containing Java functions and their documentation comments extracted from GitHub repositories. The dataset includes function code, documentation comments, function names, programming language (always Java), repository names, file paths, GitHub source file URLs, and source code license information. The dataset is split into training, validation, and test sets with 764,189, 125,016, and 75,350 examples respectively.
提供机构:
Shuu12121



