five

GitHub Java Corpus - Function Identifiers

收藏
NIAID Data Ecosystem2026-03-12 收录
下载链接:
https://zenodo.org/record/4084569
下载链接
链接失效反馈
官方服务:
资源简介:
This dataset contains function identifiers extracted from the GitHub Java Corpus (http://groups.inf.ed.ac.uk/cup/javaGithub/). Each line corresponds to a method declaration. A line contains the name of the method declaration followed by the function identifiers (i.e., function calls) contained within the method body.  The file embeddings_train.json can be used to train a word/sentence embedding model using the code in the Github repository (link below). The corpus was used for the experiments in the paper Combining Code Embedding with Static Analysis for Function-Call Completion. Github repository to replicate the experiments: https://github.com/mweyssow/cse-saner
创建时间:
2020-11-10
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作