five

Identifier Name Similarities: An Exploratory Study

收藏
Figshare2025-07-22 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Identifier_Name_Similarities_An_Exploratory_Study/29127113
下载链接
链接失效反馈
官方服务:
资源简介:
This is the dataset for the paper: Identifier Name Similarities: An Exploratory StudyThis paper was accepted for publication at the International Symposium on Empirical Software Engineering and Measurement (ESEM 2024) - Emerging Results and Vision TrackPreprint: https://arxiv.org/abs/2507.18081-----The table below describes the types of files that are present in the artifact package.FilePurposeIdentifierTaxonomy.xlsxThis file contains all of the identifiers analyzed and taxonomy label assigned to each.src/pom.xmlThis file is the Maven configuration file that specifies dependencies and build instructions.src/main/java/parser/identifier/This folder contains the Java source code files that perform repository parsing and identifier extraction.src/main/java/parser/identifier/The code is written in Java. Below is a description of the files:project_repos.txt - This file is the input to GitHubRepoParser.java and details which repositories will be cloned and parsed.Database.java - This file contains a helper method to construct the database in GitHubRepoParser.java.ClassVisitor.java - This file contains helper methods that specify which identifiers will be extracted (method names, parameters, etc.) and how they are stored in the database.GitHubRepoParser.java - This file clones all of the repositories specified in project_repos.txt and filters for Java files. It then parses the Java files and stores extracted identifiers to the database.Perform the below steps to execute the code:Ensure that Java (JDK 8 or above) is installed on system.Build the project and dependenciesmvn compileRun the main programmvn exec:java -Dexec.mainClass=“GitHubRepoParser”
创建时间:
2025-07-22
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作