Identifier Name Similarities: An Exploratory Study
收藏Figshare2025-07-22 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Identifier_Name_Similarities_An_Exploratory_Study/29127113
下载链接
链接失效反馈官方服务:
资源简介:
This is the dataset for the paper: Identifier Name Similarities: An Exploratory StudyThis paper was accepted for publication at the International Symposium on Empirical Software Engineering and Measurement (ESEM 2024) - Emerging Results and Vision TrackPreprint: https://arxiv.org/abs/2507.18081-----The table below describes the types of files that are present in the artifact package.FilePurposeIdentifierTaxonomy.xlsxThis file contains all of the identifiers analyzed and taxonomy label assigned to each.src/pom.xmlThis file is the Maven configuration file that specifies dependencies and build instructions.src/main/java/parser/identifier/This folder contains the Java source code files that perform repository parsing and identifier extraction.src/main/java/parser/identifier/The code is written in Java. Below is a description of the files:project_repos.txt - This file is the input to GitHubRepoParser.java and details which repositories will be cloned and parsed.Database.java - This file contains a helper method to construct the database in GitHubRepoParser.java.ClassVisitor.java - This file contains helper methods that specify which identifiers will be extracted (method names, parameters, etc.) and how they are stored in the database.GitHubRepoParser.java - This file clones all of the repositories specified in project_repos.txt and filters for Java files. It then parses the Java files and stores extracted identifiers to the database.Perform the below steps to execute the code:Ensure that Java (JDK 8 or above) is installed on system.Build the project and dependenciesmvn compileRun the main programmvn exec:java -Dexec.mainClass=“GitHubRepoParser”
创建时间:
2025-07-22



