Evaluating Embedding Representations for Multiclass Code Smell Detection: A Comparative Study of CodeBERT and General-Purpose Embeddings
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://figshare.com/articles/dataset/Evaluating_Embedding_Representations_for_Multiclass_Code_Smell_Detection_A_Comparative_Study_of_CodeBERT_and_General-Purpose_Embeddings/31717957
下载链接
链接失效反馈官方服务:
资源简介:
This study uses a dataset derived from the Crowdsmelling dataset, containing instances of Long Method, God Class, and Feature Envy extracted from open-source Java projects. The original instances were consolidated into a single multiclass dataset and manually mapped to their corresponding source code fragments. As part of preprocessing, line comments and block comments were removed, while line breaks were preserved to maintain the structural organization of the code.
创建时间:
2026-03-13



