
stefanocarrera/autophagycode_D_he_train-mercury_Qwen3-0.6B_strategy_trust_t1_g8_metrics

Source: Hugging Face. Updated 2026-04-29. Indexed 2026-05-03.
Download link:
https://hf-mirror.com/datasets/stefanocarrera/autophagycode_D_he_train-mercury_Qwen3-0.6B_strategy_trust_t1_g8_metrics
Description:

This dataset contains per-task metadata for code-related tasks, intended for analyzing code quality and execution outcomes. Features include task ID, entry point, executability, correctness, number of tests passed and failed, test run time, error type, Halstead complexity metrics (such as vocabulary, length, volume, difficulty, effort, and time), cyclomatic complexity, maintainability index, lines of code (LOC and SLOC), comment percentage, TTR (type-token ratio), token dictionary, Shannon entropy, predictive entropy (mean and max), number of functions defined, and an entry-point repetition flag. The dataset has a single training split with 164 examples; its total size is approximately 240 KB and its download size is approximately 104 KB.
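For readers unfamiliar with the metrics named above, the following is a minimal sketch of how the type-token ratio, Shannon entropy, and the basic Halstead measures are conventionally defined. It is not the dataset author's code: the function names, the tokenization, and the operator/operand split are assumptions made for illustration, and the dataset's actual values may have been computed with a different tool or variant of these formulas.

```python
import math
from collections import Counter


def type_token_ratio(tokens):
    """Distinct tokens divided by total tokens (0.0 for empty input)."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def shannon_entropy(tokens):
    """Entropy, in bits, of the empirical token distribution."""
    total = len(tokens)
    if total == 0:
        return 0.0
    counts = Counter(tokens)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def halstead(operators, operands):
    """Textbook Halstead measures from lists of operator and operand tokens.

    The caller decides what counts as an operator or operand; that split
    is an assumption here, since tools differ on it.
    """
    n1, n2 = len(set(operators)), len(set(operands))  # distinct counts
    N1, N2 = len(operators), len(operands)            # total counts
    vocabulary = n1 + n2
    length = N1 + N2
    volume = length * math.log2(vocabulary) if vocabulary else 0.0
    difficulty = (n1 / 2) * (N2 / n2) if n2 else 0.0
    effort = difficulty * volume
    return {
        "vocabulary": vocabulary,
        "length": length,
        "volume": volume,
        "difficulty": difficulty,
        "effort": effort,
        "time": effort / 18,  # Halstead's estimate, in seconds
    }


if __name__ == "__main__":
    # Hypothetical tokenization of the statement `c = a + b`.
    ops, opnds = ["=", "+"], ["c", "a", "b"]
    print(halstead(ops, opnds))
    print(type_token_ratio(ops + opnds), shannon_entropy(ops + opnds))
```

Higher TTR and token entropy indicate a more varied token stream, while Halstead volume and effort grow with both program length and vocabulary size.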
Provider:
stefanocarrera