five

Metrics from Use Case Diagram, Analysis Class Diagram, and Data Flow Diagram along with Software Size (SLOC) - Datasets

收藏
NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://data.mendeley.com/datasets/7jd45pkch8
下载链接
链接失效反馈
官方服务:
资源简介:
These four datasets consist of software projects—specifically student projects from a private university in Lahore, Pakistan—developed using different programming languages and application types, including desktop, command-line, and web applications. Specifically, Dataset #1 comprises 31 C++ desktop GUI applications, Dataset #2 contains 19 Java desktop GUI projects, Dataset #3 includes 11 Java command-line applications, and Dataset #4 features 12 Java web-based systems. Each dataset includes a comprehensive set of metrics derived from Use Case Diagrams (UCD), Analysis Class Diagrams (ACD), and Data Flow Diagrams (DFD), along with the corresponding software size measured in Source Lines of Code (SLOC). These datasets are utilized to compare the effectiveness of metrics derived from these three diagrams for early software size estimation. The source code (in R programming language) and results for the regression models developed using Dataset #4 are provided. The same programming procedures can be applied to build and validate regression models for other datasets as well.

本数据集组共包含四组软件项目,具体为巴基斯坦拉合尔一所私立大学的学生项目,采用多种编程语言开发,应用类型涵盖桌面应用、命令行应用与Web应用。其中,数据集#1包含31个C++桌面图形用户界面(Graphical User Interface, GUI)应用;数据集#2收录19个Java桌面GUI项目;数据集#3涵盖11个Java命令行应用;数据集#4则包含12个基于Java的Web系统。每组数据集均包含从用例图(Use Case Diagrams, UCD)、分析类图(Analysis Class Diagrams, ACD)及数据流图(Data Flow Diagrams, DFD)中提取的全套度量指标,同时附带以源代码行数(Source Lines of Code, SLOC)计量的软件规模。本数据集旨在对比上述三类图表衍生的度量指标在软件早期规模估算中的应用有效性。 本数据集还提供了使用数据集#4构建的回归模型的源代码(采用R编程语言编写)与实验结果。针对其余三组数据集构建并验证回归模型时,亦可采用相同的编程流程。
创建时间:
2026-04-02
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作