
Enhanced Bug Prediction in JavaScript Programs with Hybrid Call-Graph Based Invocation Metrics (Training Dataset)

Indexed from NIAID Data Ecosystem on 2026-03-12
Download link:
https://zenodo.org/record/4281475
Resource description:
This dataset consists of multiple files containing bug prediction training data. Each entry is a JavaScript function labeled as either buggy or non-buggy. Bug-related information was obtained from the ESLint project contained in BugsJS (https://github.com/BugsJS/eslint). The buggy instances were collected throughout the lifetime of the project, whereas the non-buggy entries were taken from the latest version that is tagged as fix (entries previously included as buggy were not added again as non-buggy).

The dataset is based on hybrid call graphs constructed by https://github.com/sed-szeged/hcg-js-framework. The output of this tool is a call graph in which every edge carries a confidence level indicating how likely it is to be a valid call edge. We used different threshold values from which we considered the edges to be valid. The following threshold values were used:

0.00
0.05
0.20
0.30

The prefix of each dataset file name comes from the threshold used. The datasets include the coupling metrics NII (Number of Incoming Invocations) and NOI (Number of Outgoing Invocations), which were calculated by a static source code analyzer called SourceMeter. The hybrid counterparts of these metrics (HNII and HNOI) are based on the given threshold values.

There are four variants of each of these datasets:

- Both static (NII, NOI) and hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics and information about the entries (file without any postfix). Columns contained only in this variant: ID, Name, Longname, Parent ID, Component ID, Path, Line, Column, EndLine, EndColumn
- Both static (NII, NOI) and hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics (file with '_h+s' postfix)
- Only static (NII, NOI) coupling metrics are included with additional static source code metrics (file with '_s' postfix)
- Only hybrid (HNII, HNOI) coupling metrics are included with additional static source code metrics (file with '_h' postfix)

The static source code metrics contained in all datasets are the following:

McCC - McCabe Cyclomatic Complexity
NL - Nesting Level
NLE - Nesting Level Else If
CD - Comment Density
CLOC - Comment Lines of Code
DLOC - Documentation Lines of Code
TCD - Total Comment Density (comment lines in embedded functions are also counted)
TCLOC - Total Comment Lines of Code (comment lines in embedded functions are also counted)
LLOC - Logical Lines of Code (comment and empty lines are not counted)
LOC - Lines of Code (comment and empty lines are counted)
NOS - Number of Statements
NUMPAR - Number of Parameters
TLLOC - Total Logical Lines of Code (lines in embedded functions are also counted)
TLOC - Total Lines of Code (lines in embedded functions are also counted)
TNOS - Total Number of Statements (statements in embedded functions are also counted)
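
The way a confidence threshold turns a hybrid call graph into HNII and HNOI values can be illustrated with a small sketch. This is not the authors' implementation: the edge format, the function name `hybrid_invocation_metrics`, and the example edges are hypothetical. It also assumes that an edge is kept when its confidence is greater than or equal to the threshold, and that the metrics count distinct callers and callees.

```python
from collections import defaultdict

# Threshold values listed in the dataset description.
THRESHOLDS = (0.00, 0.05, 0.20, 0.30)


def hybrid_invocation_metrics(edges, threshold):
    """Return {function: (HNII, HNOI)} for one confidence threshold.

    `edges` is an iterable of (caller, callee, confidence) tuples.
    Assumption: an edge is valid when confidence >= threshold, and
    each distinct caller or callee is counted once.
    """
    callers = defaultdict(set)  # callee -> set of functions that call it
    callees = defaultdict(set)  # caller -> set of functions it calls
    functions = set()

    for caller, callee, confidence in edges:
        # Remember every function so filtered-out nodes still get (0, 0).
        functions.update((caller, callee))
        if confidence >= threshold:
            callers[callee].add(caller)
            callees[caller].add(callee)

    return {
        name: (len(callers.get(name, ())), len(callees.get(name, ())))
        for name in functions
    }


# Hypothetical call edges, invented purely for illustration.
EXAMPLE_EDGES = [
    ("a", "b", 0.90),
    ("a", "c", 0.10),
    ("b", "c", 0.25),
    ("c", "a", 0.02),
]

for threshold in THRESHOLDS:
    metrics = hybrid_invocation_metrics(EXAMPLE_EDGES, threshold)
    print(f"{threshold:.2f}", sorted(metrics.items()))
```

With a threshold of 0.00 every edge is kept, so the hybrid metrics reflect the full hybrid call graph. Raising the threshold removes low-confidence edges and can only lower the HNII and HNOI values.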
Created:
2020-11-21