sfakhoury/NL2Fix
收藏Hugging Face2024-04-18 更新2024-04-19 收录
下载链接:
https://hf-mirror.com/datasets/sfakhoury/NL2Fix
下载链接
链接失效反馈官方服务:
资源简介:
---
size_categories:
- n<1K
---
# Dataset Card for NL2Fix
## Dataset Details
### Dataset Description
- **Language(s):** Java
### Cite the Dataset
<!-- Provide the basic links for the dataset. -->
- **Paper:** https://arxiv.org/pdf/2304.03816.pdf
```
@article{fakhoury2023towards,
title={Towards generating functionally correct code edits from natural language issue descriptions},
author={Fakhoury, Sarah and Chakraborty, Saikat and Musuvathi, Madan and Lahiri, Shuvendu K},
journal={arXiv preprint arXiv:2304.03816},
year={2023}
}
```
## Dataset Structure
<!-- This section provides a description of the dataset fields, and additional information about the dataset structure such as criteria used to create the splits, relationships between data points, etc. -->
- name: {defects4j_project}
dtype: {string}
- name: {defects4j_bug_id}
dtype: {int}
- name: {file_path}
dtype: {string}
- name: {bug_start_line}
dtype: {int}
- name: {bug_end_line}
dtype: {int}
- name: {issue_title}
dtype: {string}
- name: {issue_description}
dtype: {string}
- name: {original_src_wo_comments}
dtype: {string}
- name: {fixed_src_wo_comments}
dtype: {string}

### Source Data
<!-- This section describes the source data (e.g. news text and headlines, social media posts, translated sentences, ...). -->
Defects4J: https://github.com/rjust/defects4j/tree/master
提供机构:
sfakhoury
原始信息汇总
数据集概述
数据集名称
NL2Fix
数据集描述
语言
- 语言(s): Java
引用信息
- 论文: Towards generating functionally correct code edits from natural language issue descriptions
- 作者: Fakhoury, Sarah and Chakraborty, Saikat and Musuvathi, Madan and Lahiri, Shuvendu K
- 发表年份: 2023
数据集结构
- 字段描述:
defects4j_project: 项目名称,数据类型为字符串。defects4j_bug_id: 缺陷ID,数据类型为整数。file_path: 文件路径,数据类型为字符串。bug_start_line: 缺陷起始行号,数据类型为整数。bug_end_line: 缺陷结束行号,数据类型为整数。issue_title: 问题标题,数据类型为字符串。issue_description: 问题描述,数据类型为字符串。original_src_wo_comments: 原始代码(不含注释),数据类型为字符串。fixed_src_wo_comments: 修复后代码(不含注释),数据类型为字符串。
源数据
- 来源: Defects4J



