
sfakhoury/NL2Fix

Hugging Face | Updated 2024-04-18 | Indexed 2024-04-19
Download link:
https://hf-mirror.com/datasets/sfakhoury/NL2Fix
Resource overview:
---
size_categories:
- n<1K
---

# Dataset Card for NL2Fix

## Dataset Details

### Dataset Description

- **Language(s):** Java

### Cite the Dataset

- **Paper:** https://arxiv.org/pdf/2304.03816.pdf

```
@article{fakhoury2023towards,
  title={Towards generating functionally correct code edits from natural language issue descriptions},
  author={Fakhoury, Sarah and Chakraborty, Saikat and Musuvathi, Madan and Lahiri, Shuvendu K},
  journal={arXiv preprint arXiv:2304.03816},
  year={2023}
}
```

## Dataset Structure

- name: `defects4j_project`, dtype: `string`
- name: `defects4j_bug_id`, dtype: `int`
- name: `file_path`, dtype: `string`
- name: `bug_start_line`, dtype: `int`
- name: `bug_end_line`, dtype: `int`
- name: `issue_title`, dtype: `string`
- name: `issue_description`, dtype: `string`
- name: `original_src_wo_comments`, dtype: `string`
- name: `fixed_src_wo_comments`, dtype: `string`

![image/png](https://cdn-uploads.huggingface.co/production/uploads/65486006b8ac1a89ff27544a/OPRTy4jgq9BOsAOg1fs6b.png)

### Source Data

Defects4J: https://github.com/rjust/defects4j/tree/master
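The nine fields listed under Dataset Structure can be checked mechanically before a row is used. The following is a minimal sketch, not part of the dataset card: the `SCHEMA` mapping, the `validate_row` helper, and every value in `EXAMPLE_ROW` are hypothetical and exist only for illustration. It assumes `string` maps to Python `str` and `int` to Python `int`.

```python
# Field names and dtypes as listed in the dataset card
# (assumption: string -> str, int -> int).
SCHEMA = {
    "defects4j_project": str,
    "defects4j_bug_id": int,
    "file_path": str,
    "bug_start_line": int,
    "bug_end_line": int,
    "issue_title": str,
    "issue_description": str,
    "original_src_wo_comments": str,
    "fixed_src_wo_comments": str,
}


def validate_row(row):
    """Return a copy of `row` restricted to the schema fields.

    Raises KeyError if a field is missing and TypeError if a value
    does not have the expected type.
    """
    for name, expected in SCHEMA.items():
        if name not in row:
            raise KeyError(f"missing field: {name}")
        value = row[name]
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, expected):
            raise TypeError(
                f"{name}: expected {expected.__name__}, "
                f"got {type(value).__name__}"
            )
    return {name: row[name] for name in SCHEMA}


# Hypothetical row: the values below are made up and are NOT taken
# from the dataset.
EXAMPLE_ROW = {
    "defects4j_project": "Lang",
    "defects4j_bug_id": 1,
    "file_path": "src/Example.java",
    "bug_start_line": 10,
    "bug_end_line": 12,
    "issue_title": "max returns the smaller value",
    "issue_description": "Calling max(1, 2) returns 1.",
    "original_src_wo_comments": "int max(int a, int b) { return a < b ? a : b; }",
    "fixed_src_wo_comments": "int max(int a, int b) { return a > b ? a : b; }",
}

checked = validate_row(EXAMPLE_ROW)
```

Real rows could presumably be obtained with the Hugging Face `datasets` library, for example `load_dataset("sfakhoury/NL2Fix")`; the card does not state the split names, so inspect the returned object before indexing into it.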
Provider:
sfakhoury
Summary of original information

Dataset overview

Dataset name

NL2Fix

Dataset description

Language

  • Language(s): Java

Citation information

Dataset structure

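  • Field descriptions:
    • defects4j_project: project name; data type: string.
    • defects4j_bug_id: bug ID; data type: integer.
    • file_path: file path; data type: string.
    • bug_start_line: starting line number of the bug; data type: integer.
    • bug_end_line: ending line number of the bug; data type: integer.
    • issue_title: issue title; data type: string.
    • issue_description: issue description; data type: string.
    • original_src_wo_comments: original source code (without comments); data type: string.
    • fixed_src_wo_comments: fixed source code (without comments); data type: string.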

Source data
