CoDesc
收藏魔搭社区2024-09-03 更新2024-08-31 收录
下载链接:
https://modelscope.cn/datasets/OmniData/CoDesc
下载链接
链接失效反馈官方服务:
资源简介:
displayName: CoDesc
license:
- MIT
mediaTypes:
- Text
paperUrl: https://arxiv.org/pdf/2105.14220v1.pdf
publishDate: "2021"
publishUrl: https://github.com/csebuetnlp/CoDesc
publisher:
- Bangladesh University of Engineering and Technology
- University of California, Los Angeles
tags:
- Code
taskTypes:
- Code Search
- Code Summarization
---
# 数据集介绍
## 简介
CoDesc是一个4.2万个Java源代码的大型数据集,并对其并行数据进行了代码搜索和代码总结研究。
## 引文
```
@article{hasan2021codesc,
title={CoDesc: A Large Code-Description Parallel Dataset},
author={Hasan, Masum and Muttaqueen, Tanveer and Ishtiaq, Abdullah Al and Mehrab, Kazi Sajeed and Haque, Md and Anjum, Mahim and Hasan, Tahmid and Ahmad, Wasi Uddin and Iqbal, Anindya and Shahriyar, Rifat},
journal={arXiv preprint arXiv:2105.14220},
year={2021}
}
```
## Download dataset
:modelscope-code[]{type="git"}
displayName: CoDesc
license:
- MIT
mediaTypes:
- 文本
paperUrl: https://arxiv.org/pdf/2105.14220v1.pdf
publishDate: 2021年
publishUrl: https://github.com/csebuetnlp/CoDesc
publisher:
- 孟加拉国工程技术大学(Bangladesh University of Engineering and Technology)
- 加州大学洛杉矶分校(University of California, Los Angeles)
tags:
- 代码(Code)
taskTypes:
- 代码搜索(Code Search)
- 代码摘要(Code Summarization)
---
# 数据集简介
## 简介
CoDesc是一个包含4.2万条Java源代码的大型并行数据集,可用于代码搜索与代码摘要相关研究。
## 引文
@article{hasan2021codesc,
title={CoDesc: A Large Code-Description Parallel Dataset},
author={Hasan, Masum and Muttaqueen, Tanveer and Ishtiaq, Abdullah Al and Mehrab, Kazi Sajeed and Haque, Md and Anjum, Mahim and Hasan, Tahmid and Ahmad, Wasi Uddin and Iqbal, Anindya and Shahriyar, Rifat},
journal={arXiv preprint arXiv:2105.14220},
year={2021}
}
## 下载数据集
:modelscope-code[]{type="git"}
提供机构:
maas
创建时间:
2024-07-08



