five

Context-Aware Models for Automatic Source Code Summarization

收藏
Figshare2024-05-04 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Context-Aware_Models_for_Automatic_Source_Code_Summarization/25596291
下载链接
链接失效反馈
官方服务:
资源简介:
Source Code Summarization is a program comprehension task that consists of writing natural language descriptions of source code. These summaries are important because they are an essential part of software documentation, such as the descriptions in APIs. They are also necessary for maintenance of legacy soft- ware systems. The state-of-the-art for automatic source code summarization, when I started my work were neural networks developed for machine translation. They were designed to accept a snippet of source code, usually a subroutine, as a sequence of tokens and generate an English language description. These techniques were based on sequence-to-sequence learning , i.e., the summary sequence was built one word at a time, using an attention mechanism and code sequence. However, often some of the information required to summarize the subroutine descriptively is not inside the subroutine. The necessary information lives in the ”context” around the code, such as other subroutines, files, and build files, as well as the pre-learnt human knowledge. In this dissertation, I will present my research on modeling various types of contextual information for better automatic source code summarization.
创建时间:
2024-05-04
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作