ASSERT-KTH/stack-smol-docstrings
Source: Hugging Face | Updated: 2025-03-21 | Indexed: 2025-04-12
Download link:
https://hf-mirror.com/datasets/ASSERT-KTH/stack-smol-docstrings
Resource description:
The Stack-Smol-Docstrings dataset consists of Python functions extracted from the-stack-smol, filtered for the quality of their docstrings and implementations. Each sample includes the function's name, its docstring, a masked version of the code in which the function is replaced by a comment, the function's actual implementation, the start and end line numbers of the function in the original file, and the full content of the original file. The dataset is intended for code completion tasks in which a model must restore the masked function from its docstring and the surrounding context.
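
The sample layout described above can be illustrated with a minimal sketch. It is not the dataset authors' extraction pipeline: the dictionary keys (`name`, `docstring`, `masked_code`, `implementation`, `start_line`, `end_line`, `file_content`), the helper `build_sample`, and the wording of the placeholder comment are all hypothetical, chosen only to mirror the fields listed in the description. The sketch uses Python's standard `ast` module to locate a function and assumes line numbers are 1-indexed and inclusive.

```python
import ast


def build_sample(source: str, function_name: str) -> dict:
    """Build one masked-function sample from the text of a Python file.

    Field names and the placeholder comment are illustrative assumptions,
    not the actual column names or mask text used by the dataset.
    """
    tree = ast.parse(source)
    lines = source.splitlines()
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef) and node.name == function_name:
            # ast reports 1-indexed, inclusive line numbers (Python 3.8+).
            start, end = node.lineno, node.end_lineno
            indent = " " * node.col_offset
            placeholder = f"{indent}# TODO: implement {function_name}"
            # Replace the whole function body with a single comment line.
            masked = lines[: start - 1] + [placeholder] + lines[end:]
            return {
                "name": function_name,
                "docstring": ast.get_docstring(node),
                "masked_code": "\n".join(masked),
                "implementation": "\n".join(lines[start - 1 : end]),
                "start_line": start,
                "end_line": end,
                "file_content": source,
            }
    raise ValueError(f"function {function_name!r} not found")


# A tiny stand-in for an original source file.
EXAMPLE_FILE = '''import math

def area(radius):
    """Return the area of a circle with the given radius."""
    return math.pi * radius ** 2

print(area(2.0))
'''

sample = build_sample(EXAMPLE_FILE, "area")
print(sample["masked_code"])
```

In a completion setting, a model would be given `masked_code` together with `docstring` and asked to produce text that can be compared against `implementation`.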
Provided by:
ASSERT-KTH



