synthetic-code-training/repost_train
Source: Hugging Face. Updated 2025-12-17; indexed 2025-12-20.
Download link:
https://hf-mirror.com/datasets/synthetic-code-training/repost_train
Description:
The dataset contains information about code functions, possibly drawn from software repositories. Each record has 20 fields: instance_id, repo, base_commit, file_path, func_name, func_signature, func_docstring, func_docstring_raw, func_body, func_body_start_line, func_body_end_line, func_indent, orig_func, orig_context, eval_script, coverage_rate, coverage_report, sandbox_ast_check, repost_idx, and repost_repo_name. These fields provide metadata and metrics about the code functions. The dataset has a single train split with 7,415 examples and a total size of 54,653,030 bytes (about 54.7 MB).
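As a rough illustration of working with the fields listed above, the sketch below checks whether a record carries every expected field. The helper names `missing_fields` and `load_train_split` are hypothetical. The loader assumes the Hugging Face `datasets` package is installed and that the dataset id, taken from the download link, is publicly accessible. Neither assumption is confirmed by this listing.

```python
# Field names exactly as listed in the dataset description.
EXPECTED_FIELDS = [
    "instance_id", "repo", "base_commit", "file_path", "func_name",
    "func_signature", "func_docstring", "func_docstring_raw", "func_body",
    "func_body_start_line", "func_body_end_line", "func_indent",
    "orig_func", "orig_context", "eval_script", "coverage_rate",
    "coverage_report", "sandbox_ast_check", "repost_idx", "repost_repo_name",
]


def missing_fields(record):
    """Return the expected field names that are absent from a record (a dict)."""
    return [name for name in EXPECTED_FIELDS if name not in record]


def load_train_split():
    """Load the train split (hypothetical helper).

    Assumes the `datasets` package is installed and the dataset is reachable;
    the import is done lazily so `missing_fields` works without it.
    """
    from datasets import load_dataset

    return load_dataset("synthetic-code-training/repost_train", split="train")
```

A typical use would be to call `missing_fields(row)` on the first few rows returned by `load_train_split()` and confirm that the result is an empty list before relying on any particular column.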
Provider:
synthetic-code-training



