KrzTyb/fim-dataset-512
收藏Hugging Face2025-10-12 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/KrzTyb/fim-dataset-512
下载链接
链接失效反馈官方服务:
资源简介:
fim-dataset-512是一个用于代码自动完成训练的Fill-in-the-Middle (FIM) 数据集。该数据集包含了带有FIM特殊标记的代码片段,包括前缀代码(<fim_prefix>)、后缀代码(<fim_suffix>)和需要完成的代码(<fim_middle>)。数据集分为训练集和验证集,分别包含33,140和1,744个示例。
fim-dataset-512 is a Fill-in-the-Middle (FIM) dataset for code autocompletion training. The dataset contains code snippets with FIM special tokens, including prefix code (<fim_prefix>), suffix code (<fim_suffix>) and the code to be completed (<fim_middle>). The dataset is split into a training set and a validation set, containing 33,140 and 1,744 examples respectively.
提供机构:
KrzTyb



