KStack-clean
收藏arXiv2025-09-30 收录
下载链接:
https://huggingface.co/datasets/JetBrains/KStack-clean
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是从KStack精选出的一组高质量Kotlin代码样本,这些样本是通过对二元分类器获得的品质分数进行筛选而选出的。此外,该数据集使用了二元分类器从KStack中衍生而来,旨在识别高质量的示例。该数据集包含了25,000个高质量的样本,旨在用于Kotlin语言建模和代码生成任务。
This dataset comprises a curated collection of high-quality Kotlin code samples sourced from KStack, filtered based on quality scores generated by a binary classifier. Moreover, this dataset is derived from KStack using a binary classifier developed to recognize high-quality code examples. It contains 25,000 such high-quality samples, tailored for Kotlin language modeling and code generation tasks.
提供机构:
JetBrains



