LeetCode-Hard

Source: arXiv, indexed 2025-09-30

Download link:

https://github.com/gammatauai/leetcode-hard-gym

Overview:

This dataset is used to evaluate how well large language models (LLMs) generate high-quality test cases. On this dataset, the TestChain method running on GPT-4 achieves a 13.84% improvement in accuracy over the baseline method. The task type is test case generation.
