
"Mybatis-Spider"

DataCite Commons | Updated: 2025-06-21 | Indexed: 2026-05-03
Download link:
https://ieee-dataport.org/documents/mybatis-spider
Description:
"The Mybatis-Spider dataset is a large-scale, high-quality resource designed for the training and evaluation of models for generating Java MyBatis Mapper XML files from natural language. It is derived from the well-known Spider text-to-SQL dataset through a systematic restructuring and optimization process to better align with real-world software development practices. The dataset addresses the task of generating executable MyBatis Mapper files based on a combination of natural language descriptions, database schemas, and query parameters. To enhance its practical relevance, samples from the Spider dataset with identical SQL logic were consolidated, and queries with similar structures were merged to reflect the use of parameterization in actual MyBatis development. The dataset includes 5,653 pairs of data, each containing a detailed natural language description, parameters, the target Mapper XML file, and the corresponding database context. All Mapper files were generated with the assistance of GPT-4o and have been manually verified to ensure their syntactic correctness and execution accuracy in a live database environment, making it a robust benchmark for code generation tasks in the Java ecosystem."
Provider:
IEEE DataPort
Created:
2025-06-21
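
The description above says each sample combines a natural language description, parameters, a target Mapper XML file, and the database context. The following is a minimal sketch of what one such record might look like and how its Mapper file could be sanity-checked. Everything in it is an assumption made for illustration: the field names, the `MapperSample` class, the `placeholders` helper, and the example record are hypothetical and are not taken from the dataset's actual file format.

```python
import re
import xml.etree.ElementTree as ET
from dataclasses import dataclass


@dataclass
class MapperSample:
    """Hypothetical layout of one sample; field names are assumptions."""
    description: str   # natural language description of the query
    parameters: dict   # parameter name -> type name
    schema: str        # database context, here a single DDL statement
    mapper_xml: str    # target MyBatis Mapper XML


# MyBatis marks bound parameters as #{name}; capture the name part.
PLACEHOLDER = re.compile(r"#\{\s*(\w+)")


def placeholders(mapper_xml: str) -> set:
    """Parse the mapper (raises ParseError if it is not well-formed XML)
    and return the parameter names used in its <select> statements."""
    root = ET.fromstring(mapper_xml)
    names = set()
    for stmt in root.iter("select"):
        names.update(PLACEHOLDER.findall("".join(stmt.itertext())))
    return names


# Invented example record, not an actual dataset entry.
sample = MapperSample(
    description="List the names of singers older than a given age.",
    parameters={"minAge": "int"},
    schema="CREATE TABLE singer (singer_id INT PRIMARY KEY, name TEXT, age INT);",
    mapper_xml="""
<!DOCTYPE mapper PUBLIC "-//mybatis.org//DTD Mapper 3.0//EN"
  "http://mybatis.org/dtd/mybatis-3-mapper.dtd">
<mapper namespace="example.SingerMapper">
  <select id="findSingersOlderThan" resultType="string">
    SELECT name FROM singer WHERE age &gt; #{minAge}
  </select>
</mapper>
""".strip(),
)

# Placeholders used in the mapper that the sample does not declare.
missing = placeholders(sample.mapper_xml) - set(sample.parameters)
```

This check only confirms that the XML is well-formed and that every `#{...}` placeholder has a declared parameter. It does not run the query, so it is not a substitute for the execution check against a live database that the description mentions.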