SPIDER

Name: SPIDER
Creator: OpenDataLab
Published: 2024-03-25 05:40:32
License: No description provided

OpenCSG, updated 2024-03-25, indexed 2026-01-19

Download link:

https://opencsg.com/datasets/OpenDataLab/SPIDER?tab=summary

Description:

Spider is a large-scale, complex, cross-domain semantic parsing and text-to-SQL dataset annotated by 11 Yale University students. The goal of the Spider challenge is to develop natural language interfaces to cross-domain databases. The dataset consists of 10,181 questions and 5,693 unique complex SQL queries on 200 multi-table databases covering 138 different domains. In Spider 1.0, the training and test sets contain different complex SQL queries and different databases. To do well, a system must therefore generalize not only to new SQL queries but also to new database schemas.
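
As a rough illustration of the generalization requirement described above, the following Python sketch splits examples by database so that no database appears in both splits, and checks a predicted query against a gold query by running both on SQLite. The records, the field names (`db_id`, `question`, `query`), the schema, and both helper functions are hypothetical and are not taken from the Spider files. The execution comparison is only one possible check and is not claimed to be the benchmark's official metric.

```python
import sqlite3

# Hypothetical question/SQL pairs; not actual Spider records.
EXAMPLES = [
    {"db_id": "concerts", "question": "How many singers are there?",
     "query": "SELECT count(*) FROM singer"},
    {"db_id": "concerts", "question": "List the names of all singers.",
     "query": "SELECT name FROM singer"},
    {"db_id": "pets", "question": "How many pets are there?",
     "query": "SELECT count(*) FROM pet"},
]

# Hypothetical schema and rows for the "concerts" database.
SCHEMA_SQL = """
CREATE TABLE singer (id INTEGER PRIMARY KEY, name TEXT, age INTEGER);
INSERT INTO singer VALUES (1, 'A', 30), (2, 'B', 25);
"""


def split_by_database(examples, test_db_ids):
    """Split so that every database lands entirely in train or in test."""
    train = [ex for ex in examples if ex["db_id"] not in test_db_ids]
    test = [ex for ex in examples if ex["db_id"] in test_db_ids]
    return train, test


def execution_match(schema_sql, gold_sql, predicted_sql):
    """Return True if both queries yield the same rows, ignoring row order."""
    conn = sqlite3.connect(":memory:")
    try:
        conn.executescript(schema_sql)
        gold_rows = conn.execute(gold_sql).fetchall()
        try:
            predicted_rows = conn.execute(predicted_sql).fetchall()
        except sqlite3.Error:
            # A prediction that fails to run counts as a mismatch.
            return False
        return sorted(gold_rows, key=repr) == sorted(predicted_rows, key=repr)
    finally:
        conn.close()


if __name__ == "__main__":
    train, test = split_by_database(EXAMPLES, {"pets"})
    print(len(train), "training examples,", len(test), "test examples")
    print(execution_match(SCHEMA_SQL,
                          "SELECT count(*) FROM singer",
                          "SELECT count(id) FROM singer"))
```

Because the split is keyed on the database identifier rather than on individual questions, a system evaluated on the held-out split sees schemas it never encountered during training.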

Provider:

OpenDataLab

Created:

2024-03-25
