birdsql/six-gym-sqlite
收藏Hugging Face2026-03-25 更新2026-03-29 收录
下载链接:
https://hf-mirror.com/datasets/birdsql/six-gym-sqlite
下载链接
链接失效反馈官方服务:
资源简介:
---
license: cc-by-sa-4.0
language:
- en
tags:
- text-to-sql
- database
- sqlite
---
## 📢 Update 2026-03-23
We release [BIRD-Critic-SQLite](https://huggingface.co/datasets/birdsql/bird-critic-1.0-sqlite), a dataset containing 500 high-quality user issues focused on real-world SQLite database applications. This dataset is the train split of BIRD-Critic-SQLite, comprising 5,000 data instances for model training and development. Along with the dataset, we also release three RL-trained models: [BIRD-Talon-14B](https://huggingface.co/birdsql/BIRD-Talon-14b), [BIRD-Talon-7B](https://huggingface.co/birdsql/BIRD-Talon-7b), and [BIRD-Zeno-7B](https://huggingface.co/birdsql/BIRD-Zeno-7b).
### 📋 Dataset Structure
Below is a description of the dataset fields and additional information about the structure:
- **instance_id**: Unique identifier for each task.
- **issue_sql**: The buggy SQL query written by the user.
- **dialect**: The SQL dialect (SQLite).
- **version**: The dialect version (3).
- **db_id**: The name of the database.
- **clean_up_sql**: SQL queries to run after the test cases to revert any changes made to the database.
- **test_cases**: Test case functions (Python code).
- **sol_sql**: Ground-truth solution SQL.
- **query**: The user query rewritten in the BIRD environment.
- **preprocess_sql**: SQL queries to run before executing the solution or prediction.
- **category**: The task category (Query, Management, or Personalization).
提供机构:
birdsql



