samadpls/querypls-prompt2sql-dataset
Source: Hugging Face. Updated 2023-11-26. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/samadpls/querypls-prompt2sql-dataset
Resource overview:
---
dataset_info:
  features:
  - name: context
    dtype: string
  - name: answer
    dtype: string
  - name: autotrain_text
    dtype: string
  splits:
  - name: train
    num_bytes: 17419604
    num_examples: 78577
  - name: validation
    num_bytes: 17419604
    num_examples: 78577
  download_size: 13675124
  dataset_size: 34839208
configs:
- config_name: default
  data_files:
  - split: train
    path: data/train-*
  - split: validation
    path: data/validation-*
license: apache-2.0
task_categories:
- text-classification
language:
- en
---
# 📚🤖 Querypls-prompt2sql
## Dataset Information
The Querypls-prompt2sql dataset is intended for text classification tasks related to SQL query generation. Each example has the following features:
- **Context:** String
- **Answer:** String
- **Autotrain Text:** String
The dataset is split into two parts:
- **Training Set:**
  - Number of Examples: 78,577
  - Size: 17,419,604 bytes
- **Validation Set:**
  - Number of Examples: 78,577
  - Size: 17,419,604 bytes
The total download size of the dataset is 13,675,124 bytes, and the dataset size is 34,839,208 bytes.
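The figures above can be cross-checked with a few lines of arithmetic. The following is a minimal sketch that uses only the numbers listed in this card; the names `SPLITS` and `summarize` are illustrative and are not part of the dataset or any library.

```python
# Figures copied from the dataset card's metadata.
SPLITS = {
    "train": {"num_bytes": 17_419_604, "num_examples": 78_577},
    "validation": {"num_bytes": 17_419_604, "num_examples": 78_577},
}
DOWNLOAD_SIZE = 13_675_124  # bytes, as reported in the card
DATASET_SIZE = 34_839_208   # bytes, as reported in the card


def summarize(splits, download_size, dataset_size):
    """Derive totals and simple ratios from the per-split metadata."""
    total_bytes = sum(s["num_bytes"] for s in splits.values())
    total_examples = sum(s["num_examples"] for s in splits.values())
    return {
        "total_bytes": total_bytes,
        "total_examples": total_examples,
        # True when the per-split sizes add up to the reported dataset size.
        "sizes_consistent": total_bytes == dataset_size,
        # Fraction of the dataset size that is actually downloaded.
        "download_ratio": download_size / dataset_size,
        "avg_bytes_per_example": total_bytes / total_examples,
    }


summary = summarize(SPLITS, DOWNLOAD_SIZE, DATASET_SIZE)
for key, value in summary.items():
    print(f"{key}: {value}")
```

The two split sizes sum exactly to the reported dataset size, so the card's totals are internally consistent.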
## Dataset Configuration
The default configuration includes the following data files:
- **Training Split:**
  - Path: data/train-*
- **Validation Split:**
  - Path: data/validation-*
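The splits listed above could be loaded roughly as follows. This is a sketch, not an official usage example: it assumes the third-party `datasets` library is installed and that the Hugging Face Hub is reachable, and the helper names `load_splits` and `first_record` are hypothetical.

```python
REPO_ID = "samadpls/querypls-prompt2sql-dataset"
COLUMNS = ("context", "answer", "autotrain_text")  # features listed in the card


def load_splits(repo_id=REPO_ID):
    """Download the default configuration and return (train, validation)."""
    # Imported here so that defining the helpers does not require the
    # third-party dependency; calling this needs `pip install datasets`
    # and network access.
    from datasets import load_dataset

    dataset = load_dataset(repo_id)
    return dataset["train"], dataset["validation"]


def first_record(split):
    """Return the card's three string features from the first row of a split."""
    row = split[0]
    return {name: row[name] for name in COLUMNS}
```

Calling `train, validation = load_splits()` and then `first_record(train)` would show one example with its `context`, `answer`, and `autotrain_text` fields.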
The dataset is licensed under Apache-2.0.
## Task Categories
- Text Classification
## Language
- English
## How to Contribute
For information on contributing to the dataset cards, please refer to the [Hugging Face Datasets Contribution Guidelines](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards).
Provider:
samadpls