BEE-spoke-data/google_wellformed_query-hf

Name: BEE-spoke-data/google_wellformed_query-hf
Creator: BEE-spoke-data
Published: 2025-12-29 04:40:10
License: 暂无描述

Hugging Face2025-12-29 更新2024-12-21 收录

下载链接：

https://hf-mirror.com/datasets/BEE-spoke-data/google_wellformed_query-hf

下载链接

链接失效反馈

官方服务：

资源简介：

数据集名为google_wellformed_query-hf，是从原始数据集google_wellformed_query转换而来，格式为parquet，无需使用trust_remote_code。数据集包含三个特征：rating（评分，数据类型为float32）和content（内容，数据类型为string）。数据集分为三个部分：train（训练集，包含17500个样本，大小为857383字节）、test（测试集，包含3850个样本，大小为189499字节）和validation（验证集，包含3750个样本，大小为184106字节）。数据集的总下载大小为788972字节，总大小为1230988字节。数据集的配置名为default，数据文件路径分别为data/train-*、data/test-*和data/validation-*。数据集的许可证为cc-by-sa-4.0，任务类别为文本分类，语言为英语，标签为语法和回归。

The dataset contains two main features: rating (rating, data type float32) and content (content, data type string). The dataset is divided into training set, test set, and validation set, containing 17500, 3850, and 3750 samples respectively. The total download size of the dataset is 788972 bytes, and the total dataset size is 1230988 bytes. The configuration name of the dataset is default, and the data files are stored in the paths data/train-*, data/test-*, and data/validation-*. The dataset is licensed under cc-by-sa-4.0, with task categories including text classification, language in English, and tags including grammar and regression.

提供机构：

BEE-spoke-data

5,000+

优质数据集

54 个

任务类型

进入经典数据集