rollerhafeezh-amikom/language-detection

Name: rollerhafeezh-amikom/language-detection
Creator: rollerhafeezh-amikom
Published: 2024-06-28 07:29:21
License: 暂无描述

Hugging Face2024-06-28 更新2024-06-29 收录

下载链接：

https://hf-mirror.com/datasets/rollerhafeezh-amikom/language-detection

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集名为language-detection，用于语言检测任务。数据集包含训练、验证和测试三个分割，每个分割都有对应的数据文件路径和大小。数据集的特征包括文本（text）和标签（label），其中标签为类别标签，分别代表不同的语言（id, en, es, it, sk）。数据集的总下载大小为152799字节，总大小为274202.94858444267字节。

The dataset is named language-detection and is used for language detection tasks. It includes three splits: train, validation, and test, each with corresponding data file paths and sizes. The features of the dataset include text and label, where the label is a class label representing different languages (id, en, es, it, sk). The total download size of the dataset is 152799 bytes, and the total size is 274202.94858444267 bytes.

提供机构：

rollerhafeezh-amikom

原始信息汇总

数据集概述

数据集名称

名称: language-detection

数据集配置

配置名称: default

数据文件

训练集:
- 路径: data/train-*
- 样本数量: 2185
- 字节数: 187417.00863116438
验证集:
- 路径: data/validation-*
- 样本数量: 502
- 字节数: 43075.22174819871
测试集:
- 路径: data/test-*
- 样本数量: 511
- 字节数: 43710.718205079604

数据集特征

特征名称: text
- 数据类型: string
特征名称: label
- 数据类型: class_label
- 标签名称:
  - 0: id
  - 1: en
  - 2: es
  - 3: it
  - 4: sk

数据集大小

下载大小: 152799
数据集大小: 274202.94858444267

5,000+

优质数据集

54 个

任务类型

进入经典数据集