1-800-SHARED-TASKS/LID201_Devanagari_Script_Languages_Identification

Name: 1-800-SHARED-TASKS/LID201_Devanagari_Script_Languages_Identification
Creator: 1-800-SHARED-TASKS
Published: 2024-09-19 11:02:14
License: 暂无描述

Hugging Face2024-09-19 更新2024-12-14 收录

下载链接：

https://hf-mirror.com/datasets/1-800-SHARED-TASKS/LID201_Devanagari_Script_Languages_Identification

下载链接

链接失效反馈

官方服务：

资源简介：

该数据集包含文本(text)和标签(label)两种类型的字段，均为字符串格式。它分为训练集(train)，共有约303万条样本，数据大小为1002760559字节。尽管README没有提供详细的数据集描述，但从文件结构和大小来看，这可能是一个用于文本分类或标签预测的大型数据集。

The dataset consists of text and label fields, both in string format. It includes a training set (train) with approximately 3.03 million samples, totaling 1002760559 bytes in size. Although the README does not provide a detailed description of the dataset, the file structure and size suggest that this might be a large dataset for text classification or label prediction tasks.

提供机构：

1-800-SHARED-TASKS

5,000+

优质数据集

54 个

任务类型

进入经典数据集