1-800-SHARED-TASKS/LID201_Devanagari_Script_Languages_Identification
收藏Hugging Face2024-09-19 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/1-800-SHARED-TASKS/LID201_Devanagari_Script_Languages_Identification
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本(text)和标签(label)两种类型的字段,均为字符串格式。它分为训练集(train),共有约303万条样本,数据大小为1002760559字节。尽管README没有提供详细的数据集描述,但从文件结构和大小来看,这可能是一个用于文本分类或标签预测的大型数据集。
The dataset consists of text and label fields, both in string format. It includes a training set (train) with approximately 3.03 million samples, totaling 1002760559 bytes in size. Although the README does not provide a detailed description of the dataset, the file structure and size suggest that this might be a large dataset for text classification or label prediction tasks.
提供机构:
1-800-SHARED-TASKS



