aloobun/indic-subset-balanced-v2
Source: Hugging Face. Updated 2025-03-28. Indexed 2025-04-12.
Download link:
https://hf-mirror.com/datasets/aloobun/indic-subset-balanced-v2
Resource summary:
The dataset contains text and a corresponding language label for each entry. It has a single split, the training set, with 365344 samples and a total size of 4065134865 bytes. It is suitable for natural language processing tasks such as text classification and language identification.
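To put the quoted figures in perspective, the short sketch below converts the byte count into larger units and derives a mean size per sample. The two constants come from the summary; the helper name `size_summary` is hypothetical, and the mean is only approximate because the reported byte total may include storage overhead beyond the raw text.

```python
# Figures quoted in the resource summary.
NUM_SAMPLES = 365_344
TOTAL_BYTES = 4_065_134_865


def size_summary(num_samples: int, total_bytes: int) -> dict:
    """Return the total size in GB and GiB plus the mean bytes per sample.

    Hypothetical helper for illustration; not part of any dataset tooling.
    """
    return {
        "gb": total_bytes / 10**9,    # decimal gigabytes
        "gib": total_bytes / 2**30,   # binary gibibytes
        "mean_bytes_per_sample": total_bytes / num_samples,
    }


summary = size_summary(NUM_SAMPLES, TOTAL_BYTES)
print(f"{summary['gb']:.2f} GB, {summary['gib']:.2f} GiB")
print(f"{summary['mean_bytes_per_sample']:.0f} bytes per sample on average")
```

The result works out to roughly 4.07 GB in total and about 11 KB per sample, which suggests the entries are document-length texts rather than single sentences.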
Provider:
aloobun



