alexandrlukashov/gliclass_resampled_multilang
收藏Hugging Face2025-07-10 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/alexandrlukashov/gliclass_resampled_multilang
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本(text)和与之相关的标签(all_labels和true_labels),以及语言标识(lang)。训练集包含约1199020个样本,数据集整体大小为4225270235字节。数据集适用于文本分类任务,其中可能包含多个标签,以及每个样本的实际标签。语言标识可能表明数据集中的文本语言多样性。
The dataset includes text (text) and associated labels (all_labels and true_labels), as well as language identification (lang). The training set contains approximately 1,199,020 samples, and the total size of the dataset is 4,225,270,235 bytes. The dataset is suitable for text classification tasks, potentially containing multiple labels and the actual label for each sample. The language identification may indicate the diversity of text languages in the dataset.
提供机构:
alexandrlukashov



