Jakh0103/glotlid_processed
收藏Hugging Face2024-11-10 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/Jakh0103/glotlid_processed
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个特征,如标签(label)、文档(doc)、哈希ID(hashids)和标签ID(label_id)。数据集被分为多个分割,包括测试集(test)、评估集(eval)和多个训练集分片(train_shard_*)。每个分割都有详细的字节大小和示例数量。
The dataset contains multiple features such as label, doc, hashids, and label_id. The dataset is divided into several splits including test, eval, and multiple train shards (train_shard_*). Each split has detailed byte sizes and example counts.
提供机构:
Jakh0103



