yiyic/Arab_train_lang_id
收藏Hugging Face2024-07-22 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/yiyic/Arab_train_lang_id
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含两个主要特征:text和lang,分别表示文本内容和语言类型。数据集仅包含一个训练分割(train),该分割包含1,999,000个示例,总大小为10,979,217,409字节。数据集的下载大小为5,312,871,006字节。数据文件路径为data/train-*。
The dataset contains two main features: text and lang, representing text content and language type, respectively. The dataset includes only one training split (train), which contains 1,999,000 examples with a total size of 10,979,217,409 bytes. The download size of the dataset is 5,312,871,006 bytes. The data file path is data/train-*.
提供机构:
yiyic



