yiyic/Cyrl_train_lang_id
Hugging Face | Updated: 2024-07-22 | Indexed: 2024-12-14
Download link:
https://hf-mirror.com/datasets/yiyic/Cyrl_train_lang_id
Description:
The dataset has two features: text (the text content) and lang (the language of that text). It provides a single train split with 1,999,000 examples, totaling 12,540,627,088 bytes; the download size is 5,824,792,349 bytes. According to the configuration, the data files are located under data/train-*.
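The figures above can be turned into a quick sanity check before downloading. The following is a minimal sketch, not part of the original listing: it derives the average example size and the download-to-dataset size ratio from the stated numbers, and shows one way the train split might be previewed with the Hugging Face `datasets` library in streaming mode. The helper names `size_summary` and `peek` are hypothetical, and the preview assumes the `datasets` package is installed and the repository is publicly reachable.

```python
# Figures copied from the dataset description above.
DATASET_ID = "yiyic/Cyrl_train_lang_id"
NUM_EXAMPLES = 1_999_000
DATASET_BYTES = 12_540_627_088
DOWNLOAD_BYTES = 5_824_792_349


def size_summary():
    """Derive per-example size and download/dataset ratio from the stated figures."""
    return {
        "avg_bytes_per_example": DATASET_BYTES / NUM_EXAMPLES,
        "download_to_dataset_ratio": DOWNLOAD_BYTES / DATASET_BYTES,
    }


def peek(n=3):
    """Preview the first n rows without downloading the full split.

    Assumption: the `datasets` package is installed and the repo is public.
    The import is done lazily so the size arithmetic works without it.
    """
    from datasets import load_dataset

    stream = load_dataset(DATASET_ID, split="train", streaming=True)
    # Each row is a dict with the two features named in the description.
    return [(row["lang"], row["text"][:80]) for row in stream.take(n)]


if __name__ == "__main__":
    for name, value in size_summary().items():
        print(f"{name}: {value:.4f}")
```

By these figures an example averages roughly 6.3 KB, and the download is a little under half the size of the prepared dataset. Calling `peek()` needs network access; streaming avoids fetching the full download just to inspect a few rows.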
Provider:
yiyic



