yiyic/kaz_Cyrl_pan_Guru_train
收藏Hugging Face2024-07-23 更新2024-12-14 收录
下载链接:
https://hf-mirror.com/datasets/yiyic/kaz_Cyrl_pan_Guru_train
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含文本内容和对应的语言类型两个主要特征。数据集分为一个训练集,包含1,291,974个样本,总大小为8,722,280,393字节。下载大小为3,648,208,741字节。数据集的配置文件中指定了默认配置,数据文件路径为data/train-*。
The dataset contains two main features: text and lang, representing the text content and language type, respectively. The dataset is divided into one training set (train) containing 1,291,974 samples, with a total size of 8,722,280,393 bytes. The download size is 3,648,208,741 bytes. The configuration file of the dataset specifies the default configuration, with the data file path as data/train-*.
提供机构:
yiyic



