yiyic/Atlatic_train_lang_script_id
Source: Hugging Face. Updated 2024-07-22. Indexed 2024-12-14.
Download link:
https://hf-mirror.com/datasets/yiyic/Atlatic_train_lang_script_id
Overview:
The dataset has two main features: the text content and its language type, named text and lang respectively. It has a single split, train, containing 4,997,500 samples with a total size of 20,291,440,169 bytes (about 20.3 GB). The download size is 11,451,552,362 bytes (about 11.5 GB). There is one configuration, named default, whose data files match the path data/train-*.
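The figures above suggest that streaming may be preferable to a full download. The following is a minimal sketch, not an official loading recipe: it derives rough per-sample and compression figures from the quoted sizes, and shows one way to peek at a few rows with the Hugging Face `datasets` library. The helper names `size_summary` and `peek` are hypothetical, the repository ID is assumed to match the name in the download link, and the `text` field is assumed to hold plain strings.

```python
from itertools import islice

# Figures quoted in the overview above.
NUM_SAMPLES = 4_997_500
DATASET_BYTES = 20_291_440_169
DOWNLOAD_BYTES = 11_451_552_362


def size_summary():
    """Derive rough per-sample and compression figures from the quoted sizes."""
    return {
        "avg_bytes_per_sample": DATASET_BYTES / NUM_SAMPLES,
        "dataset_gib": DATASET_BYTES / 2**30,
        "download_gib": DOWNLOAD_BYTES / 2**30,
        "download_ratio": DOWNLOAD_BYTES / DATASET_BYTES,
    }


def peek(n=3, repo_id="yiyic/Atlatic_train_lang_script_id"):
    """Stream the first n rows of the train split without a full download.

    Assumes the repository ID on the Hub matches the name in the download
    link and that each row exposes the `text` and `lang` fields.
    """
    # Imported lazily so the size arithmetic runs without the library installed.
    from datasets import load_dataset

    stream = load_dataset(repo_id, split="train", streaming=True)
    # Truncate the text so long samples stay readable when printed.
    return [(row["lang"], row["text"][:80]) for row in islice(stream, n)]


if __name__ == "__main__":
    for key, value in size_summary().items():
        print(f"{key}: {value:.2f}")
```

Calling `peek()` requires network access and the `datasets` package; `size_summary()` needs neither. The quoted sizes work out to roughly 4 KB per sample on average, which is why streaming a handful of rows is a cheap way to inspect the data before committing to the full download.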
Provider:
yiyic



