JessicaOjo/lid-training-data
收藏Hugging Face2025-06-21 更新2025-08-30 收录
下载链接:
https://hf-mirror.com/datasets/JessicaOjo/lid-training-data
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个多语言文本数据集,包含language(语言)、sentence(句子)、data_lang(数据语言)和dataset(数据集名称)等字段。具体包含了来自不同来源(如afrisenti、bloom_stories等)的训练数据,存储为Parquet格式。
This dataset is a multilingual text dataset, including fields such as language, sentence, data language, and dataset name. It specifically contains training data from different sources (such as afrisenti, bloom_stories, etc.) stored in Parquet format.
提供机构:
JessicaOjo



