tsch00001/news-eu-small
收藏Hugging Face2025-01-30 更新2025-02-15 收录
下载链接:
https://hf-mirror.com/datasets/tsch00001/news-eu-small
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含了文本内容及其对应的分词序列和单词数量。数据集被划分为训练集和测试集,其中训练集包含了494071个示例,测试集包含了5182个示例。整个数据集的大小约为1.5GB。
The dataset includes text content along with its corresponding tokenized sequence and word count. It is split into a training set and a test set, with the training set containing 494071 examples and the test set containing 5182 examples. The entire dataset is approximately 1.5GB in size.
提供机构:
tsch00001



