mteb/SwednClustering
收藏Hugging Face2025-10-07 更新2025-10-18 收录
下载链接:
https://hf-mirror.com/datasets/mteb/SwednClustering
下载链接
链接失效反馈官方服务:
资源简介:
SwednClustering数据集是基于瑞典报纸Dagens Nyheter的1963,576篇新闻文章构建的,时间跨度为2000年至2020年。这个数据集使用了文章的类别标签作为聚类,适用于文本分类任务。数据集包含句子和对应的标签,是一个单语言(瑞典语)的数据集。
The SwednClustering dataset is built from 1,963,576 news articles from the Swedish newspaper Dagens Nyheter, covering the period from 2000 to 2020. This dataset uses the category labels of the articles as clusters and is suitable for text classification tasks. It includes sentences and corresponding labels, and is a monolingual (Swedish) dataset.
提供机构:
mteb



