mteb/BlurbsClusteringP2P
收藏Hugging Face2025-05-06 更新2025-05-31 收录
下载链接:
https://hf-mirror.com/datasets/mteb/BlurbsClusteringP2P
下载链接
链接失效反馈官方服务:
资源简介:
BlurbsClusteringP2P数据集是一个德语(deu)单语数据集,用于文本分类任务。数据集包含句子和标签等特征,测试集包含28个示例。该数据集是MTEB(大规模文本嵌入基准)的一部分,用于将书籍标题和简介聚类到不同的流派。数据集用于评估在MTEB任务上的嵌入模型。README还包含数据集的统计数据和使用MTEB库评估模型的方法。
The BlurbsClusteringP2P dataset is a monolingual German (deu) dataset used for text classification tasks. It includes features such as sentences and labels, and has a test split with 28 examples. The dataset is part of the MTEB (Massive Text Embedding Benchmark) and is used for clustering book titles and blurbs into genres. The dataset is used for evaluating embedding models on the MTEB task. The README also includes dataset statistics and instructions on how to evaluate models on the dataset using the MTEB library.
提供机构:
mteb



