ks-pf/JMTEB-fixed
收藏Hugging Face2025-06-09 更新2025-10-25 收录
下载链接:
https://hf-mirror.com/datasets/ks-pf/JMTEB-fixed
下载链接
链接失效反馈官方服务:
资源简介:
Japanese Massive Text Embedding Benchmark是一个包含多种配置的日语文本嵌入数据集,适用于文本分类、问答、零样本分类和句子相似度等任务。每个配置都定义了特征、数据分片和文件路径。该数据集使用CC BY-SA 4.0许可证,属于大型数据集,大小在100M到1B之间。
Japanese Massive Text Embedding Benchmark is a Japanese text embedding dataset with various configurations suitable for tasks such as text classification, question answering, zero-shot classification, and sentence similarity. Each configuration specifies features, data splits, and file paths. The dataset is licensed under CC BY-SA 4.0 and falls under the large dataset category with size ranging from 100M to 1B.
提供机构:
ks-pf



