hotchpotch/sentence_transformer_japanese
Hugging Face, updated 2025-01-20, indexed 2025-02-15
Download link:
https://hf-mirror.com/datasets/hotchpotch/sentence_transformer_japanese
Description:
This is a Japanese dataset composed of multiple sub-datasets, each containing text pairs or text triplets suitable for contrastive learning. The data was converted using SentenceTransformers and is drawn primarily from hpprc/emb, hpprc/llmjp-kaken, hpprc/msmarco-ja, hpprc/mqa-ja, and hpprc/llmjp-warp-html.
Provided by:
hotchpotch



