LocalDoc/triplet_dataset_azerbaijani
收藏Hugging Face2025-05-09 更新2025-07-05 收录
下载链接:
https://hf-mirror.com/datasets/LocalDoc/triplet_dataset_azerbaijani
下载链接
链接失效反馈官方服务:
资源简介:
这个数据集包含了598,000个阿塞拜疆语的triplet,每个triplet由一个基准文本、一个语义上相似的文本和一个语义上不同的文本组成。数据集适用于阿塞拜疆语的语义相似度模型训练、信息检索系统开发、语言模型微调和低资源语言研究。
This dataset contains 598,000 triplets in Azerbaijani, each consisting of an anchor text, a text similar in semantics to the anchor, and a text different in semantics from the anchor. The dataset is suitable for training semantic similarity models, developing information retrieval systems, fine-tuning language models, and research on low-resource languages in Azerbaijani.
提供机构:
LocalDoc



