mohamed2811/MuffakirTriplets
收藏Hugging Face2025-12-17 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/mohamed2811/MuffakirTriplets
下载链接
链接失效反馈官方服务:
资源简介:
该数据集由我们创建和整理,用于训练和微调阿拉伯语嵌入模型。它采用三元组结构,支持对比学习和语义相似性任务,帮助模型学习阿拉伯语文本的有意义表示。该数据集已用于微调特定的嵌入模型,适用于语义搜索、信息检索、检索增强生成(RAG)系统、问答系统以及阿拉伯语NLP的研究和基准测试。数据集语言为阿拉伯语,由Mohamed Khaled创建和维护。
This dataset was created and curated by us for training and fine-tuning Arabic embedding models. It follows a triplet-style structure to support contrastive learning and semantic similarity tasks, helping models learn meaningful representations of Arabic text. The dataset has been used to fine-tune specific embedding models and is suitable for semantic search, information retrieval, Retrieval-Augmented Generation (RAG) systems, question answering systems, and research and benchmarking in Arabic NLP. The dataset is in Arabic and was created and maintained by Mohamed Khaled.
提供机构:
mohamed2811



