five

mohamed2811/MuffakirTriplets

收藏
Hugging Face2025-12-17 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/mohamed2811/MuffakirTriplets
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集由我们创建和整理,用于训练和微调阿拉伯语嵌入模型。它采用三元组结构,支持对比学习和语义相似性任务,帮助模型学习阿拉伯语文本的有意义表示。该数据集已用于微调特定的嵌入模型,适用于语义搜索、信息检索、检索增强生成(RAG)系统、问答系统以及阿拉伯语NLP的研究和基准测试。数据集语言为阿拉伯语,由Mohamed Khaled创建和维护。

This dataset was created and curated by us for training and fine-tuning Arabic embedding models. It follows a triplet-style structure to support contrastive learning and semantic similarity tasks, helping models learn meaningful representations of Arabic text. The dataset has been used to fine-tune specific embedding models and is suitable for semantic search, information retrieval, Retrieval-Augmented Generation (RAG) systems, question answering systems, and research and benchmarking in Arabic NLP. The dataset is in Arabic and was created and maintained by Mohamed Khaled.
提供机构:
mohamed2811
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作