five

EMINES/summarized-darija-msa-wiki-data

收藏
Hugging Face2025-03-16 更新2025-04-12 收录
下载链接:
https://hf-mirror.com/datasets/EMINES/summarized-darija-msa-wiki-data
下载链接
链接失效反馈
官方服务:
资源简介:
MSA-Darija摘要是EMINES组织托管的一个数据集,包含约4800条摩洛哥方言和阿拉伯语文本及其阿拉伯语摘要,旨在为开发摘要模型提供基础。该数据集适用于阿拉伯语和摩洛哥方言的文本摘要模型开发、跨方言语言处理和低资源语言研究。

The MSA-Darija Summarization Dataset is an EMINES organization-hosted dataset containing approximately 4800 text segments in Moroccan and Arabic dialects along with their Arabic summaries, designed to serve as a foundation for developing summarization models. It is suitable for developing text summarization models for Arabic and Moroccan dialects, cross-dialect language processing, and low-resource language research.
提供机构:
EMINES
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作