fr3on/egyptian-dialogue
Source: Hugging Face | Updated: 2025-12-17 | Indexed: 2025-12-20
Download link:
https://hf-mirror.com/datasets/fr3on/egyptian-dialogue
Summary:
This dataset contains 4,322 parallel Egyptian Arabic-English dialogue pairs with automatically assigned domain labels. The data is extracted from TV series subtitles and features natural, conversational Egyptian Arabic (العامية المصرية). The source language is Egyptian Arabic (ar_EG), a colloquial dialect, and the target language is English (en). Egyptian Arabic is one of the most widely spoken Arabic dialects, with over 100 million speakers. The dataset offers natural conversational dialogue, colloquial expressions and idioms, domain-classified content, and episode context to support narrative understanding.
Provider:
fr3on



