Reverb/arabic-eou-conversations
收藏Hugging Face2025-12-11 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/Reverb/arabic-eou-conversations
下载链接
链接失效反馈官方服务:
资源简介:
该数据集是一个用于检测话语结束(End-of-Utterance, EOU)的多方言阿拉伯语对话数据集。包含11,660个平衡的对话示例(50% EOU,50% NOT_EOU),覆盖了三种主要阿拉伯方言:Levantine(شامي)、Egyptian(مصري)和Gulf(خليجي)。数据以对话对的形式呈现,带有EOU标签,并按80%训练、10%验证、10%测试的比例划分。每个示例包含text(阿拉伯语文本)、label(二进制标签,0表示NOT_EOU,1表示EOU)和可选的context(上下文信息)。数据集收集自三种阿拉伯方言的自然对话数据,适用于EOU检测模型训练、阿拉伯语对话研究、语音助手开发和对话AI系统等用途。
This dataset contains Arabic conversational examples labeled for end-of-utterance detection. It includes natural dialogue patterns across three major Arabic dialects. Key Features: Total Examples: 11,660 (balanced 50/50 EOU/NOT_EOU); Dialects: Levantine (شامي), Egyptian (مصري), Gulf (خليجي); Format: Conversation pairs with EOU labels; Split: 80% train, 10% validation, 10% test. Each example contains: text: The utterance text in Arabic; label: Binary label (0 = NOT_EOU, 1 = EOU); context: Optional previous utterance for context.
提供机构:
Reverb



