HussainKAUST/saudi-data-eou.jsonl
Source: Hugging Face | Updated: 2025-12-13 | Indexed: 2025-12-20
Download link:
https://hf-mirror.com/datasets/HussainKAUST/saudi-data-eou.jsonl
Description:
This dataset is designed for End-of-Utterance (EOU) detection in Arabic conversational AI, with a focus on the Saudi dialect (ar-SA). It is used for binary classification: 0 denotes an incomplete utterance (the speaker is likely to continue), and 1 denotes a complete utterance (end of turn). Each entry is a JSON object with two fields: text (an Arabic conversational utterance) and label (0 or 1).
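The entry format described above can be sketched in a few lines of Python. This is a minimal illustration, assuming the file is standard JSON Lines with one object per line. The two sample records and the helper name `parse_eou_lines` are hypothetical, made up for this example and not taken from the dataset.

```python
import json

# Hypothetical sample lines, invented for illustration only;
# they are NOT records from the actual dataset.
SAMPLE_JSONL = """\
{"text": "أبغى أحجز موعد بكرة", "label": 1}
{"text": "أبغى أحجز موعد بس", "label": 0}
"""


def parse_eou_lines(lines):
    """Parse JSONL lines into (text, label) pairs, checking the label is 0 or 1."""
    records = []
    for line_no, line in enumerate(lines, start=1):
        line = line.strip()
        if not line:
            continue  # skip blank lines
        entry = json.loads(line)
        text, label = entry["text"], entry["label"]
        if label not in (0, 1):
            raise ValueError(f"line {line_no}: unexpected label {label!r}")
        records.append((text, label))
    return records


records = parse_eou_lines(SAMPLE_JSONL.splitlines())

# Split by class: 1 = complete utterance (end of turn), 0 = incomplete.
complete = [text for text, label in records if label == 1]
incomplete = [text for text, label in records if label == 0]
print(len(complete), len(incomplete))
```

To read the real file, the same helper could be fed an open file handle (for example `open(path, encoding="utf-8")`), where `path` is wherever the downloaded file was saved.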
Provider:
HussainKAUST



