TigreGotico/yes-no-multilingual
收藏Hugging Face2026-04-24 更新2026-04-26 收录
下载链接:
https://hf-mirror.com/datasets/TigreGotico/yes-no-multilingual
下载链接
链接失效反馈官方服务:
资源简介:
一个包含8,600个对话语句的数据集,用于在43种语言中对是/否/模糊回答进行分类。每个样本都是一个人可能对是/否问题的自然语言回答。数据集涵盖三类标签:yes(肯定、同意或确认)、no(否定、拒绝或不同意)和None(真正模糊,没有上下文无法解决)。数据集包含多种语言的欧洲、亚洲和中东语言,每种语言有200个样本。数据生成过程遵循严格的协议,确保每个语言的语料都是地道和真实的,没有使用机器翻译。
A dataset of 8,600 conversational utterances for classifying yes/no/ambiguous responses across 43 languages. Each sample is a natural language utterance a person might say in response to a yes/no question. The dataset covers three classes: yes (affirmation, agreement, or confirmation), no (negation, refusal, or disagreement), and None (genuinely ambiguous — cannot be resolved without context). The dataset includes a wide range of European, Asian, and Middle Eastern languages, with 200 samples per language. The data was generated by a large language model (Claude) without machine translation, ensuring idiomatic authenticity.
提供机构:
TigreGotico



