
mteb/svq

Source: Hugging Face. Updated 2026-02-05. Indexed 2026-02-07.
Download link:
https://hf-mirror.com/datasets/mteb/svq
Description:
Simple Voice Questions (SVQ) is a set of short audio questions recorded in 26 locales across 17 languages under multiple audio conditions. Speakers recorded the audio on their own devices in four environments (clean, background speech noise, traffic noise, media noise). The query text comes from the validation and test sets of the XTREME-UP retrieval and question-answering benchmark datasets. The dataset includes 700 unique speakers, with recorded speaker gender (female, male, non_binary, no_answer) and age information. The dataset is not pre-divided into training, validation, or test subsets; it is released as a single collection, so users need to design their own splitting strategy.
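
Because the collection ships without predefined splits, one reasonable strategy is a speaker-disjoint split, so that no speaker's voice appears in both training and evaluation data. The following is a minimal sketch under stated assumptions, not the dataset authors' method: the field names `speaker_id` and `locale` and the helper `speaker_disjoint_split` are hypothetical, and the real column names in mteb/svq should be checked before use.

```python
import random


def speaker_disjoint_split(records, speaker_key="speaker_id",
                           fractions=(0.8, 0.1, 0.1), seed=0):
    """Split records so that each speaker lands in exactly one subset.

    `speaker_key` is a hypothetical field name; substitute the actual
    speaker column of the dataset.
    """
    if abs(sum(fractions) - 1.0) > 1e-9:
        raise ValueError("fractions must sum to 1")

    # Sort before shuffling so the result depends only on the seed,
    # not on the order in which records arrive.
    speakers = sorted({r[speaker_key] for r in records})
    random.Random(seed).shuffle(speakers)

    n_train = int(len(speakers) * fractions[0])
    n_val = int(len(speakers) * fractions[1])

    # Map each speaker to one subset; the remainder goes to test.
    assignment = {}
    for i, speaker in enumerate(speakers):
        if i < n_train:
            assignment[speaker] = "train"
        elif i < n_train + n_val:
            assignment[speaker] = "validation"
        else:
            assignment[speaker] = "test"

    splits = {"train": [], "validation": [], "test": []}
    for r in records:
        splits[assignment[r[speaker_key]]].append(r)
    return splits


# Toy records standing in for SVQ rows (illustrative values only).
toy = [{"speaker_id": f"spk{s}", "locale": "xx", "utt": u}
       for s in range(10) for u in range(3)]
splits = speaker_disjoint_split(toy)
print({name: len(rows) for name, rows in splits.items()})
```

The fractions here apply to speakers, not utterances, so subset sizes in utterances can drift from the nominal ratio when speakers contribute unequal amounts of audio. Depending on the evaluation goal, it may also be worth stratifying by locale or recording environment, which this sketch does not do.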
Provider:
mteb