five

issai/SpokenMQA_Kazakh

收藏
Hugging Face2026-04-29 更新2026-05-03 收录
下载链接:
https://hf-mirror.com/datasets/issai/SpokenMQA_Kazakh
下载链接
链接失效反馈
官方服务:
资源简介:
该数据集是SpokenMQA基准测试的哈萨克语机器翻译版本,用于评估基于语音和音频语言模型的数学推理能力。数据集包含四个难度递增的子集:`short_digit`(短数字序列识别)、`long_digit`(长数字序列识别)、`single_step_reasoning`(需要单步算术操作的数学问题)和`multi_step_reasoning`(需要多步推理的数学问题)。原始基准测试的文本字段被机器翻译成哈萨克语,音频则使用MangiSozTTS从翻译后的文本重新合成。

This dataset is a machine-translated Kazakh adaptation of the SpokenMQA benchmark for evaluating spoken mathematical reasoning in speech-based and audio-language models. The benchmark covers four tracks of increasing difficulty: `short_digit` (recognition of short numerical sequences spoken aloud), `long_digit` (recognition of longer numerical sequences spoken aloud), `single_step_reasoning` (mathematical problems requiring a single arithmetic operation), and `multi_step_reasoning` (mathematical problems requiring multi-step reasoning over spoken input). The textual fields of the original benchmark were machine-translated into Kazakh, and the audio was re-synthesized in Kazakh from the translated transcripts using MangiSozTTS.
提供机构:
issai
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作