five

coml/ABX-accent

收藏
Hugging Face2025-10-29 更新2026-01-03 收录
下载链接:
https://hf-mirror.com/datasets/coml/ABX-accent
下载链接
链接失效反馈
官方服务:
资源简介:
--- license: mit --- ABX-accent ----------- The ABX-accent project is based on the preparation and evaluation of the Accented English Speech Recognition Challenge (AESRC) dataset [1], using fastABX [2] for evaluation. This repository provides all the items files you can use for evaluation. What is ABX Evaluation? ----------------------- The ABX metric evaluates whether a representation X of a speech unit (e.g., the triphone “bap”) is closer to a same-category example A (also “bap”) than to a different-category example B (e.g., “bop”). The ABX error rate is calculated by averaging the discrimination errors over all minimal triphone pairs (ie., differing only by the central phoneme) in the corpus. This benchmark focuses on the more challenging ABX across speaker task, where the X example is spoken by a different speaker than the ones in pair (A, B), testing speaker-invariant phonetic discrimination. This benchmark focuses on the more challenging ABX across speaker task, where the X example is spoken by a different speaker than the ones in pair (A, B), testing speaker-invariant phonetic discrimination. About the Dataset. ----------------------- The **[Accented English Speech Recognition Challenge](https://arxiv.org/abs/2102.10233)** dataset includes recordings from ten different regional accents: American, British, Canadian, Chinese, Indian, Japanese, Korean, Portuguese, Spanish, Russian. For academic research only. You can apply this dataset following the instructions on this page: https://www.nexdata.ai/company/sponsored-datasets. Getting Started ------------------- To begin working with the AESRC development data and run evaluations, you will find the following resources in the [GitHub repository](https://github.com/bootphon/ABX-accent). The benchmark is part of the first task of the ZeroSpeech Benchmark on https://zerospeech.com. References ----------- - [1] Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, and Lei Xie, “The accented english speech recognition challenge 2020: open datasets, tracks, baselines, results and methods,” in ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).IEEE, 2021, pp. 6918–6922. - [2] Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux "fastabx: A library for efficient computation of ABX discriminability" arXiv:2505.02692v1 [cs.CL] 5 May 2025.
提供机构:
coml
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作