Speech Accent Archive

Name: Speech Accent Archive
Creator: Alibaba Cloud Tianchi (阿里云天池)
Published: 2026-05-14 08:41:49
License: No description provided

Alibaba Cloud Tianchi (阿里云天池), updated 2026-05-14, indexed 2024-03-07

Download link:

https://tianchi.aliyun.com/dataset/144729

Resource description:

This dataset allows you to compare the demographic and linguistic backgrounds of the speakers in order to determine which variables are key predictors of each accent. The Speech Accent Archive demonstrates that accents are systematic rather than merely mistaken speech. The dataset contains 2,140 speech samples, each from a different talker reading the same passage. The talkers come from 177 countries and have 214 different native languages. Each talker speaks in English.

Provider:

Alibaba Cloud Tianchi (阿里云天池)

Created:

2023-01-18

Collection summary

Dataset introduction