mispeech/speechocean762
Dataset Overview
Basic Information
- Language: English
- License: Apache 2.0
- Size category: 1K<n<10K
- Task category: automatic speech recognition
- Name: speechocean762
- Tags: pronunciation scoring
Configuration
- Config name: default
- Data files:
  - Train: data/train-*
  - Test: data/test-*
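The split-to-file mapping above uses glob patterns. As a rough illustration of how such patterns select files, the sketch below matches them with Python's `fnmatch`. The file names in `candidate_files` are hypothetical, made up for the example, and are not taken from the repository.

```python
from fnmatch import fnmatch

# Split patterns as listed in the default configuration.
SPLIT_PATTERNS = {"train": "data/train-*", "test": "data/test-*"}

# Hypothetical file names, for illustration only.
candidate_files = [
    "data/train-00000-of-00002.parquet",
    "data/train-00001-of-00002.parquet",
    "data/test-00000-of-00001.parquet",
    "README.md",
]


def files_for_split(split, files):
    """Return the files whose path matches the split's glob pattern."""
    pattern = SPLIT_PATTERNS[split]
    return [f for f in files if fnmatch(f, pattern)]


print(files_for_split("train", candidate_files))
print(files_for_split("test", candidate_files))
```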
Dataset Information
- Features:
  - accuracy: int64
  - completeness: float64
  - fluency: int64
  - prosodic: int64
  - text: string
  - total: int64
  - words:
    - accuracy: int64
    - phones: sequence of string
    - phones-accuracy: sequence of float
    - stress: int64
    - text: string
    - total: int64
    - mispronunciations:
      - canonical-phone: string
      - index: int64
      - pronounced-phone: string
  - speaker: string
  - gender: string
  - age: int64
  - audio: audio
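For readers who want type hints when working with records, the feature list can be mirrored as `TypedDict` definitions. This is only a sketch: the type names (`Mispronunciation`, `Word`, `Utterance`) and the helper `missing_keys` are hypothetical, and the nesting of `mispronunciations` inside `words` is an assumption based on the order of the listing.

```python
from typing import List, TypedDict

# Keys such as "phones-accuracy" are not valid Python identifiers,
# so the functional TypedDict syntax is used throughout.
Mispronunciation = TypedDict("Mispronunciation", {
    "canonical-phone": str,
    "index": int,
    "pronounced-phone": str,
})

Word = TypedDict("Word", {
    "accuracy": int,
    "phones": List[str],
    "phones-accuracy": List[float],
    "stress": int,
    "text": str,
    "total": int,
    "mispronunciations": List[Mispronunciation],
})

# The "audio" field is left out of this sketch.
Utterance = TypedDict("Utterance", {
    "accuracy": int,
    "completeness": float,
    "fluency": int,
    "prosodic": int,
    "text": str,
    "total": int,
    "words": List[Word],
    "speaker": str,
    "gender": str,
    "age": int,
})


def missing_keys(record: dict) -> List[str]:
    """List the Utterance keys that are absent from a record."""
    return sorted(k for k in Utterance.__required_keys__ if k not in record)
```

`TypedDict` does not validate anything at runtime; `missing_keys` is a small runtime check added for convenience.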
- Splits:
  - Train:
    - Bytes: 291617098
    - Examples: 2500
  - Test:
    - Bytes: 289610485
    - Examples: 2500
- Download size: 611820406 bytes
- Dataset size: 581227583 bytes
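The byte counts are easier to read in larger units. The short sketch below converts them to MiB and checks that the two split sizes add up to the listed dataset size. The helper name `to_mib` is hypothetical.

```python
# Byte counts as listed in the split information.
SPLIT_BYTES = {"train": 291617098, "test": 289610485}
DATASET_SIZE = 581227583
DOWNLOAD_SIZE = 611820406


def to_mib(n_bytes):
    """Convert a byte count to mebibytes (1 MiB = 1024 * 1024 bytes)."""
    return n_bytes / (1024 * 1024)


# The two splits together account for the whole dataset size.
total = sum(SPLIT_BYTES.values())
print(total == DATASET_SIZE)  # True

for name, size in SPLIT_BYTES.items():
    print(f"{name}: {to_mib(size):.1f} MiB")
print(f"download: {to_mib(DOWNLOAD_SIZE):.1f} MiB")
```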
Scoring Criteria
- Sentence level:
  - Accuracy: 0-10
  - Completeness: 0.0-1.0
  - Fluency: 0-10
  - Prosody: 0-10
- Word level:
  - Accuracy: 0-10
  - Stress: {5, 10}
- Phoneme level:
  - Accuracy: 0-2
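The ranges above can be turned into a simple sanity check on a record. This is a minimal sketch, assuming records are plain dictionaries with the field names from the feature list; the function name `check_scores` is hypothetical. Completeness is left unchecked here because the sample record on this page reports it as 10.0.

```python
def check_scores(record):
    """Return messages for scores that fall outside the documented ranges."""
    problems = []

    # Sentence level: accuracy, fluency and prosodic are scored 0-10.
    for key in ("accuracy", "fluency", "prosodic"):
        if not 0 <= record[key] <= 10:
            problems.append(f"sentence {key}={record[key]} outside 0-10")

    for word in record["words"]:
        # Word level: accuracy is 0-10, stress is either 5 or 10.
        if not 0 <= word["accuracy"] <= 10:
            problems.append(f"{word['text']}: accuracy={word['accuracy']} outside 0-10")
        if word["stress"] not in (5, 10):
            problems.append(f"{word['text']}: stress={word['stress']} not in {{5, 10}}")

        # Phoneme level: accuracy is 0-2.
        for phone, score in zip(word["phones"], word["phones-accuracy"]):
            if not 0 <= score <= 2:
                problems.append(f"{word['text']}/{phone}: score={score} outside 0-2")

    return problems
```

An empty list means every checked score is within range.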
Example
```python
from datasets import load_dataset

# Load the test split from the Hugging Face Hub
test_set = load_dataset("mispeech/speechocean762", split="test")

print(len(test_set))
# 2500

print(next(iter(test_set)))
# {'accuracy': 9, 'completeness': 10.0, 'fluency': 9, 'prosodic': 9,
#  'text': 'MARK IS GOING TO SEE ELEPHANT', 'total': 9,
#  'words': [
#    {'accuracy': 10, 'phones': ['M', 'AA0', 'R', 'K'], 'phones-accuracy': [2.0, 2.0, 1.8, 2.0], 'stress': 10, 'text': 'MARK', 'total': 10, 'mispronunciations': []},
#    {'accuracy': 10, 'phones': ['IH0', 'Z'], 'phones-accuracy': [2.0, 1.8], 'stress': 10, 'text': 'IS', 'total': 10, 'mispronunciations': []},
#    {'accuracy': 10, 'phones': ['G', 'OW0', 'IH0', 'NG'], 'phones-accuracy': [2.0, 2.0, 2.0, 2.0], 'stress': 10, 'text': 'GOING', 'total': 10, 'mispronunciations': []},
#    {'accuracy': 10, 'phones': ['T', 'UW0'], 'phones-accuracy': [2.0, 2.0], 'stress': 10, 'text': 'TO', 'total': 10, 'mispronunciations': []},
#    {'accuracy': 10, 'phones': ['S', 'IY0'], 'phones-accuracy': [2.0, 2.0], 'stress': 10, 'text': 'SEE', 'total': 10, 'mispronunciations': []},
#    {'accuracy': 10, 'phones': ['EH1', 'L', 'IH0', 'F', 'AH0', 'N', 'T'], 'phones-accuracy': [2.0, 2.0, 2.0, 2.0, 2.0, 2.0, 2.0], 'stress': 10, 'text': 'ELEPHANT', 'total': 10, 'mispronunciations': []}],
#  'speaker': '0003', 'gender': 'm', 'age': 6,
#  'audio': {'path': '000030012.wav',
#            'array': array([-0.00119019, -0.00500488, -0.00283813, ..., 0.00274658,
#                            0.        , 0.00125122]),
#            'sampling_rate': 16000}}
```
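Once a record is loaded, the phone-level scores can be scanned for anything below the maximum. The sketch below works on a plain dictionary so it runs without downloading the dataset; `sample` copies the first two words of the record shown above, and the function name `imperfect_phones` is hypothetical.

```python
def imperfect_phones(record, full_score=2.0):
    """Yield (word, phone, score) for phones scored below full_score."""
    for word in record["words"]:
        for phone, score in zip(word["phones"], word["phones-accuracy"]):
            if score < full_score:
                yield word["text"], phone, score


# First two words of the sample record shown above.
sample = {
    "words": [
        {"text": "MARK", "phones": ["M", "AA0", "R", "K"],
         "phones-accuracy": [2.0, 2.0, 1.8, 2.0]},
        {"text": "IS", "phones": ["IH0", "Z"],
         "phones-accuracy": [2.0, 1.8]},
    ]
}

print(list(imperfect_phones(sample)))
# [('MARK', 'R', 1.8), ('IS', 'Z', 1.8)]
```

The same function should work on a record returned by `next(iter(test_set))`, assuming the record has the structure shown in the example output.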




