asahi417/seamless-align-enA-frA.speaker-embedding.metavoice
收藏Hugging Face2024-06-22 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/asahi417/seamless-align-enA-frA.speaker-embedding.metavoice
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个子集(例如subset_1到subset_137),每个子集包含行号、英语和法语音频的ID、两种语言的LASER评分以及英语和法语音频的说话者嵌入特征。数据集被划分为训练集,每个子集都有指定的字节数和示例数。每个子集的下载大小和数据集大小也提供了详细信息。
The dataset contains multiple subsets (e.g., subset_1 to subset_137), each with features such as line numbers, IDs for English and French audio, LASER scores for both languages, and speaker embeddings for both English and French audio. The dataset is split into training sets, with each subset having a specified number of bytes and examples. The download and dataset sizes are also provided for each subset.
提供机构:
asahi417
原始信息汇总
数据集概述
子集信息
| 子集名称 | 特征数量 | 主要特征 | 数据类型 |
|---|---|---|---|
| subset_1 | 8 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_10 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_100 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_101 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_102 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_103 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_104 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_105 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_11 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_12 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_13 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_14 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_15 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_16 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_17 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_18 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_19 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_2 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_20 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_21 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_22 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_23 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_24 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_25 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_26 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_27 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_28 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_29 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_3 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_30 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_300 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_301 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_302 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_303 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_304 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_305 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_306 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_307 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_308 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_309 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_310 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
| subset_311 | 7 | line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding | int64, string, float64, float32 |
数据集大小
| 子集名称 | 训练集大小(字节) | 训练集样本数 | 下载大小(字节) |
|---|---|---|---|
| subset_1 | 4992026 | 2344 | 5597556 |
| subset_10 | 4974906 | 2336 | 5581082 |
| subset_100 | 4953587 | 2326 | 5430238 |
| subset_101 | 4972772 | 2335 | 5438387 |
| subset_102 | 4911033 | 2306 | 5411902 |
| subset_103 | 4972834 | 2335 | 5481119 |
| subset_104 | 4962121 | 2330 | 5428093 |
| subset_105 | 4964328 | 2331 | 5451763 |
| subset_11 | 4930221 | 2315 | 5539349 |
| subset_12 | 5002632 | 2349 | 5621598 |
| subset_13 | 4985513 | 2341 | 5586408 |
| subset_14 | 4979157 | 2338 | 5601499 |
| subset_15 | 5032397 | 2363 | 5623533 |
| subset_16 | 5000461 | 2348 | 5593455 |
| subset_17 | 4957852 | 2328 | 5549825 |
| subset_18 | 5004736 | 2350 | 5584282 |
| subset_19 | 5000474 | 2348 | 5584734 |
| subset_2 | 5034570 | 2364 | 5633409 |
| subset_20 | 4996211 | 2346 | 5560557 |
| subset_21 | 4972789 | 2335 | 5575159 |
| subset_22 | 4964226 | 2331 | 5564136 |
| subset_23 | 4976994 | 2337 | 5524534 |
| subset_24 | 4968496 | 2333 | 5575774 |
| subset_25 | 4938687 | 2319 | 5526935 |
| subset_26 | 4979179 | 2338 | 5572997 |
| subset_27 | 4938698 | 2319 | 5516623 |
| subset_28 | 4970619 | 2334 | 5543926 |
| subset_29 | 4646911 | 2182 | 5185753 |
| subset_3 | 5013275 | 2354 | 5623690 |
| subset_30 | 4363662 | 2049 | 4872857 |
| subset_300 | 4497853 | 2112 | 4835593 |



