
asahi417/seamless-align-enA-frA.speaker-embedding.metavoice

Source: Hugging Face. Updated 2024-06-22; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/asahi417/seamless-align-enA-frA.speaker-embedding.metavoice
Description:
The dataset is organized into multiple subsets (e.g., subset_1 to subset_137). Each row of a subset records a line number, the IDs of the English and French audio segments, a LASER score for each language, and a speaker embedding for each of the English and French audio segments. Each subset has a single train split, and its size in bytes, number of examples, and download size are listed per subset.
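
Since the data is organized into named subsets with a train split each, a single subset could be fetched roughly as follows. This is a minimal sketch, assuming the Hugging Face `datasets` package is installed and that each subset is exposed as a named configuration of the repository; `subset_name` and `load_subset` are hypothetical helpers, not part of the dataset.

```python
# Sketch: address one subset of the dataset by its name.
REPO_ID = "asahi417/seamless-align-enA-frA.speaker-embedding.metavoice"


def subset_name(index: int) -> str:
    """Build a subset name such as 'subset_1' (hypothetical helper)."""
    if index < 1:
        raise ValueError("subset indices are assumed to start at 1")
    return f"subset_{index}"


def load_subset(index: int):
    """Fetch the train split of one subset.

    Assumes the `datasets` package is installed, network access is
    available, and each subset is exposed as a named configuration.
    """
    from datasets import load_dataset  # lazy import: only needed when downloading

    return load_dataset(REPO_ID, subset_name(index), split="train")


# Example (not run here because it downloads data):
# ds = load_subset(1)
# print(ds.column_names)
```

Loading one subset at a time keeps each download to a few megabytes, according to the sizes listed for the subsets.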
Provided by:
asahi417
Summary of original information

Dataset overview

Subset information

子集名称 特征数量 主要特征 数据类型
subset_1 8 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_10 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_100 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_101 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_102 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_103 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_104 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_105 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_11 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_12 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_13 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_14 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_15 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_16 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_17 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_18 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_19 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_2 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_20 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_21 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_22 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_23 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_24 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_25 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_26 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_27 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_28 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_29 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_3 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_30 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_300 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_301 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_302 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_303 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_304 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_305 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_306 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_307 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_308 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_309 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, enA.audio.speaker_embedding, frA.audio.speaker_embedding int64, string, float64, float32
subset_310 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
subset_311 7 line_no, enA.id, enA.laser_score, frA.id, frA.laser_score, frA.audio.speaker_embedding, enA.audio.speaker_embedding int64, string, float64, float32
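
The feature list above names the two speaker-embedding columns in a different order from one subset to the next, so reading columns by name is safer than relying on position. The sketch below illustrates that, along with a simple filter on the two LASER scores. The rows are synthetic placeholders (made-up IDs, scores, and 3-value embeddings), the threshold is arbitrary, and `embedding_pair` and `filter_by_laser` are hypothetical helpers.

```python
from typing import Any, Dict, List, Sequence, Tuple

EN_EMB = "enA.audio.speaker_embedding"
FR_EMB = "frA.audio.speaker_embedding"

Row = Dict[str, Any]


def embedding_pair(row: Row) -> Tuple[Sequence[float], Sequence[float]]:
    """Return (English, French) embeddings by column name, ignoring column order."""
    return row[EN_EMB], row[FR_EMB]


def filter_by_laser(rows: List[Row], min_score: float) -> List[Row]:
    """Keep rows whose English and French LASER scores both reach min_score."""
    return [
        row
        for row in rows
        if row["enA.laser_score"] >= min_score
        and row["frA.laser_score"] >= min_score
    ]


# Synthetic rows for illustration only; real rows come from the dataset.
# The second row deliberately lists the French embedding first.
rows: List[Row] = [
    {
        "line_no": 1,
        "enA.id": "en-example-1",
        "enA.laser_score": 1.10,
        "frA.id": "fr-example-1",
        "frA.laser_score": 1.10,
        EN_EMB: [0.1, 0.2, 0.3],
        FR_EMB: [0.3, 0.2, 0.1],
    },
    {
        "line_no": 2,
        "enA.id": "en-example-2",
        "enA.laser_score": 1.02,
        "frA.id": "fr-example-2",
        "frA.laser_score": 1.02,
        FR_EMB: [0.6, 0.5, 0.4],
        EN_EMB: [0.4, 0.5, 0.6],
    },
]

kept = filter_by_laser(rows, min_score=1.05)  # arbitrary illustrative threshold
en_vec, fr_vec = embedding_pair(rows[1])
```

With name-based access, the same code works whichever embedding column a subset happens to list first.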

Dataset size

Subset name | Train split size (bytes) | Train split examples | Download size (bytes)
subset_1 | 4992026 | 2344 | 5597556
subset_10 | 4974906 | 2336 | 5581082
subset_100 | 4953587 | 2326 | 5430238
subset_101 | 4972772 | 2335 | 5438387
subset_102 | 4911033 | 2306 | 5411902
subset_103 | 4972834 | 2335 | 5481119
subset_104 | 4962121 | 2330 | 5428093
subset_105 | 4964328 | 2331 | 5451763
subset_11 | 4930221 | 2315 | 5539349
subset_12 | 5002632 | 2349 | 5621598
subset_13 | 4985513 | 2341 | 5586408
subset_14 | 4979157 | 2338 | 5601499
subset_15 | 5032397 | 2363 | 5623533
subset_16 | 5000461 | 2348 | 5593455
subset_17 | 4957852 | 2328 | 5549825
subset_18 | 5004736 | 2350 | 5584282
subset_19 | 5000474 | 2348 | 5584734
subset_2 | 5034570 | 2364 | 5633409
subset_20 | 4996211 | 2346 | 5560557
subset_21 | 4972789 | 2335 | 5575159
subset_22 | 4964226 | 2331 | 5564136
subset_23 | 4976994 | 2337 | 5524534
subset_24 | 4968496 | 2333 | 5575774
subset_25 | 4938687 | 2319 | 5526935
subset_26 | 4979179 | 2338 | 5572997
subset_27 | 4938698 | 2319 | 5516623
subset_28 | 4970619 | 2334 | 5543926
subset_29 | 4646911 | 2182 | 5185753
subset_3 | 5013275 | 2354 | 5623690
subset_30 | 4363662 | 2049 | 4872857
subset_300 | 4497853 | 2112 | 4835593
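
The size figures can be cross-checked with simple arithmetic: dividing the train split size by the number of examples gives the average bytes per example. The sketch below does this for three rows copied from the size table above; the helper names are hypothetical.

```python
from typing import Dict, Tuple

# (train split bytes, train examples, download bytes), copied from the size table.
SIZES: Dict[str, Tuple[int, int, int]] = {
    "subset_1": (4992026, 2344, 5597556),
    "subset_29": (4646911, 2182, 5185753),
    "subset_30": (4363662, 2049, 4872857),
}


def bytes_per_example(train_bytes: int, num_examples: int) -> float:
    """Average stored size of one example in the train split."""
    return train_bytes / num_examples


def download_ratio(train_bytes: int, download_bytes: int) -> float:
    """Download size relative to the train split size."""
    return download_bytes / train_bytes


per_example = {
    name: bytes_per_example(train_bytes, num_examples)
    for name, (train_bytes, num_examples, _) in SIZES.items()
}
ratios = {
    name: download_ratio(train_bytes, download_bytes)
    for name, (train_bytes, _, download_bytes) in SIZES.items()
}

for name in SIZES:
    print(f"{name}: {per_example[name]:.1f} bytes/example, "
          f"download/train ratio {ratios[name]:.3f}")
```

For these three rows the average is roughly 2,130 bytes per example even though the example counts differ, which suggests a near-constant row size across subsets.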