asahi417/seamless-align-enA-jaA.speaker-embedding.metavoice
收藏Hugging Face2024-06-01 更新2024-06-12 收录
下载链接:
https://hf-mirror.com/datasets/asahi417/seamless-align-enA-jaA.speaker-embedding.metavoice
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: subset_1
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4414641
num_examples: 2073
download_size: 4997658
dataset_size: 4414641
- config_name: subset_10
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4176233
num_examples: 1961
download_size: 4780568
dataset_size: 4176233
- config_name: subset_100
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3741763
num_examples: 1757
download_size: 4353207
dataset_size: 3741763
- config_name: subset_101
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3988818
num_examples: 1873
download_size: 4643706
dataset_size: 3988818
- config_name: subset_102
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3978184
num_examples: 1868
download_size: 4648714
dataset_size: 3978184
- config_name: subset_103
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4001574
num_examples: 1879
download_size: 4658618
dataset_size: 4001574
- config_name: subset_104
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4048413
num_examples: 1901
download_size: 4729608
dataset_size: 4048413
- config_name: subset_105
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3993076
num_examples: 1875
download_size: 4668731
dataset_size: 3993076
- config_name: subset_106
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4003718
num_examples: 1880
download_size: 4662189
dataset_size: 4003718
- config_name: subset_107
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3948365
num_examples: 1854
download_size: 4595782
dataset_size: 3948365
- config_name: subset_108
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3905728
num_examples: 1834
download_size: 4559863
dataset_size: 3905728
- config_name: subset_109
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3769470
num_examples: 1770
download_size: 4394031
dataset_size: 3769470
- config_name: subset_11
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3788623
num_examples: 1779
download_size: 4331531
dataset_size: 3788623
- config_name: subset_110
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4063355
num_examples: 1908
download_size: 4744550
dataset_size: 4063355
- config_name: subset_111
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3997321
num_examples: 1877
download_size: 4671267
dataset_size: 3997321
- config_name: subset_112
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4097438
num_examples: 1924
download_size: 4779454
dataset_size: 4097438
- config_name: subset_113
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4110163
num_examples: 1930
download_size: 4779350
dataset_size: 4110163
- config_name: subset_114
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4131496
num_examples: 1940
download_size: 4826183
dataset_size: 4131496
- config_name: subset_115
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4050591
num_examples: 1902
download_size: 4724188
dataset_size: 4050591
- config_name: subset_116
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4067630
num_examples: 1910
download_size: 4755490
dataset_size: 4067630
- config_name: subset_117
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4048444
num_examples: 1901
download_size: 4728690
dataset_size: 4048444
- config_name: subset_118
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4069725
num_examples: 1911
download_size: 4761998
dataset_size: 4069725
- config_name: subset_119
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3975953
num_examples: 1867
download_size: 4651589
dataset_size: 3975953
- config_name: subset_12
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4080379
num_examples: 1916
download_size: 4694635
dataset_size: 4080379
- config_name: subset_120
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3777961
num_examples: 1774
download_size: 4401239
dataset_size: 3777961
- config_name: subset_121
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4035732
num_examples: 1895
download_size: 4716373
dataset_size: 4035732
- config_name: subset_122
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3941971
num_examples: 1851
download_size: 4602567
dataset_size: 3941971
- config_name: subset_123
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4095279
num_examples: 1923
download_size: 4790695
dataset_size: 4095279
- config_name: subset_124
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4016507
num_examples: 1886
download_size: 4692804
dataset_size: 4016507
- config_name: subset_125
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4105935
num_examples: 1928
download_size: 4791353
dataset_size: 4105935
- config_name: subset_126
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4052679
num_examples: 1903
download_size: 4716569
dataset_size: 4052679
- config_name: subset_127
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4050563
num_examples: 1902
download_size: 4730611
dataset_size: 4050563
- config_name: subset_128
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4025025
num_examples: 1890
download_size: 4692176
dataset_size: 4025025
- config_name: subset_129
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3731131
num_examples: 1752
download_size: 4352773
dataset_size: 3731131
- config_name: subset_13
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3767337
num_examples: 1769
download_size: 4322880
dataset_size: 3767337
- config_name: subset_130
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3897203
num_examples: 1830
download_size: 4564863
dataset_size: 3897203
- config_name: subset_131
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4007978
num_examples: 1882
download_size: 4696605
dataset_size: 4007978
- config_name: subset_132
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4084648
num_examples: 1918
download_size: 4763518
dataset_size: 4084648
- config_name: subset_133
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4016485
num_examples: 1886
download_size: 4676652
dataset_size: 4016485
- config_name: subset_134
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4071885
num_examples: 1912
download_size: 4752866
dataset_size: 4071885
- config_name: subset_135
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4020745
num_examples: 1888
download_size: 4709130
dataset_size: 4020745
- config_name: subset_136
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3993065
num_examples: 1875
download_size: 4672129
dataset_size: 3993065
- config_name: subset_137
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3973924
num_examples: 1866
download_size: 4653207
dataset_size: 3973924
- config_name: subset_138
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3967529
num_examples: 1863
download_size: 4620592
dataset_size: 3967529
- config_name: subset_139
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3958984
num_examples: 1859
download_size: 4625708
dataset_size: 3958984
- config_name: subset_14
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3692769
num_examples: 1734
download_size: 4240607
dataset_size: 3692769
- config_name: subset_140
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3760940
num_examples: 1766
download_size: 4397910
dataset_size: 3760940
- config_name: subset_141
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3971737
num_examples: 1865
download_size: 4652018
dataset_size: 3971737
- config_name: subset_142
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4031442
num_examples: 1893
download_size: 4701220
dataset_size: 4031442
- config_name: subset_143
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4033571
num_examples: 1894
download_size: 4710910
dataset_size: 4033571
- config_name: subset_144
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 2941015
num_examples: 1381
download_size: 3416573
dataset_size: 2941015
- config_name: subset_15
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4076102
num_examples: 1914
download_size: 4692665
dataset_size: 4076102
- config_name: subset_16
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3965362
num_examples: 1862
download_size: 4540435
dataset_size: 3965362
- config_name: subset_17
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3993087
num_examples: 1875
download_size: 4605949
dataset_size: 3993087
- config_name: subset_18
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4125114
num_examples: 1937
download_size: 4764529
dataset_size: 4125114
- config_name: subset_19
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4082494
num_examples: 1917
download_size: 4678453
dataset_size: 4082494
- config_name: subset_2
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4107973
num_examples: 1929
download_size: 4680166
dataset_size: 4107973
- config_name: subset_20
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3997350
num_examples: 1877
download_size: 4605491
dataset_size: 3997350
- config_name: subset_21
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3750299
num_examples: 1761
download_size: 4333247
dataset_size: 3750299
- config_name: subset_22
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3939781
num_examples: 1850
download_size: 4531341
dataset_size: 3939781
- config_name: subset_23
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3812065
num_examples: 1790
download_size: 4380237
dataset_size: 3812065
- config_name: subset_24
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3743919
num_examples: 1758
download_size: 4317489
dataset_size: 3743919
- config_name: subset_25
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4042072
num_examples: 1898
download_size: 4689538
dataset_size: 4042072
- config_name: subset_26
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4137905
num_examples: 1943
download_size: 4770055
dataset_size: 4137905
- config_name: subset_27
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4052735
num_examples: 1903
download_size: 4673663
dataset_size: 4052735
- config_name: subset_28
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4071935
num_examples: 1912
download_size: 4700519
dataset_size: 4071935
- config_name: subset_29
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4142159
num_examples: 1945
download_size: 4764347
dataset_size: 4142159
- config_name: subset_3
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4044108
num_examples: 1899
download_size: 4594191
dataset_size: 4044108
- config_name: subset_30
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4050560
num_examples: 1902
download_size: 4684849
dataset_size: 4050560
- config_name: subset_31
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3844042
num_examples: 1805
download_size: 4455029
dataset_size: 3844042
- config_name: subset_32
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3826954
num_examples: 1797
download_size: 4429378
dataset_size: 3826954
- config_name: subset_33
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3741758
num_examples: 1757
download_size: 4327087
dataset_size: 3741758
- config_name: subset_34
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4031431
num_examples: 1893
download_size: 4659808
dataset_size: 4031431
- config_name: subset_35
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4105946
num_examples: 1928
download_size: 4739698
dataset_size: 4105946
- config_name: subset_36
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3967527
num_examples: 1863
download_size: 4594733
dataset_size: 3967527
- config_name: subset_37
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3950477
num_examples: 1855
download_size: 4588401
dataset_size: 3950477
- config_name: subset_38
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4025054
num_examples: 1890
download_size: 4655391
dataset_size: 4025054
- config_name: subset_39
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4044194
num_examples: 1899
download_size: 4684714
dataset_size: 4044194
- config_name: subset_4
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3907822
num_examples: 1835
download_size: 4441418
dataset_size: 3907822
- config_name: subset_40
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4112388
num_examples: 1931
download_size: 4757715
dataset_size: 4112388
- config_name: subset_41
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3799301
num_examples: 1784
download_size: 4421538
dataset_size: 3799301
- config_name: subset_42
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3826964
num_examples: 1797
download_size: 4440757
dataset_size: 3826964
- config_name: subset_43
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3741772
num_examples: 1757
download_size: 4335182
dataset_size: 3741772
- config_name: subset_44
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3899384
num_examples: 1831
download_size: 4523231
dataset_size: 3899384
- config_name: subset_45
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4027158
num_examples: 1891
download_size: 4659148
dataset_size: 4027158
- config_name: subset_46
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4039928
num_examples: 1897
download_size: 4688481
dataset_size: 4039928
- config_name: subset_47
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4039914
num_examples: 1897
download_size: 4699145
dataset_size: 4039914
- config_name: subset_48
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4050580
num_examples: 1902
download_size: 4694634
dataset_size: 4050580
- config_name: subset_49
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3993047
num_examples: 1875
download_size: 4635678
dataset_size: 3993047
- config_name: subset_5
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4231555
num_examples: 1987
download_size: 4834636
dataset_size: 4231555
- config_name: subset_50
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4154944
num_examples: 1951
download_size: 4811597
dataset_size: 4154944
- config_name: subset_51
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3731105
num_examples: 1752
download_size: 4355756
dataset_size: 3731105
- config_name: subset_52
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3790793
num_examples: 1780
download_size: 4422644
dataset_size: 3790793
- config_name: subset_53
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3931337
num_examples: 1846
download_size: 4572572
dataset_size: 3931337
- config_name: subset_54
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3669406
num_examples: 1723
download_size: 4252833
dataset_size: 3669406
- config_name: subset_55
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3973897
num_examples: 1866
download_size: 4616672
dataset_size: 3973897
- config_name: subset_56
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4031395
num_examples: 1893
download_size: 4680494
dataset_size: 4031395
- config_name: subset_57
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4097435
num_examples: 1924
download_size: 4753526
dataset_size: 4097435
- config_name: subset_58
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4005821
num_examples: 1881
download_size: 4587838
dataset_size: 4005821
- config_name: subset_59
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4018666
num_examples: 1887
download_size: 4663268
dataset_size: 4018666
- config_name: subset_6
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3854606
num_examples: 1810
download_size: 4398834
dataset_size: 3854606
- config_name: subset_60
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4065509
num_examples: 1909
download_size: 4726062
dataset_size: 4065509
- config_name: subset_61
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3679996
num_examples: 1728
download_size: 4287182
dataset_size: 3679996
- config_name: subset_62
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3805698
num_examples: 1787
download_size: 4432790
dataset_size: 3805698
- config_name: subset_63
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3812072
num_examples: 1790
download_size: 4446277
dataset_size: 3812072
- config_name: subset_64
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3858886
num_examples: 1812
download_size: 4498114
dataset_size: 3858886
- config_name: subset_65
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3997364
num_examples: 1877
download_size: 4653377
dataset_size: 3997364
- config_name: subset_66
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4025039
num_examples: 1890
download_size: 4680176
dataset_size: 4025039
- config_name: subset_67
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3988835
num_examples: 1873
download_size: 4652008
dataset_size: 3988835
- config_name: subset_68
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4010112
num_examples: 1883
download_size: 4675552
dataset_size: 4010112
- config_name: subset_69
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4080334
num_examples: 1916
download_size: 4744797
dataset_size: 4080334
- config_name: subset_7
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3901450
num_examples: 1832
download_size: 4458208
dataset_size: 3901450
- config_name: subset_70
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4052685
num_examples: 1903
download_size: 4725605
dataset_size: 4052685
- config_name: subset_71
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3697075
num_examples: 1736
download_size: 4308492
dataset_size: 3697075
- config_name: subset_72
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4018629
num_examples: 1887
download_size: 4700115
dataset_size: 4018629
- config_name: subset_73
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3697041
num_examples: 1736
download_size: 4304109
dataset_size: 3697041
- config_name: subset_74
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3895138
num_examples: 1829
download_size: 4530128
dataset_size: 3895138
- config_name: subset_75
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3965417
num_examples: 1862
download_size: 4616631
dataset_size: 3965417
- config_name: subset_76
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4076142
num_examples: 1914
download_size: 4752231
dataset_size: 4076142
- config_name: subset_77
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3990948
num_examples: 1874
download_size: 4657908
dataset_size: 3990948
- config_name: subset_78
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3984529
num_examples: 1871
download_size: 4632478
dataset_size: 3984529
- config_name: subset_79
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4027216
num_examples: 1891
download_size: 4698453
dataset_size: 4027216
- config_name: subset_8
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4278428
num_examples: 2009
download_size: 4901437
dataset_size: 4278428
- config_name: subset_80
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4014359
num_examples: 1885
download_size: 4664208
dataset_size: 4014359
- config_name: subset_81
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4074036
num_examples: 1913
download_size: 4744040
dataset_size: 4074036
- config_name: subset_82
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4067635
num_examples: 1910
download_size: 4748571
dataset_size: 4067635
- config_name: subset_83
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4018654
num_examples: 1887
download_size: 4684173
dataset_size: 4018654
- config_name: subset_84
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3976074
num_examples: 1867
download_size: 4618829
dataset_size: 3976074
- config_name: subset_85
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4005893
num_examples: 1881
download_size: 4656051
dataset_size: 4005893
- config_name: subset_86
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3965411
num_examples: 1862
download_size: 4623726
dataset_size: 3965411
- config_name: subset_87
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4039932
num_examples: 1897
download_size: 4704102
dataset_size: 4039932
- config_name: subset_88
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4046335
num_examples: 1900
download_size: 4722589
dataset_size: 4046335
- config_name: subset_89
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4016605
num_examples: 1886
download_size: 4681402
dataset_size: 4016605
- config_name: subset_9
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4210283
num_examples: 1977
download_size: 4816280
dataset_size: 4210283
- config_name: subset_90
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4074056
num_examples: 1913
download_size: 4741288
dataset_size: 4074056
- config_name: subset_91
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4074007
num_examples: 1913
download_size: 4748690
dataset_size: 4074007
- config_name: subset_92
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4016498
num_examples: 1886
download_size: 4660997
dataset_size: 4016498
- config_name: subset_93
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3993113
num_examples: 1875
download_size: 4657659
dataset_size: 3993113
- config_name: subset_94
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4046350
num_examples: 1900
download_size: 4719708
dataset_size: 4046350
- config_name: subset_95
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 3976003
num_examples: 1867
download_size: 4630149
dataset_size: 3976003
- config_name: subset_96
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4046345
num_examples: 1900
download_size: 4719169
dataset_size: 4046345
- config_name: subset_97
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4044193
num_examples: 1899
download_size: 4725066
dataset_size: 4044193
- config_name: subset_98
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4054813
num_examples: 1904
download_size: 4726689
dataset_size: 4054813
- config_name: subset_99
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
splits:
- name: train
num_bytes: 4048401
num_examples: 1901
download_size: 4729981
dataset_size: 4048401
configs:
- config_name: subset_1
data_files:
- split: train
path: subset_1/train-*
- config_name: subset_10
data_files:
- split: train
path: subset_10/train-*
- config_name: subset_100
data_files:
- split: train
path: subset_100/train-*
- config_name: subset_101
data_files:
- split: train
path: subset_101/train-*
- config_name: subset_102
data_files:
- split: train
path: subset_102/train-*
- config_name: subset_103
data_files:
- split: train
path: subset_103/train-*
- config_name: subset_104
data_files:
- split: train
path: subset_104/train-*
- config_name: subset_105
data_files:
- split: train
path: subset_105/train-*
- config_name: subset_106
data_files:
- split: train
path: subset_106/train-*
- config_name: subset_107
data_files:
- split: train
path: subset_107/train-*
- config_name: subset_108
data_files:
- split: train
path: subset_108/train-*
- config_name: subset_109
data_files:
- split: train
path: subset_109/train-*
- config_name: subset_11
data_files:
- split: train
path: subset_11/train-*
- config_name: subset_110
data_files:
- split: train
path: subset_110/train-*
- config_name: subset_111
data_files:
- split: train
path: subset_111/train-*
- config_name: subset_112
data_files:
- split: train
path: subset_112/train-*
- config_name: subset_113
data_files:
- split: train
path: subset_113/train-*
- config_name: subset_114
data_files:
- split: train
path: subset_114/train-*
- config_name: subset_115
data_files:
- split: train
path: subset_115/train-*
- config_name: subset_116
data_files:
- split: train
path: subset_116/train-*
- config_name: subset_117
data_files:
- split: train
path: subset_117/train-*
- config_name: subset_118
data_files:
- split: train
path: subset_118/train-*
- config_name: subset_119
data_files:
- split: train
path: subset_119/train-*
- config_name: subset_12
data_files:
- split: train
path: subset_12/train-*
- config_name: subset_120
data_files:
- split: train
path: subset_120/train-*
- config_name: subset_121
data_files:
- split: train
path: subset_121/train-*
- config_name: subset_122
data_files:
- split: train
path: subset_122/train-*
- config_name: subset_123
data_files:
- split: train
path: subset_123/train-*
- config_name: subset_124
data_files:
- split: train
path: subset_124/train-*
- config_name: subset_125
data_files:
- split: train
path: subset_125/train-*
- config_name: subset_126
data_files:
- split: train
path: subset_126/train-*
- config_name: subset_127
data_files:
- split: train
path: subset_127/train-*
- config_name: subset_128
data_files:
- split: train
path: subset_128/train-*
- config_name: subset_129
data_files:
- split: train
path: subset_129/train-*
- config_name: subset_13
data_files:
- split: train
path: subset_13/train-*
- config_name: subset_130
data_files:
- split: train
path: subset_130/train-*
- config_name: subset_131
data_files:
- split: train
path: subset_131/train-*
- config_name: subset_132
data_files:
- split: train
path: subset_132/train-*
- config_name: subset_133
data_files:
- split: train
path: subset_133/train-*
- config_name: subset_134
data_files:
- split: train
path: subset_134/train-*
- config_name: subset_135
data_files:
- split: train
path: subset_135/train-*
- config_name: subset_136
data_files:
- split: train
path: subset_136/train-*
- config_name: subset_137
data_files:
- split: train
path: subset_137/train-*
- config_name: subset_138
data_files:
- split: train
path: subset_138/train-*
- config_name: subset_139
data_files:
- split: train
path: subset_139/train-*
- config_name: subset_14
data_files:
- split: train
path: subset_14/train-*
- config_name: subset_140
data_files:
- split: train
path: subset_140/train-*
- config_name: subset_141
data_files:
- split: train
path: subset_141/train-*
- config_name: subset_142
data_files:
- split: train
path: subset_142/train-*
- config_name: subset_143
data_files:
- split: train
path: subset_143/train-*
- config_name: subset_144
data_files:
- split: train
path: subset_144/train-*
- config_name: subset_15
data_files:
- split: train
path: subset_15/train-*
- config_name: subset_16
data_files:
- split: train
path: subset_16/train-*
- config_name: subset_17
data_files:
- split: train
path: subset_17/train-*
- config_name: subset_18
data_files:
- split: train
path: subset_18/train-*
- config_name: subset_19
data_files:
- split: train
path: subset_19/train-*
- config_name: subset_2
data_files:
- split: train
path: subset_2/train-*
- config_name: subset_20
data_files:
- split: train
path: subset_20/train-*
- config_name: subset_21
data_files:
- split: train
path: subset_21/train-*
- config_name: subset_22
data_files:
- split: train
path: subset_22/train-*
- config_name: subset_23
data_files:
- split: train
path: subset_23/train-*
- config_name: subset_24
data_files:
- split: train
path: subset_24/train-*
- config_name: subset_25
data_files:
- split: train
path: subset_25/train-*
- config_name: subset_26
data_files:
- split: train
path: subset_26/train-*
- config_name: subset_27
data_files:
- split: train
path: subset_27/train-*
- config_name: subset_28
data_files:
- split: train
path: subset_28/train-*
- config_name: subset_29
data_files:
- split: train
path: subset_29/train-*
- config_name: subset_3
data_files:
- split: train
path: subset_3/train-*
- config_name: subset_30
data_files:
- split: train
path: subset_30/train-*
- config_name: subset_31
data_files:
- split: train
path: subset_31/train-*
- config_name: subset_32
data_files:
- split: train
path: subset_32/train-*
- config_name: subset_33
data_files:
- split: train
path: subset_33/train-*
- config_name: subset_34
data_files:
- split: train
path: subset_34/train-*
- config_name: subset_35
data_files:
- split: train
path: subset_35/train-*
- config_name: subset_36
data_files:
- split: train
path: subset_36/train-*
- config_name: subset_37
data_files:
- split: train
path: subset_37/train-*
- config_name: subset_38
data_files:
- split: train
path: subset_38/train-*
- config_name: subset_39
data_files:
- split: train
path: subset_39/train-*
- config_name: subset_4
data_files:
- split: train
path: subset_4/train-*
- config_name: subset_40
data_files:
- split: train
path: subset_40/train-*
- config_name: subset_41
data_files:
- split: train
path: subset_41/train-*
- config_name: subset_42
data_files:
- split: train
path: subset_42/train-*
- config_name: subset_43
data_files:
- split: train
path: subset_43/train-*
- config_name: subset_44
data_files:
- split: train
path: subset_44/train-*
- config_name: subset_45
data_files:
- split: train
path: subset_45/train-*
- config_name: subset_46
data_files:
- split: train
path: subset_46/train-*
- config_name: subset_47
data_files:
- split: train
path: subset_47/train-*
- config_name: subset_48
data_files:
- split: train
path: subset_48/train-*
- config_name: subset_49
data_files:
- split: train
path: subset_49/train-*
- config_name: subset_5
data_files:
- split: train
path: subset_5/train-*
- config_name: subset_50
data_files:
- split: train
path: subset_50/train-*
- config_name: subset_51
data_files:
- split: train
path: subset_51/train-*
- config_name: subset_52
data_files:
- split: train
path: subset_52/train-*
- config_name: subset_53
data_files:
- split: train
path: subset_53/train-*
- config_name: subset_54
data_files:
- split: train
path: subset_54/train-*
- config_name: subset_55
data_files:
- split: train
path: subset_55/train-*
- config_name: subset_56
data_files:
- split: train
path: subset_56/train-*
- config_name: subset_57
data_files:
- split: train
path: subset_57/train-*
- config_name: subset_58
data_files:
- split: train
path: subset_58/train-*
- config_name: subset_59
data_files:
- split: train
path: subset_59/train-*
- config_name: subset_6
data_files:
- split: train
path: subset_6/train-*
- config_name: subset_60
data_files:
- split: train
path: subset_60/train-*
- config_name: subset_61
data_files:
- split: train
path: subset_61/train-*
- config_name: subset_62
data_files:
- split: train
path: subset_62/train-*
- config_name: subset_63
data_files:
- split: train
path: subset_63/train-*
- config_name: subset_64
data_files:
- split: train
path: subset_64/train-*
- config_name: subset_65
data_files:
- split: train
path: subset_65/train-*
- config_name: subset_66
data_files:
- split: train
path: subset_66/train-*
- config_name: subset_67
data_files:
- split: train
path: subset_67/train-*
- config_name: subset_68
data_files:
- split: train
path: subset_68/train-*
- config_name: subset_69
data_files:
- split: train
path: subset_69/train-*
- config_name: subset_7
data_files:
- split: train
path: subset_7/train-*
- config_name: subset_70
data_files:
- split: train
path: subset_70/train-*
- config_name: subset_71
data_files:
- split: train
path: subset_71/train-*
- config_name: subset_72
data_files:
- split: train
path: subset_72/train-*
- config_name: subset_73
data_files:
- split: train
path: subset_73/train-*
- config_name: subset_74
data_files:
- split: train
path: subset_74/train-*
- config_name: subset_75
data_files:
- split: train
path: subset_75/train-*
- config_name: subset_76
data_files:
- split: train
path: subset_76/train-*
- config_name: subset_77
data_files:
- split: train
path: subset_77/train-*
- config_name: subset_78
data_files:
- split: train
path: subset_78/train-*
- config_name: subset_79
data_files:
- split: train
path: subset_79/train-*
- config_name: subset_8
data_files:
- split: train
path: subset_8/train-*
- config_name: subset_80
data_files:
- split: train
path: subset_80/train-*
- config_name: subset_81
data_files:
- split: train
path: subset_81/train-*
- config_name: subset_82
data_files:
- split: train
path: subset_82/train-*
- config_name: subset_83
data_files:
- split: train
path: subset_83/train-*
- config_name: subset_84
data_files:
- split: train
path: subset_84/train-*
- config_name: subset_85
data_files:
- split: train
path: subset_85/train-*
- config_name: subset_86
data_files:
- split: train
path: subset_86/train-*
- config_name: subset_87
data_files:
- split: train
path: subset_87/train-*
- config_name: subset_88
data_files:
- split: train
path: subset_88/train-*
- config_name: subset_89
data_files:
- split: train
path: subset_89/train-*
- config_name: subset_9
data_files:
- split: train
path: subset_9/train-*
- config_name: subset_90
data_files:
- split: train
path: subset_90/train-*
- config_name: subset_91
data_files:
- split: train
path: subset_91/train-*
- config_name: subset_92
data_files:
- split: train
path: subset_92/train-*
- config_name: subset_93
data_files:
- split: train
path: subset_93/train-*
- config_name: subset_94
data_files:
- split: train
path: subset_94/train-*
- config_name: subset_95
data_files:
- split: train
path: subset_95/train-*
- config_name: subset_96
data_files:
- split: train
path: subset_96/train-*
- config_name: subset_97
data_files:
- split: train
path: subset_97/train-*
- config_name: subset_98
data_files:
- split: train
path: subset_98/train-*
- config_name: subset_99
data_files:
- split: train
path: subset_99/train-*
---
提供机构:
asahi417
原始信息汇总
数据集概述
数据集配置
| 配置名称 | 特征数量 |
|---|---|
| subset_1 | 7 |
| subset_10 | 7 |
| subset_100 | 6 |
| subset_101 | 6 |
| subset_102 | 6 |
| subset_103 | 6 |
| subset_104 | 6 |
| subset_105 | 6 |
| subset_106 | 6 |
| subset_107 | 6 |
| subset_108 | 6 |
| subset_109 | 6 |
| subset_11 | 6 |
| subset_110 | 6 |
| subset_111 | 6 |
| subset_112 | 6 |
| subset_113 | 6 |
| subset_114 | 6 |
| subset_115 | 6 |
| subset_116 | 6 |
| subset_117 | 6 |
| subset_118 | 6 |
| subset_119 | 6 |
| subset_12 | 6 |
| subset_120 | 6 |
| subset_121 | 6 |
| subset_122 | 6 |
| subset_123 | 6 |
| subset_124 | 6 |
| subset_125 | 6 |
| subset_126 | 6 |
| subset_127 | 6 |
| subset_128 | 6 |
| subset_129 | 6 |
| subset_13 | 6 |
| subset_130 | 6 |
| subset_131 | 6 |
| subset_132 | 6 |
| subset_133 | 6 |
| subset_134 | 6 |
| subset_135 | 6 |
| subset_136 | 6 |
| subset_137 | 6 |
特征信息
- line_no: 数据类型为 int64。
- enA.id: 数据类型为 string。
- enA.laser_score: 数据类型为 float64。
- jaA.id: 数据类型为 string。
- jaA.laser_score: 数据类型为 float64。
- enA.audio.speaker_embedding: 数据类型为 float32,序列类型。
- jaA.audio.speaker_embedding: 数据类型为 float32,序列类型。
数据集大小
| 配置名称 | 训练集大小(字节) | 训练集样本数 | 下载大小(字节) |
|---|---|---|---|
| subset_1 | 4414641 | 2073 | 4997658 |
| subset_10 | 4176233 | 1961 | 4780568 |
| subset_100 | 3741763 | 1757 | 4353207 |
| subset_101 | 3988818 | 1873 | 4643706 |
| subset_102 | 3978184 | 1868 | 4648714 |
| subset_103 | 4001574 | 1879 | 4658618 |
| subset_104 | 4048413 | 1901 | 4729608 |
| subset_105 | 3993076 | 1875 | 4668731 |
| subset_106 | 4003718 | 1880 | 4662189 |
| subset_107 | 3948365 | 1854 | 4595782 |
| subset_108 | 3905728 | 1834 | 4559863 |
| subset_109 | 3769470 | 1770 | 4394031 |
| subset_11 | 3788623 | 1779 | 4331531 |
| subset_110 | 4063355 | 1908 | 4744550 |
| subset_111 | 3997321 | 1877 | 4671267 |
| subset_112 | 4097438 | 1924 | 4779454 |
| subset_113 | 4110163 | 1930 | 4779350 |
| subset_114 | 4131496 | 1940 | 4826183 |
| subset_115 | 4050591 | 1902 | 4724188 |
| subset_116 | 4067630 | 1910 | 4755490 |
| subset_117 | 4048444 | 1901 | 4728690 |
| subset_118 | 4069725 | 1911 | 4761998 |
| subset_119 | 3975953 | 1867 | 4651589 |
| subset_12 | 4080379 | 1916 | 4694635 |
| subset_120 | 3777961 | 1774 | 4401239 |
| subset_121 | 4035732 | 1895 | 4716373 |
| subset_122 | 3941971 | 1851 | 4602567 |
| subset_123 | 4095279 | 1923 | 4790695 |
| subset_124 | 4016507 | 1886 | 4692804 |
| subset_125 | 4105935 | 1928 | 4791353 |
| subset_126 | 4052679 | 1903 | 4716569 |
| subset_127 | 4050563 | 1902 | 4730611 |
| subset_128 | 4025025 | 1890 | 4692176 |
| subset_129 | 3731131 | 1752 | 4352773 |
| subset_13 | 3767337 | 1769 | 4322880 |
| subset_130 | 3897203 | 1830 | 4564863 |
| subset_131 | 4007978 | 1882 | 4696605 |
| subset_132 | 4084648 | 1918 | 4763518 |
| subset_133 | 4016485 | 1886 | 4676652 |
| subset_134 | 4071885 | 1912 | 4752866 |
| subset_135 | 4020745 | 1888 | 4709130 |
| subset_136 | 3993065 | 1875 | 4672129 |
| subset_137 | 4025025 | 1890 | 4692176 |



