asahi417/seamless-align-enA-jaA.speaker-embedding.w2vbert-600m
收藏Hugging Face2024-06-14 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/asahi417/seamless-align-enA-jaA.speaker-embedding.w2vbert-600m
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: subset_1
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8702948037
num_examples: 2073
download_size: 8727623134
dataset_size: 8702948037
- config_name: subset_10
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7741197905
num_examples: 1961
download_size: 7763639836
dataset_size: 7741197905
- config_name: subset_100
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7539350527
num_examples: 1757
download_size: 7561057648
dataset_size: 7539350527
- config_name: subset_101
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8000126214
num_examples: 1873
download_size: 8023233099
dataset_size: 8000126214
- config_name: subset_102
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8231636420
num_examples: 1868
download_size: 8254531157
dataset_size: 8231636420
- config_name: subset_103
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8260939982
num_examples: 1879
download_size: 8283834623
dataset_size: 8260939982
- config_name: subset_104
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8212172265
num_examples: 1901
download_size: 8235222862
dataset_size: 8212172265
- config_name: subset_105
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8102126176
num_examples: 1875
download_size: 8125152906
dataset_size: 8102126176
- config_name: subset_106
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8149333978
num_examples: 1880
download_size: 8172350999
dataset_size: 8149333978
- config_name: subset_107
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7957833173
num_examples: 1854
download_size: 7979627705
dataset_size: 7957833173
- config_name: subset_108
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8099793996
num_examples: 1834
download_size: 8122655032
dataset_size: 8099793996
- config_name: subset_109
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7718800410
num_examples: 1770
download_size: 7740413291
dataset_size: 7718800410
- config_name: subset_11
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 6990805131
num_examples: 1779
download_size: 7010541642
dataset_size: 6990805131
- config_name: subset_110
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8330084771
num_examples: 1908
download_size: 8353081082
dataset_size: 8330084771
- config_name: subset_111
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8152306225
num_examples: 1877
download_size: 8175309603
dataset_size: 8152306225
- config_name: subset_112
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8390101886
num_examples: 1924
download_size: 8413102884
dataset_size: 8390101886
- config_name: subset_113
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8310906723
num_examples: 1930
download_size: 8333996530
dataset_size: 8310906723
- config_name: subset_114
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8474559076
num_examples: 1940
download_size: 8497569540
dataset_size: 8474559076
- config_name: subset_115
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8274836795
num_examples: 1902
download_size: 8297842155
dataset_size: 8274836795
- config_name: subset_116
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8425450950
num_examples: 1910
download_size: 8448379586
dataset_size: 8425450950
- config_name: subset_117
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8239572596
num_examples: 1901
download_size: 8262601438
dataset_size: 8239572596
- config_name: subset_118
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8428788397
num_examples: 1911
download_size: 8451712112
dataset_size: 8428788397
- config_name: subset_119
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8197889137
num_examples: 1867
download_size: 8220812536
dataset_size: 8197889137
- config_name: subset_12
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7786880511
num_examples: 1916
download_size: 7809090572
dataset_size: 7786880511
- config_name: subset_120
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7771256109
num_examples: 1774
download_size: 7792859242
dataset_size: 7771256109
- config_name: subset_121
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8381272272
num_examples: 1895
download_size: 8404146628
dataset_size: 8381272272
- config_name: subset_122
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8096171023
num_examples: 1851
download_size: 8119105742
dataset_size: 8096171023
- config_name: subset_123
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8536894075
num_examples: 1923
download_size: 8561046544
dataset_size: 8536894075
- config_name: subset_124
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8324670979
num_examples: 1886
download_size: 8347556191
dataset_size: 8324670979
- config_name: subset_125
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8419646791
num_examples: 1928
download_size: 8442658095
dataset_size: 8419646791
- config_name: subset_126
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8193693735
num_examples: 1903
download_size: 8216757799
dataset_size: 8193693735
- config_name: subset_127
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8402088467
num_examples: 1902
download_size: 8424983997
dataset_size: 8402088467
- config_name: subset_128
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8203946805
num_examples: 1890
download_size: 8226963776
dataset_size: 8203946805
- config_name: subset_129
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7732316635
num_examples: 1752
download_size: 7753855711
dataset_size: 7732316635
- config_name: subset_13
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7037101525
num_examples: 1769
download_size: 7058009817
dataset_size: 7037101525
- config_name: subset_130
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8065944063
num_examples: 1830
download_size: 8088804793
dataset_size: 8065944063
- config_name: subset_131
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8322530442
num_examples: 1882
download_size: 8345403015
dataset_size: 8322530442
- config_name: subset_132
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8367621084
num_examples: 1918
download_size: 8390603718
dataset_size: 8367621084
- config_name: subset_133
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8139076257
num_examples: 1886
download_size: 8162108687
dataset_size: 8139076257
- config_name: subset_134
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8371511509
num_examples: 1912
download_size: 8394489749
dataset_size: 8371511509
- config_name: subset_135
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8314224321
num_examples: 1888
download_size: 8337137850
dataset_size: 8314224321
- config_name: subset_136
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8223646065
num_examples: 1875
download_size: 8246582566
dataset_size: 8223646065
- config_name: subset_137
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8196040056
num_examples: 1866
download_size: 8218960114
dataset_size: 8196040056
- config_name: subset_138
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8158852805
num_examples: 1863
download_size: 8181756297
dataset_size: 8158852805
- config_name: subset_139
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8140652552
num_examples: 1859
download_size: 8163577943
dataset_size: 8140652552
- config_name: subset_14
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 6933327637
num_examples: 1734
download_size: 6952922594
dataset_size: 6933327637
- config_name: subset_140
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7850131272
num_examples: 1766
download_size: 7871620769
dataset_size: 7850131272
- config_name: subset_141
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8322709417
num_examples: 1865
download_size: 8345524409
dataset_size: 8322709417
- config_name: subset_142
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8266927178
num_examples: 1893
download_size: 8289898006
dataset_size: 8266927178
- config_name: subset_143
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8285914359
num_examples: 1894
download_size: 8308883156
dataset_size: 8285914359
- config_name: subset_144
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 6195225027
num_examples: 1381
download_size: 6212594727
dataset_size: 6195225027
- config_name: subset_15
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7665311230
num_examples: 1914
download_size: 7687617157
dataset_size: 7665311230
- config_name: subset_16
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7284662986
num_examples: 1862
download_size: 7305754545
dataset_size: 7284662986
- config_name: subset_17
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7587756587
num_examples: 1875
download_size: 7609952937
dataset_size: 7587756587
- config_name: subset_18
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7874655038
num_examples: 1937
download_size: 7896894047
dataset_size: 7874655038
- config_name: subset_19
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7610994678
num_examples: 1917
download_size: 7633303646
dataset_size: 7610994678
- config_name: subset_2
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7926101081
num_examples: 1929
download_size: 7948245696
dataset_size: 7926101081
- config_name: subset_20
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7527839354
num_examples: 1877
download_size: 7550080089
dataset_size: 7527839354
- config_name: subset_21
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7280210371
num_examples: 1761
download_size: 7300894110
dataset_size: 7280210371
- config_name: subset_22
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7401999881
num_examples: 1850
download_size: 7422966062
dataset_size: 7401999881
- config_name: subset_23
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7247343045
num_examples: 1790
download_size: 7268159959
dataset_size: 7247343045
- config_name: subset_24
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7133290735
num_examples: 1758
download_size: 7154085117
dataset_size: 7133290735
- config_name: subset_25
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7932937468
num_examples: 1898
download_size: 7954959835
dataset_size: 7932937468
- config_name: subset_26
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7771138741
num_examples: 1943
download_size: 7793471558
dataset_size: 7771138741
- config_name: subset_27
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7685359391
num_examples: 1903
download_size: 7707596955
dataset_size: 7685359391
- config_name: subset_28
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7791902759
num_examples: 1912
download_size: 7814086858
dataset_size: 7791902759
- config_name: subset_29
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7829264599
num_examples: 1945
download_size: 7851552812
dataset_size: 7829264599
- config_name: subset_3
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7634149956
num_examples: 1899
download_size: 7656386005
dataset_size: 7634149956
- config_name: subset_30
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7849088664
num_examples: 1902
download_size: 7871167992
dataset_size: 7849088664
- config_name: subset_31
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7483713402
num_examples: 1805
download_size: 7504431374
dataset_size: 7483713402
- config_name: subset_32
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7441076798
num_examples: 1797
download_size: 7461787438
dataset_size: 7441076798
- config_name: subset_33
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7264753022
num_examples: 1757
download_size: 7285428743
dataset_size: 7264753022
- config_name: subset_34
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7771298667
num_examples: 1893
download_size: 7793415792
dataset_size: 7771298667
- config_name: subset_35
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7873248002
num_examples: 1928
download_size: 7895411215
dataset_size: 7873248002
- config_name: subset_36
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7686618903
num_examples: 1863
download_size: 7708682503
dataset_size: 7686618903
- config_name: subset_37
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7715400237
num_examples: 1855
download_size: 7737397687
dataset_size: 7715400237
- config_name: subset_38
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7868878434
num_examples: 1890
download_size: 7890905644
dataset_size: 7868878434
- config_name: subset_39
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7781639342
num_examples: 1899
download_size: 7803773146
dataset_size: 7781639342
- config_name: subset_4
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7182939742
num_examples: 1835
download_size: 7204021516
dataset_size: 7182939742
- config_name: subset_40
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8001971900
num_examples: 1931
download_size: 8025317041
dataset_size: 8001971900
- config_name: subset_41
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7469419069
num_examples: 1784
download_size: 7490040875
dataset_size: 7469419069
- config_name: subset_42
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7468616508
num_examples: 1797
download_size: 7489301657
dataset_size: 7468616508
- config_name: subset_43
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7334272636
num_examples: 1757
download_size: 7354875724
dataset_size: 7334272636
- config_name: subset_44
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7721039896
num_examples: 1831
download_size: 7742936427
dataset_size: 7721039896
- config_name: subset_45
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7758551590
num_examples: 1891
download_size: 7780677193
dataset_size: 7758551590
- config_name: subset_46
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7969570872
num_examples: 1897
download_size: 7991546537
dataset_size: 7969570872
- config_name: subset_47
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8007791058
num_examples: 1897
download_size: 8031001009
dataset_size: 8007791058
- config_name: subset_48
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8007824284
num_examples: 1902
download_size: 8031037654
dataset_size: 8007824284
- config_name: subset_49
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7935588247
num_examples: 1875
download_size: 7957487967
dataset_size: 7935588247
- config_name: subset_5
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7858152479
num_examples: 1987
download_size: 7880605774
dataset_size: 7858152479
- config_name: subset_50
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8109249996
num_examples: 1951
download_size: 8132611446
dataset_size: 8109249996
- config_name: subset_51
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7510818209
num_examples: 1752
download_size: 7532538935
dataset_size: 7510818209
- config_name: subset_52
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7559065253
num_examples: 1780
download_size: 7580860197
dataset_size: 7559065253
- config_name: subset_53
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7823922429
num_examples: 1846
download_size: 7845800994
dataset_size: 7823922429
- config_name: subset_54
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7283573402
num_examples: 1723
download_size: 7304085530
dataset_size: 7283573402
- config_name: subset_55
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7826244629
num_examples: 1866
download_size: 7848199840
dataset_size: 7826244629
- config_name: subset_56
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8070967631
num_examples: 1893
download_size: 8094103833
dataset_size: 8070967631
- config_name: subset_57
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8089440683
num_examples: 1924
download_size: 8112695398
dataset_size: 8089440683
- config_name: subset_58
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7884338733
num_examples: 1881
download_size: 7905956640
dataset_size: 7884338733
- config_name: subset_59
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7908065990
num_examples: 1887
download_size: 7930046277
dataset_size: 7908065990
- config_name: subset_6
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7208550426
num_examples: 1810
download_size: 7229497498
dataset_size: 7208550426
- config_name: subset_60
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8044388677
num_examples: 1909
download_size: 8067603655
dataset_size: 8044388677
- config_name: subset_61
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7377070152
num_examples: 1728
download_size: 7397537262
dataset_size: 7377070152
- config_name: subset_62
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7502071722
num_examples: 1787
download_size: 7523948545
dataset_size: 7502071722
- config_name: subset_63
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7655723552
num_examples: 1790
download_size: 7677492842
dataset_size: 7655723552
- config_name: subset_64
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7712887510
num_examples: 1812
download_size: 7734705808
dataset_size: 7712887510
- config_name: subset_65
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8010253568
num_examples: 1877
download_size: 8033356644
dataset_size: 8010253568
- config_name: subset_66
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8037388419
num_examples: 1890
download_size: 8060541493
dataset_size: 8037388419
- config_name: subset_67
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7999138131
num_examples: 1873
download_size: 8020994067
dataset_size: 7999138131
- config_name: subset_68
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8078264828
num_examples: 1883
download_size: 8101347327
dataset_size: 8078264828
- config_name: subset_69
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8216277566
num_examples: 1916
download_size: 8239402635
dataset_size: 8216277566
- config_name: subset_7
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7279338714
num_examples: 1832
download_size: 7300320145
dataset_size: 7279338714
- config_name: subset_70
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8097733241
num_examples: 1903
download_size: 8120895767
dataset_size: 8097733241
- config_name: subset_71
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7428706247
num_examples: 1736
download_size: 7449166473
dataset_size: 7428706247
- config_name: subset_72
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8201773553
num_examples: 1887
download_size: 8224766208
dataset_size: 8201773553
- config_name: subset_73
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7379653813
num_examples: 1736
download_size: 7400142313
dataset_size: 7379653813
- config_name: subset_74
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7856200346
num_examples: 1829
download_size: 7877966599
dataset_size: 7856200346
- config_name: subset_75
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8009186341
num_examples: 1862
download_size: 8032232828
dataset_size: 8009186341
- config_name: subset_76
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8136036370
num_examples: 1914
download_size: 8159214014
dataset_size: 8136036370
- config_name: subset_77
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8062876796
num_examples: 1874
download_size: 8085940621
dataset_size: 8062876796
- config_name: subset_78
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8023627221
num_examples: 1871
download_size: 8046708604
dataset_size: 8023627221
- config_name: subset_79
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8077302048
num_examples: 1891
download_size: 8100426601
dataset_size: 8077302048
- config_name: subset_8
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7948411696
num_examples: 2009
download_size: 7970892677
dataset_size: 7948411696
- config_name: subset_80
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7942911679
num_examples: 1885
download_size: 7964853748
dataset_size: 7942911679
- config_name: subset_81
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8264358112
num_examples: 1913
download_size: 8287421761
dataset_size: 8264358112
- config_name: subset_82
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8262061855
num_examples: 1910
download_size: 8285114809
dataset_size: 8262061855
- config_name: subset_83
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8113098778
num_examples: 1887
download_size: 8136177900
dataset_size: 8113098778
- config_name: subset_84
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8028612558
num_examples: 1867
download_size: 8051652570
dataset_size: 8028612558
- config_name: subset_85
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8013488805
num_examples: 1881
download_size: 8036620744
dataset_size: 8013488805
- config_name: subset_86
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8003745635
num_examples: 1862
download_size: 8026803981
dataset_size: 8003745635
- config_name: subset_87
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8111430876
num_examples: 1897
download_size: 8134546716
dataset_size: 8111430876
- config_name: subset_88
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8169999635
num_examples: 1900
download_size: 8193073930
dataset_size: 8169999635
- config_name: subset_89
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8168994077
num_examples: 1886
download_size: 8192016527
dataset_size: 8168994077
- config_name: subset_9
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 7774163187
num_examples: 1977
download_size: 7796635468
dataset_size: 7774163187
- config_name: subset_90
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8158902032
num_examples: 1913
download_size: 8182056469
dataset_size: 8158902032
- config_name: subset_91
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8216019083
num_examples: 1913
download_size: 8239110705
dataset_size: 8216019083
- config_name: subset_92
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8020696970
num_examples: 1886
download_size: 8043835828
dataset_size: 8020696970
- config_name: subset_93
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8156262613
num_examples: 1875
download_size: 8179255387
dataset_size: 8156262613
- config_name: subset_94
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8187014650
num_examples: 1900
download_size: 8210091027
dataset_size: 8187014650
- config_name: subset_95
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8012114087
num_examples: 1867
download_size: 8035176759
dataset_size: 8012114087
- config_name: subset_96
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8249310045
num_examples: 1900
download_size: 8272336908
dataset_size: 8249310045
- config_name: subset_97
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8256956441
num_examples: 1899
download_size: 8279963650
dataset_size: 8256956441
- config_name: subset_98
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8255128221
num_examples: 1904
download_size: 8278159024
dataset_size: 8255128221
- config_name: subset_99
features:
- name: line_no
dtype: int64
- name: enA.id
dtype: string
- name: enA.laser_score
dtype: float64
- name: jaA.id
dtype: string
- name: jaA.laser_score
dtype: float64
- name: enA.audio.speaker_embedding
sequence: float32
- name: enA.audio.speaker_embedding.full
sequence:
sequence: float32
- name: jaA.audio.speaker_embedding
sequence: float32
- name: jaA.audio.speaker_embedding.full
sequence:
sequence: float32
splits:
- name: train
num_bytes: 8303626853
num_examples: 1901
download_size: 8326615297
dataset_size: 8303626853
configs:
- config_name: subset_1
data_files:
- split: train
path: subset_1/train-*
- config_name: subset_10
data_files:
- split: train
path: subset_10/train-*
- config_name: subset_100
data_files:
- split: train
path: subset_100/train-*
- config_name: subset_101
data_files:
- split: train
path: subset_101/train-*
- config_name: subset_102
data_files:
- split: train
path: subset_102/train-*
- config_name: subset_103
data_files:
- split: train
path: subset_103/train-*
- config_name: subset_104
data_files:
- split: train
path: subset_104/train-*
- config_name: subset_105
data_files:
- split: train
path: subset_105/train-*
- config_name: subset_106
data_files:
- split: train
path: subset_106/train-*
- config_name: subset_107
data_files:
- split: train
path: subset_107/train-*
- config_name: subset_108
data_files:
- split: train
path: subset_108/train-*
- config_name: subset_109
data_files:
- split: train
path: subset_109/train-*
- config_name: subset_11
data_files:
- split: train
path: subset_11/train-*
- config_name: subset_110
data_files:
- split: train
path: subset_110/train-*
- config_name: subset_111
data_files:
- split: train
path: subset_111/train-*
- config_name: subset_112
data_files:
- split: train
path: subset_112/train-*
- config_name: subset_113
data_files:
- split: train
path: subset_113/train-*
- config_name: subset_114
data_files:
- split: train
path: subset_114/train-*
- config_name: subset_115
data_files:
- split: train
path: subset_115/train-*
- config_name: subset_116
data_files:
- split: train
path: subset_116/train-*
- config_name: subset_117
data_files:
- split: train
path: subset_117/train-*
- config_name: subset_118
data_files:
- split: train
path: subset_118/train-*
- config_name: subset_119
data_files:
- split: train
path: subset_119/train-*
- config_name: subset_12
data_files:
- split: train
path: subset_12/train-*
- config_name: subset_120
data_files:
- split: train
path: subset_120/train-*
- config_name: subset_121
data_files:
- split: train
path: subset_121/train-*
- config_name: subset_122
data_files:
- split: train
path: subset_122/train-*
- config_name: subset_123
data_files:
- split: train
path: subset_123/train-*
- config_name: subset_124
data_files:
- split: train
path: subset_124/train-*
- config_name: subset_125
data_files:
- split: train
path: subset_125/train-*
- config_name: subset_126
data_files:
- split: train
path: subset_126/train-*
- config_name: subset_127
data_files:
- split: train
path: subset_127/train-*
- config_name: subset_128
data_files:
- split: train
path: subset_128/train-*
- config_name: subset_129
data_files:
- split: train
path: subset_129/train-*
- config_name: subset_13
data_files:
- split: train
path: subset_13/train-*
- config_name: subset_130
data_files:
- split: train
path: subset_130/train-*
- config_name: subset_131
data_files:
- split: train
path: subset_131/train-*
- config_name: subset_132
data_files:
- split: train
path: subset_132/train-*
- config_name: subset_133
data_files:
- split: train
path: subset_133/train-*
- config_name: subset_134
data_files:
- split: train
path: subset_134/train-*
- config_name: subset_135
data_files:
- split: train
path: subset_135/train-*
- config_name: subset_136
data_files:
- split: train
path: subset_136/train-*
- config_name: subset_137
data_files:
- split: train
path: subset_137/train-*
- config_name: subset_138
data_files:
- split: train
path: subset_138/train-*
- config_name: subset_139
data_files:
- split: train
path: subset_139/train-*
- config_name: subset_14
data_files:
- split: train
path: subset_14/train-*
- config_name: subset_140
data_files:
- split: train
path: subset_140/train-*
- config_name: subset_141
data_files:
- split: train
path: subset_141/train-*
- config_name: subset_142
data_files:
- split: train
path: subset_142/train-*
- config_name: subset_143
data_files:
- split: train
path: subset_143/train-*
- config_name: subset_144
data_files:
- split: train
path: subset_144/train-*
- config_name: subset_15
data_files:
- split: train
path: subset_15/train-*
- config_name: subset_16
data_files:
- split: train
path: subset_16/train-*
- config_name: subset_17
data_files:
- split: train
path: subset_17/train-*
- config_name: subset_18
data_files:
- split: train
path: subset_18/train-*
- config_name: subset_19
data_files:
- split: train
path: subset_19/train-*
- config_name: subset_2
data_files:
- split: train
path: subset_2/train-*
- config_name: subset_20
data_files:
- split: train
path: subset_20/train-*
- config_name: subset_21
data_files:
- split: train
path: subset_21/train-*
- config_name: subset_22
data_files:
- split: train
path: subset_22/train-*
- config_name: subset_23
data_files:
- split: train
path: subset_23/train-*
- config_name: subset_24
data_files:
- split: train
path: subset_24/train-*
- config_name: subset_25
data_files:
- split: train
path: subset_25/train-*
- config_name: subset_26
data_files:
- split: train
path: subset_26/train-*
- config_name: subset_27
data_files:
- split: train
path: subset_27/train-*
- config_name: subset_28
data_files:
- split: train
path: subset_28/train-*
- config_name: subset_29
data_files:
- split: train
path: subset_29/train-*
- config_name: subset_3
data_files:
- split: train
path: subset_3/train-*
- config_name: subset_30
data_files:
- split: train
path: subset_30/train-*
- config_name: subset_31
data_files:
- split: train
path: subset_31/train-*
- config_name: subset_32
data_files:
- split: train
path: subset_32/train-*
- config_name: subset_33
data_files:
- split: train
path: subset_33/train-*
- config_name: subset_34
data_files:
- split: train
path: subset_34/train-*
- config_name: subset_35
data_files:
- split: train
path: subset_35/train-*
- config_name: subset_36
data_files:
- split: train
path: subset_36/train-*
- config_name: subset_37
data_files:
- split: train
path: subset_37/train-*
- config_name: subset_38
data_files:
- split: train
path: subset_38/train-*
- config_name: subset_39
data_files:
- split: train
path: subset_39/train-*
- config_name: subset_4
data_files:
- split: train
path: subset_4/train-*
- config_name: subset_40
data_files:
- split: train
path: subset_40/train-*
- config_name: subset_41
data_files:
- split: train
path: subset_41/train-*
- config_name: subset_42
data_files:
- split: train
path: subset_42/train-*
- config_name: subset_43
data_files:
- split: train
path: subset_43/train-*
- config_name: subset_44
data_files:
- split: train
path: subset_44/train-*
- config_name: subset_45
data_files:
- split: train
path: subset_45/train-*
- config_name: subset_46
data_files:
- split: train
path: subset_46/train-*
- config_name: subset_47
data_files:
- split: train
path: subset_47/train-*
- config_name: subset_48
data_files:
- split: train
path: subset_48/train-*
- config_name: subset_49
data_files:
- split: train
path: subset_49/train-*
- config_name: subset_5
data_files:
- split: train
path: subset_5/train-*
- config_name: subset_50
data_files:
- split: train
path: subset_50/train-*
- config_name: subset_51
data_files:
- split: train
path: subset_51/train-*
- config_name: subset_52
data_files:
- split: train
path: subset_52/train-*
- config_name: subset_53
data_files:
- split: train
path: subset_53/train-*
- config_name: subset_54
data_files:
- split: train
path: subset_54/train-*
- config_name: subset_55
data_files:
- split: train
path: subset_55/train-*
- config_name: subset_56
data_files:
- split: train
path: subset_56/train-*
- config_name: subset_57
data_files:
- split: train
path: subset_57/train-*
- config_name: subset_58
data_files:
- split: train
path: subset_58/train-*
- config_name: subset_59
data_files:
- split: train
path: subset_59/train-*
- config_name: subset_6
data_files:
- split: train
path: subset_6/train-*
- config_name: subset_60
data_files:
- split: train
path: subset_60/train-*
- config_name: subset_61
data_files:
- split: train
path: subset_61/train-*
- config_name: subset_62
data_files:
- split: train
path: subset_62/train-*
- config_name: subset_63
data_files:
- split: train
path: subset_63/train-*
- config_name: subset_64
data_files:
- split: train
path: subset_64/train-*
- config_name: subset_65
data_files:
- split: train
path: subset_65/train-*
- config_name: subset_66
data_files:
- split: train
path: subset_66/train-*
- config_name: subset_67
data_files:
- split: train
path: subset_67/train-*
- config_name: subset_68
data_files:
- split: train
path: subset_68/train-*
- config_name: subset_69
data_files:
- split: train
path: subset_69/train-*
- config_name: subset_7
data_files:
- split: train
path: subset_7/train-*
- config_name: subset_70
data_files:
- split: train
path: subset_70/train-*
- config_name: subset_71
data_files:
- split: train
path: subset_71/train-*
- config_name: subset_72
data_files:
- split: train
path: subset_72/train-*
- config_name: subset_73
data_files:
- split: train
path: subset_73/train-*
- config_name: subset_74
data_files:
- split: train
path: subset_74/train-*
- config_name: subset_75
data_files:
- split: train
path: subset_75/train-*
- config_name: subset_76
data_files:
- split: train
path: subset_76/train-*
- config_name: subset_77
data_files:
- split: train
path: subset_77/train-*
- config_name: subset_78
data_files:
- split: train
path: subset_78/train-*
- config_name: subset_79
data_files:
- split: train
path: subset_79/train-*
- config_name: subset_8
data_files:
- split: train
path: subset_8/train-*
- config_name: subset_80
data_files:
- split: train
path: subset_80/train-*
- config_name: subset_81
data_files:
- split: train
path: subset_81/train-*
- config_name: subset_82
data_files:
- split: train
path: subset_82/train-*
- config_name: subset_83
data_files:
- split: train
path: subset_83/train-*
- config_name: subset_84
data_files:
- split: train
path: subset_84/train-*
- config_name: subset_85
data_files:
- split: train
path: subset_85/train-*
- config_name: subset_86
data_files:
- split: train
path: subset_86/train-*
- config_name: subset_87
data_files:
- split: train
path: subset_87/train-*
- config_name: subset_88
data_files:
- split: train
path: subset_88/train-*
- config_name: subset_89
data_files:
- split: train
path: subset_89/train-*
- config_name: subset_9
data_files:
- split: train
path: subset_9/train-*
- config_name: subset_90
data_files:
- split: train
path: subset_90/train-*
- config_name: subset_91
data_files:
- split: train
path: subset_91/train-*
- config_name: subset_92
data_files:
- split: train
path: subset_92/train-*
- config_name: subset_93
data_files:
- split: train
path: subset_93/train-*
- config_name: subset_94
data_files:
- split: train
path: subset_94/train-*
- config_name: subset_95
data_files:
- split: train
path: subset_95/train-*
- config_name: subset_96
data_files:
- split: train
path: subset_96/train-*
- config_name: subset_97
data_files:
- split: train
path: subset_97/train-*
- config_name: subset_98
data_files:
- split: train
path: subset_98/train-*
- config_name: subset_99
data_files:
- split: train
path: subset_99/train-*
---
提供机构:
asahi417
原始信息汇总
数据集概述
数据集配置
子集 1 (subset_1)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8702948037
- 样本数: 2073
- 下载大小: 8727623134
- 数据集大小: 8702948037
子集 10 (subset_10)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 7741197905
- 样本数: 1961
- 下载大小: 7763639836
- 数据集大小: 7741197905
子集 100 (subset_100)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 7539350527
- 样本数: 1757
- 下载大小: 7561057648
- 数据集大小: 7539350527
子集 101 (subset_101)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8000126214
- 样本数: 1873
- 下载大小: 8023233099
- 数据集大小: 8000126214
子集 102 (subset_102)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8231636420
- 样本数: 1868
- 下载大小: 8254531157
- 数据集大小: 8231636420
子集 103 (subset_103)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8260939982
- 样本数: 1879
- 下载大小: 8283834623
- 数据集大小: 8260939982
子集 104 (subset_104)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8212172265
- 样本数: 1901
- 下载大小: 8235222862
- 数据集大小: 8212172265
子集 105 (subset_105)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8102126176
- 样本数: 1875
- 下载大小: 8125152906
- 数据集大小: 8102126176
子集 106 (subset_106)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8149333978
- 样本数: 1880
- 下载大小: 8172350999
- 数据集大小: 8149333978
子集 107 (subset_107)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 7957833173
- 样本数: 1854
- 下载大小: 7979627705
- 数据集大小: 7957833173
子集 108 (subset_108)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8099793996
- 样本数: 1834
- 下载大小: 8122655032
- 数据集大小: 8099793996
子集 109 (subset_109)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 7718800410
- 样本数: 1770
- 下载大小: 7740413291
- 数据集大小: 7718800410
子集 11 (subset_11)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 6990805131
- 样本数: 1779
- 下载大小: 7010541642
- 数据集大小: 6990805131
子集 110 (subset_110)
- 特征:
line_no: 整数 (int64)enA.id: 字符串 (string)enA.laser_score: 浮点数 (float64)jaA.id: 字符串 (string)jaA.laser_score: 浮点数 (float64)enA.audio.speaker_embedding: 浮点数序列 (float32)enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)jaA.audio.speaker_embedding: 浮点数序列 (float32)jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
- 分割:
train:- 字节数: 8330084771
- 样本数: 1908
- 下载大小: 8353081082
- 数据集大小: 8330084771
子集 111 (subset_111)
- 特征:
line_no: 整数 (int64)- `



