five

asahi417/seamless-align-enA-jaA.speaker-embedding.w2vbert-600m

收藏
Hugging Face2024-06-14 更新2024-06-29 收录
下载链接:
https://hf-mirror.com/datasets/asahi417/seamless-align-enA-jaA.speaker-embedding.w2vbert-600m
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: subset_1 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8702948037 num_examples: 2073 download_size: 8727623134 dataset_size: 8702948037 - config_name: subset_10 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7741197905 num_examples: 1961 download_size: 7763639836 dataset_size: 7741197905 - config_name: subset_100 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7539350527 num_examples: 1757 download_size: 7561057648 dataset_size: 7539350527 - config_name: subset_101 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8000126214 num_examples: 1873 download_size: 8023233099 dataset_size: 8000126214 - config_name: subset_102 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8231636420 num_examples: 1868 download_size: 8254531157 dataset_size: 8231636420 - config_name: subset_103 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8260939982 num_examples: 1879 download_size: 8283834623 dataset_size: 8260939982 - config_name: subset_104 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8212172265 num_examples: 1901 download_size: 8235222862 dataset_size: 8212172265 - config_name: subset_105 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8102126176 num_examples: 1875 download_size: 8125152906 dataset_size: 8102126176 - config_name: subset_106 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8149333978 num_examples: 1880 download_size: 8172350999 dataset_size: 8149333978 - config_name: subset_107 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7957833173 num_examples: 1854 download_size: 7979627705 dataset_size: 7957833173 - config_name: subset_108 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8099793996 num_examples: 1834 download_size: 8122655032 dataset_size: 8099793996 - config_name: subset_109 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7718800410 num_examples: 1770 download_size: 7740413291 dataset_size: 7718800410 - config_name: subset_11 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 6990805131 num_examples: 1779 download_size: 7010541642 dataset_size: 6990805131 - config_name: subset_110 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8330084771 num_examples: 1908 download_size: 8353081082 dataset_size: 8330084771 - config_name: subset_111 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8152306225 num_examples: 1877 download_size: 8175309603 dataset_size: 8152306225 - config_name: subset_112 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8390101886 num_examples: 1924 download_size: 8413102884 dataset_size: 8390101886 - config_name: subset_113 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8310906723 num_examples: 1930 download_size: 8333996530 dataset_size: 8310906723 - config_name: subset_114 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8474559076 num_examples: 1940 download_size: 8497569540 dataset_size: 8474559076 - config_name: subset_115 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8274836795 num_examples: 1902 download_size: 8297842155 dataset_size: 8274836795 - config_name: subset_116 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8425450950 num_examples: 1910 download_size: 8448379586 dataset_size: 8425450950 - config_name: subset_117 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8239572596 num_examples: 1901 download_size: 8262601438 dataset_size: 8239572596 - config_name: subset_118 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8428788397 num_examples: 1911 download_size: 8451712112 dataset_size: 8428788397 - config_name: subset_119 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8197889137 num_examples: 1867 download_size: 8220812536 dataset_size: 8197889137 - config_name: subset_12 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7786880511 num_examples: 1916 download_size: 7809090572 dataset_size: 7786880511 - config_name: subset_120 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7771256109 num_examples: 1774 download_size: 7792859242 dataset_size: 7771256109 - config_name: subset_121 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8381272272 num_examples: 1895 download_size: 8404146628 dataset_size: 8381272272 - config_name: subset_122 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8096171023 num_examples: 1851 download_size: 8119105742 dataset_size: 8096171023 - config_name: subset_123 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8536894075 num_examples: 1923 download_size: 8561046544 dataset_size: 8536894075 - config_name: subset_124 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8324670979 num_examples: 1886 download_size: 8347556191 dataset_size: 8324670979 - config_name: subset_125 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8419646791 num_examples: 1928 download_size: 8442658095 dataset_size: 8419646791 - config_name: subset_126 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8193693735 num_examples: 1903 download_size: 8216757799 dataset_size: 8193693735 - config_name: subset_127 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8402088467 num_examples: 1902 download_size: 8424983997 dataset_size: 8402088467 - config_name: subset_128 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8203946805 num_examples: 1890 download_size: 8226963776 dataset_size: 8203946805 - config_name: subset_129 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7732316635 num_examples: 1752 download_size: 7753855711 dataset_size: 7732316635 - config_name: subset_13 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7037101525 num_examples: 1769 download_size: 7058009817 dataset_size: 7037101525 - config_name: subset_130 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8065944063 num_examples: 1830 download_size: 8088804793 dataset_size: 8065944063 - config_name: subset_131 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8322530442 num_examples: 1882 download_size: 8345403015 dataset_size: 8322530442 - config_name: subset_132 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8367621084 num_examples: 1918 download_size: 8390603718 dataset_size: 8367621084 - config_name: subset_133 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8139076257 num_examples: 1886 download_size: 8162108687 dataset_size: 8139076257 - config_name: subset_134 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8371511509 num_examples: 1912 download_size: 8394489749 dataset_size: 8371511509 - config_name: subset_135 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8314224321 num_examples: 1888 download_size: 8337137850 dataset_size: 8314224321 - config_name: subset_136 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8223646065 num_examples: 1875 download_size: 8246582566 dataset_size: 8223646065 - config_name: subset_137 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8196040056 num_examples: 1866 download_size: 8218960114 dataset_size: 8196040056 - config_name: subset_138 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8158852805 num_examples: 1863 download_size: 8181756297 dataset_size: 8158852805 - config_name: subset_139 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8140652552 num_examples: 1859 download_size: 8163577943 dataset_size: 8140652552 - config_name: subset_14 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 6933327637 num_examples: 1734 download_size: 6952922594 dataset_size: 6933327637 - config_name: subset_140 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7850131272 num_examples: 1766 download_size: 7871620769 dataset_size: 7850131272 - config_name: subset_141 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8322709417 num_examples: 1865 download_size: 8345524409 dataset_size: 8322709417 - config_name: subset_142 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8266927178 num_examples: 1893 download_size: 8289898006 dataset_size: 8266927178 - config_name: subset_143 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8285914359 num_examples: 1894 download_size: 8308883156 dataset_size: 8285914359 - config_name: subset_144 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 6195225027 num_examples: 1381 download_size: 6212594727 dataset_size: 6195225027 - config_name: subset_15 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7665311230 num_examples: 1914 download_size: 7687617157 dataset_size: 7665311230 - config_name: subset_16 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7284662986 num_examples: 1862 download_size: 7305754545 dataset_size: 7284662986 - config_name: subset_17 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7587756587 num_examples: 1875 download_size: 7609952937 dataset_size: 7587756587 - config_name: subset_18 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7874655038 num_examples: 1937 download_size: 7896894047 dataset_size: 7874655038 - config_name: subset_19 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7610994678 num_examples: 1917 download_size: 7633303646 dataset_size: 7610994678 - config_name: subset_2 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7926101081 num_examples: 1929 download_size: 7948245696 dataset_size: 7926101081 - config_name: subset_20 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7527839354 num_examples: 1877 download_size: 7550080089 dataset_size: 7527839354 - config_name: subset_21 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7280210371 num_examples: 1761 download_size: 7300894110 dataset_size: 7280210371 - config_name: subset_22 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7401999881 num_examples: 1850 download_size: 7422966062 dataset_size: 7401999881 - config_name: subset_23 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7247343045 num_examples: 1790 download_size: 7268159959 dataset_size: 7247343045 - config_name: subset_24 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7133290735 num_examples: 1758 download_size: 7154085117 dataset_size: 7133290735 - config_name: subset_25 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7932937468 num_examples: 1898 download_size: 7954959835 dataset_size: 7932937468 - config_name: subset_26 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7771138741 num_examples: 1943 download_size: 7793471558 dataset_size: 7771138741 - config_name: subset_27 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7685359391 num_examples: 1903 download_size: 7707596955 dataset_size: 7685359391 - config_name: subset_28 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7791902759 num_examples: 1912 download_size: 7814086858 dataset_size: 7791902759 - config_name: subset_29 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7829264599 num_examples: 1945 download_size: 7851552812 dataset_size: 7829264599 - config_name: subset_3 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7634149956 num_examples: 1899 download_size: 7656386005 dataset_size: 7634149956 - config_name: subset_30 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7849088664 num_examples: 1902 download_size: 7871167992 dataset_size: 7849088664 - config_name: subset_31 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7483713402 num_examples: 1805 download_size: 7504431374 dataset_size: 7483713402 - config_name: subset_32 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7441076798 num_examples: 1797 download_size: 7461787438 dataset_size: 7441076798 - config_name: subset_33 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7264753022 num_examples: 1757 download_size: 7285428743 dataset_size: 7264753022 - config_name: subset_34 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7771298667 num_examples: 1893 download_size: 7793415792 dataset_size: 7771298667 - config_name: subset_35 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7873248002 num_examples: 1928 download_size: 7895411215 dataset_size: 7873248002 - config_name: subset_36 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7686618903 num_examples: 1863 download_size: 7708682503 dataset_size: 7686618903 - config_name: subset_37 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7715400237 num_examples: 1855 download_size: 7737397687 dataset_size: 7715400237 - config_name: subset_38 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7868878434 num_examples: 1890 download_size: 7890905644 dataset_size: 7868878434 - config_name: subset_39 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7781639342 num_examples: 1899 download_size: 7803773146 dataset_size: 7781639342 - config_name: subset_4 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7182939742 num_examples: 1835 download_size: 7204021516 dataset_size: 7182939742 - config_name: subset_40 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8001971900 num_examples: 1931 download_size: 8025317041 dataset_size: 8001971900 - config_name: subset_41 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7469419069 num_examples: 1784 download_size: 7490040875 dataset_size: 7469419069 - config_name: subset_42 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7468616508 num_examples: 1797 download_size: 7489301657 dataset_size: 7468616508 - config_name: subset_43 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7334272636 num_examples: 1757 download_size: 7354875724 dataset_size: 7334272636 - config_name: subset_44 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7721039896 num_examples: 1831 download_size: 7742936427 dataset_size: 7721039896 - config_name: subset_45 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7758551590 num_examples: 1891 download_size: 7780677193 dataset_size: 7758551590 - config_name: subset_46 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7969570872 num_examples: 1897 download_size: 7991546537 dataset_size: 7969570872 - config_name: subset_47 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8007791058 num_examples: 1897 download_size: 8031001009 dataset_size: 8007791058 - config_name: subset_48 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8007824284 num_examples: 1902 download_size: 8031037654 dataset_size: 8007824284 - config_name: subset_49 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7935588247 num_examples: 1875 download_size: 7957487967 dataset_size: 7935588247 - config_name: subset_5 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7858152479 num_examples: 1987 download_size: 7880605774 dataset_size: 7858152479 - config_name: subset_50 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8109249996 num_examples: 1951 download_size: 8132611446 dataset_size: 8109249996 - config_name: subset_51 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7510818209 num_examples: 1752 download_size: 7532538935 dataset_size: 7510818209 - config_name: subset_52 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7559065253 num_examples: 1780 download_size: 7580860197 dataset_size: 7559065253 - config_name: subset_53 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7823922429 num_examples: 1846 download_size: 7845800994 dataset_size: 7823922429 - config_name: subset_54 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7283573402 num_examples: 1723 download_size: 7304085530 dataset_size: 7283573402 - config_name: subset_55 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7826244629 num_examples: 1866 download_size: 7848199840 dataset_size: 7826244629 - config_name: subset_56 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8070967631 num_examples: 1893 download_size: 8094103833 dataset_size: 8070967631 - config_name: subset_57 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8089440683 num_examples: 1924 download_size: 8112695398 dataset_size: 8089440683 - config_name: subset_58 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7884338733 num_examples: 1881 download_size: 7905956640 dataset_size: 7884338733 - config_name: subset_59 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7908065990 num_examples: 1887 download_size: 7930046277 dataset_size: 7908065990 - config_name: subset_6 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7208550426 num_examples: 1810 download_size: 7229497498 dataset_size: 7208550426 - config_name: subset_60 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8044388677 num_examples: 1909 download_size: 8067603655 dataset_size: 8044388677 - config_name: subset_61 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7377070152 num_examples: 1728 download_size: 7397537262 dataset_size: 7377070152 - config_name: subset_62 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7502071722 num_examples: 1787 download_size: 7523948545 dataset_size: 7502071722 - config_name: subset_63 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7655723552 num_examples: 1790 download_size: 7677492842 dataset_size: 7655723552 - config_name: subset_64 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7712887510 num_examples: 1812 download_size: 7734705808 dataset_size: 7712887510 - config_name: subset_65 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8010253568 num_examples: 1877 download_size: 8033356644 dataset_size: 8010253568 - config_name: subset_66 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8037388419 num_examples: 1890 download_size: 8060541493 dataset_size: 8037388419 - config_name: subset_67 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7999138131 num_examples: 1873 download_size: 8020994067 dataset_size: 7999138131 - config_name: subset_68 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8078264828 num_examples: 1883 download_size: 8101347327 dataset_size: 8078264828 - config_name: subset_69 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8216277566 num_examples: 1916 download_size: 8239402635 dataset_size: 8216277566 - config_name: subset_7 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7279338714 num_examples: 1832 download_size: 7300320145 dataset_size: 7279338714 - config_name: subset_70 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8097733241 num_examples: 1903 download_size: 8120895767 dataset_size: 8097733241 - config_name: subset_71 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7428706247 num_examples: 1736 download_size: 7449166473 dataset_size: 7428706247 - config_name: subset_72 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8201773553 num_examples: 1887 download_size: 8224766208 dataset_size: 8201773553 - config_name: subset_73 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7379653813 num_examples: 1736 download_size: 7400142313 dataset_size: 7379653813 - config_name: subset_74 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7856200346 num_examples: 1829 download_size: 7877966599 dataset_size: 7856200346 - config_name: subset_75 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8009186341 num_examples: 1862 download_size: 8032232828 dataset_size: 8009186341 - config_name: subset_76 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8136036370 num_examples: 1914 download_size: 8159214014 dataset_size: 8136036370 - config_name: subset_77 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8062876796 num_examples: 1874 download_size: 8085940621 dataset_size: 8062876796 - config_name: subset_78 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8023627221 num_examples: 1871 download_size: 8046708604 dataset_size: 8023627221 - config_name: subset_79 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8077302048 num_examples: 1891 download_size: 8100426601 dataset_size: 8077302048 - config_name: subset_8 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7948411696 num_examples: 2009 download_size: 7970892677 dataset_size: 7948411696 - config_name: subset_80 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7942911679 num_examples: 1885 download_size: 7964853748 dataset_size: 7942911679 - config_name: subset_81 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8264358112 num_examples: 1913 download_size: 8287421761 dataset_size: 8264358112 - config_name: subset_82 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8262061855 num_examples: 1910 download_size: 8285114809 dataset_size: 8262061855 - config_name: subset_83 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8113098778 num_examples: 1887 download_size: 8136177900 dataset_size: 8113098778 - config_name: subset_84 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8028612558 num_examples: 1867 download_size: 8051652570 dataset_size: 8028612558 - config_name: subset_85 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8013488805 num_examples: 1881 download_size: 8036620744 dataset_size: 8013488805 - config_name: subset_86 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8003745635 num_examples: 1862 download_size: 8026803981 dataset_size: 8003745635 - config_name: subset_87 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8111430876 num_examples: 1897 download_size: 8134546716 dataset_size: 8111430876 - config_name: subset_88 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8169999635 num_examples: 1900 download_size: 8193073930 dataset_size: 8169999635 - config_name: subset_89 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8168994077 num_examples: 1886 download_size: 8192016527 dataset_size: 8168994077 - config_name: subset_9 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 7774163187 num_examples: 1977 download_size: 7796635468 dataset_size: 7774163187 - config_name: subset_90 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8158902032 num_examples: 1913 download_size: 8182056469 dataset_size: 8158902032 - config_name: subset_91 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8216019083 num_examples: 1913 download_size: 8239110705 dataset_size: 8216019083 - config_name: subset_92 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8020696970 num_examples: 1886 download_size: 8043835828 dataset_size: 8020696970 - config_name: subset_93 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8156262613 num_examples: 1875 download_size: 8179255387 dataset_size: 8156262613 - config_name: subset_94 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8187014650 num_examples: 1900 download_size: 8210091027 dataset_size: 8187014650 - config_name: subset_95 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8012114087 num_examples: 1867 download_size: 8035176759 dataset_size: 8012114087 - config_name: subset_96 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8249310045 num_examples: 1900 download_size: 8272336908 dataset_size: 8249310045 - config_name: subset_97 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8256956441 num_examples: 1899 download_size: 8279963650 dataset_size: 8256956441 - config_name: subset_98 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8255128221 num_examples: 1904 download_size: 8278159024 dataset_size: 8255128221 - config_name: subset_99 features: - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.laser_score dtype: float64 - name: enA.audio.speaker_embedding sequence: float32 - name: enA.audio.speaker_embedding.full sequence: sequence: float32 - name: jaA.audio.speaker_embedding sequence: float32 - name: jaA.audio.speaker_embedding.full sequence: sequence: float32 splits: - name: train num_bytes: 8303626853 num_examples: 1901 download_size: 8326615297 dataset_size: 8303626853 configs: - config_name: subset_1 data_files: - split: train path: subset_1/train-* - config_name: subset_10 data_files: - split: train path: subset_10/train-* - config_name: subset_100 data_files: - split: train path: subset_100/train-* - config_name: subset_101 data_files: - split: train path: subset_101/train-* - config_name: subset_102 data_files: - split: train path: subset_102/train-* - config_name: subset_103 data_files: - split: train path: subset_103/train-* - config_name: subset_104 data_files: - split: train path: subset_104/train-* - config_name: subset_105 data_files: - split: train path: subset_105/train-* - config_name: subset_106 data_files: - split: train path: subset_106/train-* - config_name: subset_107 data_files: - split: train path: subset_107/train-* - config_name: subset_108 data_files: - split: train path: subset_108/train-* - config_name: subset_109 data_files: - split: train path: subset_109/train-* - config_name: subset_11 data_files: - split: train path: subset_11/train-* - config_name: subset_110 data_files: - split: train path: subset_110/train-* - config_name: subset_111 data_files: - split: train path: subset_111/train-* - config_name: subset_112 data_files: - split: train path: subset_112/train-* - config_name: subset_113 data_files: - split: train path: subset_113/train-* - config_name: subset_114 data_files: - split: train path: subset_114/train-* - config_name: subset_115 data_files: - split: train path: subset_115/train-* - config_name: subset_116 data_files: - split: train path: subset_116/train-* - config_name: subset_117 data_files: - split: train path: subset_117/train-* - config_name: subset_118 data_files: - split: train path: subset_118/train-* - config_name: subset_119 data_files: - split: train path: subset_119/train-* - config_name: subset_12 data_files: - split: train path: subset_12/train-* - config_name: subset_120 data_files: - split: train path: subset_120/train-* - config_name: subset_121 data_files: - split: train path: subset_121/train-* - config_name: subset_122 data_files: - split: train path: subset_122/train-* - config_name: subset_123 data_files: - split: train path: subset_123/train-* - config_name: subset_124 data_files: - split: train path: subset_124/train-* - config_name: subset_125 data_files: - split: train path: subset_125/train-* - config_name: subset_126 data_files: - split: train path: subset_126/train-* - config_name: subset_127 data_files: - split: train path: subset_127/train-* - config_name: subset_128 data_files: - split: train path: subset_128/train-* - config_name: subset_129 data_files: - split: train path: subset_129/train-* - config_name: subset_13 data_files: - split: train path: subset_13/train-* - config_name: subset_130 data_files: - split: train path: subset_130/train-* - config_name: subset_131 data_files: - split: train path: subset_131/train-* - config_name: subset_132 data_files: - split: train path: subset_132/train-* - config_name: subset_133 data_files: - split: train path: subset_133/train-* - config_name: subset_134 data_files: - split: train path: subset_134/train-* - config_name: subset_135 data_files: - split: train path: subset_135/train-* - config_name: subset_136 data_files: - split: train path: subset_136/train-* - config_name: subset_137 data_files: - split: train path: subset_137/train-* - config_name: subset_138 data_files: - split: train path: subset_138/train-* - config_name: subset_139 data_files: - split: train path: subset_139/train-* - config_name: subset_14 data_files: - split: train path: subset_14/train-* - config_name: subset_140 data_files: - split: train path: subset_140/train-* - config_name: subset_141 data_files: - split: train path: subset_141/train-* - config_name: subset_142 data_files: - split: train path: subset_142/train-* - config_name: subset_143 data_files: - split: train path: subset_143/train-* - config_name: subset_144 data_files: - split: train path: subset_144/train-* - config_name: subset_15 data_files: - split: train path: subset_15/train-* - config_name: subset_16 data_files: - split: train path: subset_16/train-* - config_name: subset_17 data_files: - split: train path: subset_17/train-* - config_name: subset_18 data_files: - split: train path: subset_18/train-* - config_name: subset_19 data_files: - split: train path: subset_19/train-* - config_name: subset_2 data_files: - split: train path: subset_2/train-* - config_name: subset_20 data_files: - split: train path: subset_20/train-* - config_name: subset_21 data_files: - split: train path: subset_21/train-* - config_name: subset_22 data_files: - split: train path: subset_22/train-* - config_name: subset_23 data_files: - split: train path: subset_23/train-* - config_name: subset_24 data_files: - split: train path: subset_24/train-* - config_name: subset_25 data_files: - split: train path: subset_25/train-* - config_name: subset_26 data_files: - split: train path: subset_26/train-* - config_name: subset_27 data_files: - split: train path: subset_27/train-* - config_name: subset_28 data_files: - split: train path: subset_28/train-* - config_name: subset_29 data_files: - split: train path: subset_29/train-* - config_name: subset_3 data_files: - split: train path: subset_3/train-* - config_name: subset_30 data_files: - split: train path: subset_30/train-* - config_name: subset_31 data_files: - split: train path: subset_31/train-* - config_name: subset_32 data_files: - split: train path: subset_32/train-* - config_name: subset_33 data_files: - split: train path: subset_33/train-* - config_name: subset_34 data_files: - split: train path: subset_34/train-* - config_name: subset_35 data_files: - split: train path: subset_35/train-* - config_name: subset_36 data_files: - split: train path: subset_36/train-* - config_name: subset_37 data_files: - split: train path: subset_37/train-* - config_name: subset_38 data_files: - split: train path: subset_38/train-* - config_name: subset_39 data_files: - split: train path: subset_39/train-* - config_name: subset_4 data_files: - split: train path: subset_4/train-* - config_name: subset_40 data_files: - split: train path: subset_40/train-* - config_name: subset_41 data_files: - split: train path: subset_41/train-* - config_name: subset_42 data_files: - split: train path: subset_42/train-* - config_name: subset_43 data_files: - split: train path: subset_43/train-* - config_name: subset_44 data_files: - split: train path: subset_44/train-* - config_name: subset_45 data_files: - split: train path: subset_45/train-* - config_name: subset_46 data_files: - split: train path: subset_46/train-* - config_name: subset_47 data_files: - split: train path: subset_47/train-* - config_name: subset_48 data_files: - split: train path: subset_48/train-* - config_name: subset_49 data_files: - split: train path: subset_49/train-* - config_name: subset_5 data_files: - split: train path: subset_5/train-* - config_name: subset_50 data_files: - split: train path: subset_50/train-* - config_name: subset_51 data_files: - split: train path: subset_51/train-* - config_name: subset_52 data_files: - split: train path: subset_52/train-* - config_name: subset_53 data_files: - split: train path: subset_53/train-* - config_name: subset_54 data_files: - split: train path: subset_54/train-* - config_name: subset_55 data_files: - split: train path: subset_55/train-* - config_name: subset_56 data_files: - split: train path: subset_56/train-* - config_name: subset_57 data_files: - split: train path: subset_57/train-* - config_name: subset_58 data_files: - split: train path: subset_58/train-* - config_name: subset_59 data_files: - split: train path: subset_59/train-* - config_name: subset_6 data_files: - split: train path: subset_6/train-* - config_name: subset_60 data_files: - split: train path: subset_60/train-* - config_name: subset_61 data_files: - split: train path: subset_61/train-* - config_name: subset_62 data_files: - split: train path: subset_62/train-* - config_name: subset_63 data_files: - split: train path: subset_63/train-* - config_name: subset_64 data_files: - split: train path: subset_64/train-* - config_name: subset_65 data_files: - split: train path: subset_65/train-* - config_name: subset_66 data_files: - split: train path: subset_66/train-* - config_name: subset_67 data_files: - split: train path: subset_67/train-* - config_name: subset_68 data_files: - split: train path: subset_68/train-* - config_name: subset_69 data_files: - split: train path: subset_69/train-* - config_name: subset_7 data_files: - split: train path: subset_7/train-* - config_name: subset_70 data_files: - split: train path: subset_70/train-* - config_name: subset_71 data_files: - split: train path: subset_71/train-* - config_name: subset_72 data_files: - split: train path: subset_72/train-* - config_name: subset_73 data_files: - split: train path: subset_73/train-* - config_name: subset_74 data_files: - split: train path: subset_74/train-* - config_name: subset_75 data_files: - split: train path: subset_75/train-* - config_name: subset_76 data_files: - split: train path: subset_76/train-* - config_name: subset_77 data_files: - split: train path: subset_77/train-* - config_name: subset_78 data_files: - split: train path: subset_78/train-* - config_name: subset_79 data_files: - split: train path: subset_79/train-* - config_name: subset_8 data_files: - split: train path: subset_8/train-* - config_name: subset_80 data_files: - split: train path: subset_80/train-* - config_name: subset_81 data_files: - split: train path: subset_81/train-* - config_name: subset_82 data_files: - split: train path: subset_82/train-* - config_name: subset_83 data_files: - split: train path: subset_83/train-* - config_name: subset_84 data_files: - split: train path: subset_84/train-* - config_name: subset_85 data_files: - split: train path: subset_85/train-* - config_name: subset_86 data_files: - split: train path: subset_86/train-* - config_name: subset_87 data_files: - split: train path: subset_87/train-* - config_name: subset_88 data_files: - split: train path: subset_88/train-* - config_name: subset_89 data_files: - split: train path: subset_89/train-* - config_name: subset_9 data_files: - split: train path: subset_9/train-* - config_name: subset_90 data_files: - split: train path: subset_90/train-* - config_name: subset_91 data_files: - split: train path: subset_91/train-* - config_name: subset_92 data_files: - split: train path: subset_92/train-* - config_name: subset_93 data_files: - split: train path: subset_93/train-* - config_name: subset_94 data_files: - split: train path: subset_94/train-* - config_name: subset_95 data_files: - split: train path: subset_95/train-* - config_name: subset_96 data_files: - split: train path: subset_96/train-* - config_name: subset_97 data_files: - split: train path: subset_97/train-* - config_name: subset_98 data_files: - split: train path: subset_98/train-* - config_name: subset_99 data_files: - split: train path: subset_99/train-* ---
提供机构:
asahi417
原始信息汇总

数据集概述

数据集配置

子集 1 (subset_1)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8702948037
      • 样本数: 2073
  • 下载大小: 8727623134
  • 数据集大小: 8702948037

子集 10 (subset_10)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 7741197905
      • 样本数: 1961
  • 下载大小: 7763639836
  • 数据集大小: 7741197905

子集 100 (subset_100)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 7539350527
      • 样本数: 1757
  • 下载大小: 7561057648
  • 数据集大小: 7539350527

子集 101 (subset_101)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8000126214
      • 样本数: 1873
  • 下载大小: 8023233099
  • 数据集大小: 8000126214

子集 102 (subset_102)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8231636420
      • 样本数: 1868
  • 下载大小: 8254531157
  • 数据集大小: 8231636420

子集 103 (subset_103)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8260939982
      • 样本数: 1879
  • 下载大小: 8283834623
  • 数据集大小: 8260939982

子集 104 (subset_104)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8212172265
      • 样本数: 1901
  • 下载大小: 8235222862
  • 数据集大小: 8212172265

子集 105 (subset_105)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8102126176
      • 样本数: 1875
  • 下载大小: 8125152906
  • 数据集大小: 8102126176

子集 106 (subset_106)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8149333978
      • 样本数: 1880
  • 下载大小: 8172350999
  • 数据集大小: 8149333978

子集 107 (subset_107)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 7957833173
      • 样本数: 1854
  • 下载大小: 7979627705
  • 数据集大小: 7957833173

子集 108 (subset_108)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8099793996
      • 样本数: 1834
  • 下载大小: 8122655032
  • 数据集大小: 8099793996

子集 109 (subset_109)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 7718800410
      • 样本数: 1770
  • 下载大小: 7740413291
  • 数据集大小: 7718800410

子集 11 (subset_11)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 6990805131
      • 样本数: 1779
  • 下载大小: 7010541642
  • 数据集大小: 6990805131

子集 110 (subset_110)

  • 特征:
    • line_no: 整数 (int64)
    • enA.id: 字符串 (string)
    • enA.laser_score: 浮点数 (float64)
    • jaA.id: 字符串 (string)
    • jaA.laser_score: 浮点数 (float64)
    • enA.audio.speaker_embedding: 浮点数序列 (float32)
    • enA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
    • jaA.audio.speaker_embedding: 浮点数序列 (float32)
    • jaA.audio.speaker_embedding.full: 嵌套浮点数序列 (float32)
  • 分割:
    • train:
      • 字节数: 8330084771
      • 样本数: 1908
  • 下载大小: 8353081082
  • 数据集大小: 8330084771

子集 111 (subset_111)

  • 特征:
    • line_no: 整数 (int64)
    • `
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作