
asahi417/seamless-align-enA-jaA

Source: Hugging Face. Updated 2024-05-28, indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/asahi417/seamless-align-enA-jaA
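Each config listed in the dataset_info metadata on this page (subset_1, subset_10, ...) has a single train split with paired English and Japanese audio plus alignment fields. The following is a minimal sketch of how one config might be inspected with the Hugging Face `datasets` library. It assumes that `datasets` is installed and that the repository is reachable. The helper names `subset_config` and `first_row_metadata` are hypothetical and not part of the dataset. Streaming is used so that the full archive for a subset does not have to be downloaded first, and the audio columns are dropped so that no audio decoder is needed.

```python
"""Sketch: stream one config of the dataset with the Hugging Face `datasets` library."""

REPO_ID = "asahi417/seamless-align-enA-jaA"

# Column names copied from the dataset_info metadata on this page.
AUDIO_COLUMNS = ["enA.audio", "jaA.audio"]
METADATA_COLUMNS = [
    "line_no",
    "enA.id", "enA.url", "enA.duration_start", "enA.duration_end", "enA.laser_score",
    "jaA.id", "jaA.url", "jaA.duration_start", "jaA.duration_end", "jaA.laser_score",
]


def subset_config(index: int) -> str:
    """Build a config name such as "subset_1" (naming pattern taken from the metadata)."""
    return f"subset_{index}"


def first_row_metadata(index: int) -> dict:
    """Stream the first training example of one subset and return its non-audio fields.

    Hypothetical helper: needs network access and `pip install datasets`.
    """
    from datasets import load_dataset  # imported lazily so the module loads without it

    stream = load_dataset(REPO_ID, subset_config(index), split="train", streaming=True)
    # Drop the audio columns so iterating does not require an audio decoder.
    stream = stream.remove_columns(AUDIO_COLUMNS)
    row = next(iter(stream))
    return {name: row[name] for name in METADATA_COLUMNS}


if __name__ == "__main__":
    # Prints the alignment metadata of the first pair in subset_1.
    print(first_row_metadata(1))
```

To work with the audio itself, keep the audio columns instead of removing them; decoding then depends on the audio backend installed alongside `datasets`.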
Resource description:
--- dataset_info: - config_name: subset_1 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 392397221.989 num_examples: 2081 download_size: 386957004 dataset_size: 392397221.989 - config_name: subset_10 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 351659087.73 num_examples: 1965 download_size: 347647106 dataset_size: 351659087.73 - config_name: subset_100 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 347439311.634 num_examples: 1763 download_size: 344710645 dataset_size: 347439311.634 - config_name: subset_101 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - 
name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 365547481.5 num_examples: 1875 download_size: 362661933 dataset_size: 365547481.5 - config_name: subset_102 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 381968578.736 num_examples: 1881 download_size: 378806119 dataset_size: 381968578.736 - config_name: subset_103 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 377107099.288 num_examples: 1892 download_size: 376129169 dataset_size: 377107099.288 - config_name: subset_104 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: 
int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 381521507.888 num_examples: 1906 download_size: 373259505 dataset_size: 381521507.888 - config_name: subset_105 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 368079639.417 num_examples: 1883 download_size: 369775684 dataset_size: 368079639.417 - config_name: subset_106 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 372391250.272 num_examples: 1892 download_size: 371305914 dataset_size: 372391250.272 - config_name: subset_107 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 367981287.206 num_examples: 1858 download_size: 367316048 dataset_size: 367981287.206 - 
config_name: subset_108 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 370223944.467 num_examples: 1841 download_size: 372346370 dataset_size: 370223944.467 - config_name: subset_109 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 357015475.43 num_examples: 1785 download_size: 352298722 dataset_size: 357015475.43 - config_name: subset_11 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 315461317.835 num_examples: 1785 download_size: 317212663 dataset_size: 315461317.835 - config_name: subset_110 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: 
enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 385144074.155 num_examples: 1915 download_size: 379037748 dataset_size: 385144074.155 - config_name: subset_111 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 374935612.104 num_examples: 1892 download_size: 372239748 dataset_size: 374935612.104 - config_name: subset_112 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 387721101.719 num_examples: 1927 download_size: 383839213 dataset_size: 387721101.719 - config_name: subset_113 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: 
int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 379212576.24 num_examples: 1934 download_size: 378453943 dataset_size: 379212576.24 - config_name: subset_114 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 384159658.67 num_examples: 1955 download_size: 384669279 dataset_size: 384159658.67 - config_name: subset_115 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 377452879.389 num_examples: 1909 download_size: 376371193 dataset_size: 377452879.389 - config_name: subset_116 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 391109792.769 num_examples: 1919 download_size: 383604460 dataset_size: 391109792.769 - 
config_name: subset_117 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 381047288.968 num_examples: 1904 download_size: 376450557 dataset_size: 381047288.968 - config_name: subset_118 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 382181426.2 num_examples: 1920 download_size: 385299598 dataset_size: 382181426.2 - config_name: subset_119 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 378101700.56 num_examples: 1874 download_size: 376758439 dataset_size: 378101700.56 - config_name: subset_12 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start 
dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 358314567.44 num_examples: 1927 download_size: 352456885 dataset_size: 358314567.44 - config_name: subset_120 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 359954090.328 num_examples: 1784 download_size: 356979013 dataset_size: 359954090.328 - config_name: subset_121 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 376331279.186 num_examples: 1902 download_size: 380547371 dataset_size: 376331279.186 - config_name: subset_122 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: 
jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 369654334.356 num_examples: 1862 download_size: 371179800 dataset_size: 369654334.356 - config_name: subset_123 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 388541568.145 num_examples: 1929 download_size: 389026218 dataset_size: 388541568.145 - config_name: subset_124 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 379776241.785 num_examples: 1895 download_size: 378850414 dataset_size: 379776241.785 - config_name: subset_125 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 389122249.73 num_examples: 1938 download_size: 384996217 dataset_size: 389122249.73 - config_name: 
subset_126 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 379184588.31 num_examples: 1914 download_size: 373563947 dataset_size: 379184588.31 - config_name: subset_127 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 378141523.803 num_examples: 1907 download_size: 383375289 dataset_size: 378141523.803 - config_name: subset_128 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 377769554.88 num_examples: 1895 download_size: 376653550 dataset_size: 377769554.88 - config_name: subset_129 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: 
int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 353562633.516 num_examples: 1756 download_size: 354226686 dataset_size: 353562633.516 - config_name: subset_13 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 317069015.599 num_examples: 1777 download_size: 321144002 dataset_size: 317069015.599 - config_name: subset_130 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 370915893.668 num_examples: 1839 download_size: 371889303 dataset_size: 370915893.668 - config_name: subset_131 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end 
dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 375004158.552 num_examples: 1888 download_size: 380782590 dataset_size: 375004158.552 - config_name: subset_132 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 388826866.976 num_examples: 1928 download_size: 382856675 dataset_size: 388826866.976 - config_name: subset_133 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 372116703.19 num_examples: 1894 download_size: 372237148 dataset_size: 372116703.19 - config_name: subset_134 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 386208157.937 num_examples: 1917 download_size: 383730732 dataset_size: 386208157.937 - config_name: subset_135 features: 
- name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 381640691.498 num_examples: 1899 download_size: 380239839 dataset_size: 381640691.498 - config_name: subset_136 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 378631953.776 num_examples: 1888 download_size: 375946111 dataset_size: 378631953.776 - config_name: subset_137 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 374309454.612 num_examples: 1876 download_size: 373714310 dataset_size: 374309454.612 - config_name: subset_138 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: 
enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 371147813.535 num_examples: 1871 download_size: 371796840 dataset_size: 371147813.535 - config_name: subset_139 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 366916855.865 num_examples: 1861 download_size: 368992793 dataset_size: 366916855.865 - config_name: subset_14 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 315585865.88 num_examples: 1741 download_size: 315277855 dataset_size: 315585865.88 - config_name: subset_140 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - 
name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 363171291.466 num_examples: 1773 download_size: 358358262 dataset_size: 363171291.466 - config_name: subset_141 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 383956900.416 num_examples: 1872 download_size: 380114135 dataset_size: 383956900.416 - config_name: subset_142 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 381078393.54 num_examples: 1906 download_size: 380218832 dataset_size: 381078393.54 - config_name: subset_143 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 380967934.34 num_examples: 1901 download_size: 379622210 dataset_size: 380967934.34 - config_name: subset_144 features: - name: enA.audio 
dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 284053361.18 num_examples: 1391 download_size: 281375275 dataset_size: 284053361.18 - config_name: subset_15 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 345703994.68 num_examples: 1920 download_size: 347443074 dataset_size: 345703994.68 - config_name: subset_16 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 331502387.256 num_examples: 1874 download_size: 330665924 dataset_size: 331502387.256 - config_name: subset_17 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - 
name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 345364118.622 num_examples: 1894 download_size: 345895501 dataset_size: 345364118.622 - config_name: subset_18 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 349524837.675 num_examples: 1955 download_size: 357408084 dataset_size: 349524837.675 - config_name: subset_19 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 346525218.4 num_examples: 1920 download_size: 345890982 dataset_size: 346525218.4 - config_name: subset_2 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 
splits: - name: train num_bytes: 362488104.758 num_examples: 1934 download_size: 354865280 dataset_size: 362488104.758 - config_name: subset_20 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 350939026.25 num_examples: 1885 download_size: 341990731 dataset_size: 350939026.25 - config_name: subset_21 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 341215066.124 num_examples: 1774 download_size: 331849670 dataset_size: 341215066.124 - config_name: subset_22 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 336975532.536 num_examples: 1861 download_size: 335041433 dataset_size: 336975532.536 - config_name: subset_23 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: 
audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 328915586.396 num_examples: 1794 download_size: 328763242 dataset_size: 328915586.396 - config_name: subset_24 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 332737609.168 num_examples: 1762 download_size: 325215649 dataset_size: 332737609.168 - config_name: subset_25 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 358862142.102 num_examples: 1907 download_size: 362106233 dataset_size: 358862142.102 - config_name: subset_26 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: 
float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 358705148.232 num_examples: 1948 download_size: 353341914 dataset_size: 358705148.232 - config_name: subset_27 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 355566098.624 num_examples: 1928 download_size: 353149576 dataset_size: 355566098.624 - config_name: subset_28 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 365815404.452 num_examples: 1922 download_size: 356156160 dataset_size: 365815404.452 - config_name: subset_29 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train 
num_bytes: 354934591.488 num_examples: 1952 download_size: 354608609 dataset_size: 354934591.488 - config_name: subset_3 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 346798563.768 num_examples: 1904 download_size: 343495987 dataset_size: 346798563.768 - config_name: subset_30 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 361155095.104 num_examples: 1907 download_size: 354721798 dataset_size: 361155095.104 - config_name: subset_31 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 335059745.876 num_examples: 1814 download_size: 340653641 dataset_size: 335059745.876 - config_name: subset_32 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no 
dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 337790342.508 num_examples: 1804 download_size: 337683645 dataset_size: 337790342.508 - config_name: subset_33 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 334395079.204 num_examples: 1766 download_size: 333195229 dataset_size: 334395079.204 - config_name: subset_34 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 352353817.407 num_examples: 1897 download_size: 353893909 dataset_size: 352353817.407 - config_name: subset_35 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id 
dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 354852096.8 num_examples: 1935 download_size: 354770966 dataset_size: 354852096.8 - config_name: subset_36 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 344032713.228 num_examples: 1868 download_size: 348221349 dataset_size: 344032713.228 - config_name: subset_37 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 355957197.391 num_examples: 1893 download_size: 353589260 dataset_size: 355957197.391 - config_name: subset_38 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 359951388.335 
num_examples: 1901 download_size: 359991079 dataset_size: 359951388.335 - config_name: subset_39 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 355950399.686 num_examples: 1906 download_size: 351954997 dataset_size: 355950399.686 - config_name: subset_4 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 324875597.44 num_examples: 1840 download_size: 323033218 dataset_size: 324875597.44 - config_name: subset_40 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 359980567.238 num_examples: 1938 download_size: 361203498 dataset_size: 359980567.238 - config_name: subset_41 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: 
enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 341491079.89 num_examples: 1795 download_size: 342598966 dataset_size: 341491079.89 - config_name: subset_42 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 337050840.36 num_examples: 1810 download_size: 338982181 dataset_size: 337050840.36 - config_name: subset_43 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 333078752.306 num_examples: 1762 download_size: 331973425 dataset_size: 333078752.306 - config_name: subset_44 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url 
dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 354837353.808 num_examples: 1836 download_size: 352414746 dataset_size: 354837353.808 - config_name: subset_45 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 355504571.626 num_examples: 1897 download_size: 351898298 dataset_size: 355504571.626 - config_name: subset_46 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 366453524.485 num_examples: 1899 download_size: 361219728 dataset_size: 366453524.485 - config_name: subset_47 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 369537229.91 num_examples: 1910 
download_size: 366818569 dataset_size: 369537229.91 - config_name: subset_48 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 371055917.565 num_examples: 1921 download_size: 364934935 dataset_size: 371055917.565 - config_name: subset_49 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 365111049.931 num_examples: 1883 download_size: 363378366 dataset_size: 365111049.931 - config_name: subset_5 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 359687039.651 num_examples: 1997 download_size: 355215716 dataset_size: 359687039.651 - config_name: subset_50 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - 
name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 363903197.66 num_examples: 1956 download_size: 369802089 dataset_size: 363903197.66 - config_name: subset_51 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 349528411.223 num_examples: 1767 download_size: 345586795 dataset_size: 349528411.223 - config_name: subset_52 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 350742328.126 num_examples: 1787 download_size: 347735026 dataset_size: 350742328.126 - config_name: subset_53 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - 
name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 355375693.046 num_examples: 1854 download_size: 358497900 dataset_size: 355375693.046 - config_name: subset_54 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 331900210.526 num_examples: 1731 download_size: 335241695 dataset_size: 331900210.526 - config_name: subset_55 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 358310697.212 num_examples: 1874 download_size: 356550085 dataset_size: 358310697.212 - config_name: subset_56 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 370320570.7 num_examples: 1900 download_size: 368953320 
dataset_size: 370320570.7 - config_name: subset_57 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 369177867.17 num_examples: 1930 download_size: 365431010 dataset_size: 369177867.17 - config_name: subset_58 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 364750512.24 num_examples: 1915 download_size: 359038523 dataset_size: 364750512.24 - config_name: subset_59 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 366063836.56 num_examples: 1896 download_size: 361919269 dataset_size: 366063836.56 - config_name: subset_6 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - 
name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 330620046.596 num_examples: 1814 download_size: 324933334 dataset_size: 330620046.596 - config_name: subset_60 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 362027807.292 num_examples: 1919 download_size: 368443865 dataset_size: 362027807.292 - config_name: subset_61 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 336572343.386 num_examples: 1743 download_size: 336203973 dataset_size: 336572343.386 - config_name: subset_62 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: 
int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 347496680.375 num_examples: 1795 download_size: 344109877 dataset_size: 347496680.375 - config_name: subset_63 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 352974912.662 num_examples: 1798 download_size: 350775013 dataset_size: 352974912.662 - config_name: subset_64 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 353464515.368 num_examples: 1817 download_size: 352635014 dataset_size: 353464515.368 - config_name: subset_65 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 363826523.188 num_examples: 1884 download_size: 363596840 dataset_size: 363826523.188 - 
config_name: subset_66 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 370086286.518 num_examples: 1894 download_size: 366106457 dataset_size: 370086286.518 - config_name: subset_67 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 367206754.248 num_examples: 1883 download_size: 363308659 dataset_size: 367206754.248 - config_name: subset_68 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 369958948.708 num_examples: 1916 download_size: 375430648 dataset_size: 369958948.708 - config_name: subset_69 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: 
enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 377933959.326 num_examples: 1922 download_size: 371490261 dataset_size: 377933959.326 - config_name: subset_7 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 330330927.696 num_examples: 1836 download_size: 329514575 dataset_size: 330330927.696 - config_name: subset_70 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 372167954.944 num_examples: 1908 download_size: 367229781 dataset_size: 372167954.944 - config_name: subset_71 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - 
name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 340415056.848 num_examples: 1748 download_size: 342450661 dataset_size: 340415056.848 - config_name: subset_72 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 378986780.672 num_examples: 1894 download_size: 374832652 dataset_size: 378986780.672 - config_name: subset_73 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 340998735.385 num_examples: 1743 download_size: 339125356 dataset_size: 340998735.385 - config_name: subset_74 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 366140124.688 num_examples: 1832 download_size: 359991384 dataset_size: 366140124.688 - config_name: 
subset_75 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 360268108.16 num_examples: 1868 download_size: 367867795 dataset_size: 360268108.16 - config_name: subset_76 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 365777697.252 num_examples: 1919 download_size: 370998271 dataset_size: 365777697.252 - config_name: subset_77 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 367151012.651 num_examples: 1907 download_size: 370795072 dataset_size: 367151012.651 - config_name: subset_78 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 
- name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 367935265.25 num_examples: 1875 download_size: 368403671 dataset_size: 367935265.25 - config_name: subset_79 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 370145543.78 num_examples: 1905 download_size: 365886501 dataset_size: 370145543.78 - config_name: subset_8 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 361935515.97 num_examples: 2010 download_size: 353248114 dataset_size: 361935515.97 - config_name: subset_80 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - 
name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 368484664.482 num_examples: 1886 download_size: 363208555 dataset_size: 368484664.482 - config_name: subset_81 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 382681091.404 num_examples: 1922 download_size: 378494301 dataset_size: 382681091.404 - config_name: subset_82 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 376125305.856 num_examples: 1912 download_size: 376672124 dataset_size: 376125305.856 - config_name: subset_83 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 371577857.09 num_examples: 1893 download_size: 369316183 dataset_size: 371577857.09 - config_name: subset_84 features: - name: enA.audio 
dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 369560463.322 num_examples: 1874 download_size: 365140195 dataset_size: 369560463.322 - config_name: subset_85 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 364358029.998 num_examples: 1889 download_size: 364548198 dataset_size: 364358029.998 - config_name: subset_86 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 376636073.672 num_examples: 1872 download_size: 373150870 dataset_size: 376636073.672 - config_name: subset_87 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: 
int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 362562530.995 num_examples: 1905 download_size: 369354649 dataset_size: 362562530.995 - config_name: subset_88 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 381751130.869 num_examples: 1909 download_size: 373678648 dataset_size: 381751130.869 - config_name: subset_89 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 371633741.0 num_examples: 1895 download_size: 372501566 dataset_size: 371633741.0 - config_name: subset_9 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: 
float64 splits: - name: train num_bytes: 348422943.515 num_examples: 1985 download_size: 351519803 dataset_size: 348422943.515 - config_name: subset_90 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 373825088.18 num_examples: 1924 download_size: 375226691 dataset_size: 373825088.18 - config_name: subset_91 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 373896143.252 num_examples: 1922 download_size: 374160814 dataset_size: 373896143.252 - config_name: subset_92 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 364351588.28 num_examples: 1890 download_size: 365888274 dataset_size: 364351588.28 - config_name: subset_93 features: - name: jaA.audio dtype: audio - name: enA.audio 
dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 387866261.842 num_examples: 1881 download_size: 375615708 dataset_size: 387866261.842 - config_name: subset_94 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 377305622.574 num_examples: 1907 download_size: 373264809 dataset_size: 377305622.574 - config_name: subset_95 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 369064503.5 num_examples: 1875 download_size: 367451772 dataset_size: 369064503.5 - config_name: subset_96 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: 
float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 377780781.221 num_examples: 1903 download_size: 378226431 dataset_size: 377780781.221 - config_name: subset_97 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 373154202.246 num_examples: 1906 download_size: 376439653 dataset_size: 373154202.246 - config_name: subset_98 features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 368477992.163 num_examples: 1911 download_size: 374903560 dataset_size: 368477992.163 - config_name: subset_99 features: - name: enA.audio dtype: audio - name: jaA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train 
num_bytes: 380696639.606 num_examples: 1907 download_size: 380352674 dataset_size: 380696639.606 - config_name: subset_test features: - name: jaA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: jaA.id dtype: string - name: jaA.url dtype: string - name: jaA.duration_start dtype: int64 - name: jaA.duration_end dtype: int64 - name: jaA.laser_score dtype: float64 splits: - name: train num_bytes: 1530085.0 num_examples: 8 download_size: 1506623 dataset_size: 1530085.0 configs: - config_name: subset_1 data_files: - split: train path: subset_1/train-* - config_name: subset_10 data_files: - split: train path: subset_10/train-* - config_name: subset_100 data_files: - split: train path: subset_100/train-* - config_name: subset_101 data_files: - split: train path: subset_101/train-* - config_name: subset_102 data_files: - split: train path: subset_102/train-* - config_name: subset_103 data_files: - split: train path: subset_103/train-* - config_name: subset_104 data_files: - split: train path: subset_104/train-* - config_name: subset_105 data_files: - split: train path: subset_105/train-* - config_name: subset_106 data_files: - split: train path: subset_106/train-* - config_name: subset_107 data_files: - split: train path: subset_107/train-* - config_name: subset_108 data_files: - split: train path: subset_108/train-* - config_name: subset_109 data_files: - split: train path: subset_109/train-* - config_name: subset_11 data_files: - split: train path: subset_11/train-* - config_name: subset_110 data_files: - split: train path: subset_110/train-* - config_name: subset_111 data_files: - split: train path: subset_111/train-* - config_name: subset_112 data_files: - split: train path: subset_112/train-* - config_name: subset_113 data_files: - split: train path: 
subset_113/train-* - config_name: subset_114 data_files: - split: train path: subset_114/train-* - config_name: subset_115 data_files: - split: train path: subset_115/train-* - config_name: subset_116 data_files: - split: train path: subset_116/train-* - config_name: subset_117 data_files: - split: train path: subset_117/train-* - config_name: subset_118 data_files: - split: train path: subset_118/train-* - config_name: subset_119 data_files: - split: train path: subset_119/train-* - config_name: subset_12 data_files: - split: train path: subset_12/train-* - config_name: subset_120 data_files: - split: train path: subset_120/train-* - config_name: subset_121 data_files: - split: train path: subset_121/train-* - config_name: subset_122 data_files: - split: train path: subset_122/train-* - config_name: subset_123 data_files: - split: train path: subset_123/train-* - config_name: subset_124 data_files: - split: train path: subset_124/train-* - config_name: subset_125 data_files: - split: train path: subset_125/train-* - config_name: subset_126 data_files: - split: train path: subset_126/train-* - config_name: subset_127 data_files: - split: train path: subset_127/train-* - config_name: subset_128 data_files: - split: train path: subset_128/train-* - config_name: subset_129 data_files: - split: train path: subset_129/train-* - config_name: subset_13 data_files: - split: train path: subset_13/train-* - config_name: subset_130 data_files: - split: train path: subset_130/train-* - config_name: subset_131 data_files: - split: train path: subset_131/train-* - config_name: subset_132 data_files: - split: train path: subset_132/train-* - config_name: subset_133 data_files: - split: train path: subset_133/train-* - config_name: subset_134 data_files: - split: train path: subset_134/train-* - config_name: subset_135 data_files: - split: train path: subset_135/train-* - config_name: subset_136 data_files: - split: train path: subset_136/train-* - config_name: subset_137 
data_files: - split: train path: subset_137/train-* - config_name: subset_138 data_files: - split: train path: subset_138/train-* - config_name: subset_139 data_files: - split: train path: subset_139/train-* - config_name: subset_14 data_files: - split: train path: subset_14/train-* - config_name: subset_140 data_files: - split: train path: subset_140/train-* - config_name: subset_141 data_files: - split: train path: subset_141/train-* - config_name: subset_142 data_files: - split: train path: subset_142/train-* - config_name: subset_143 data_files: - split: train path: subset_143/train-* - config_name: subset_144 data_files: - split: train path: subset_144/train-* - config_name: subset_15 data_files: - split: train path: subset_15/train-* - config_name: subset_16 data_files: - split: train path: subset_16/train-* - config_name: subset_17 data_files: - split: train path: subset_17/train-* - config_name: subset_18 data_files: - split: train path: subset_18/train-* - config_name: subset_19 data_files: - split: train path: subset_19/train-* - config_name: subset_2 data_files: - split: train path: subset_2/train-* - config_name: subset_20 data_files: - split: train path: subset_20/train-* - config_name: subset_21 data_files: - split: train path: subset_21/train-* - config_name: subset_22 data_files: - split: train path: subset_22/train-* - config_name: subset_23 data_files: - split: train path: subset_23/train-* - config_name: subset_24 data_files: - split: train path: subset_24/train-* - config_name: subset_25 data_files: - split: train path: subset_25/train-* - config_name: subset_26 data_files: - split: train path: subset_26/train-* - config_name: subset_27 data_files: - split: train path: subset_27/train-* - config_name: subset_28 data_files: - split: train path: subset_28/train-* - config_name: subset_29 data_files: - split: train path: subset_29/train-* - config_name: subset_3 data_files: - split: train path: subset_3/train-* - config_name: subset_30 data_files: 
- split: train path: subset_30/train-* - config_name: subset_31 data_files: - split: train path: subset_31/train-* - config_name: subset_32 data_files: - split: train path: subset_32/train-* - config_name: subset_33 data_files: - split: train path: subset_33/train-* - config_name: subset_34 data_files: - split: train path: subset_34/train-* - config_name: subset_35 data_files: - split: train path: subset_35/train-* - config_name: subset_36 data_files: - split: train path: subset_36/train-* - config_name: subset_37 data_files: - split: train path: subset_37/train-* - config_name: subset_38 data_files: - split: train path: subset_38/train-* - config_name: subset_39 data_files: - split: train path: subset_39/train-* - config_name: subset_4 data_files: - split: train path: subset_4/train-* - config_name: subset_40 data_files: - split: train path: subset_40/train-* - config_name: subset_41 data_files: - split: train path: subset_41/train-* - config_name: subset_42 data_files: - split: train path: subset_42/train-* - config_name: subset_43 data_files: - split: train path: subset_43/train-* - config_name: subset_44 data_files: - split: train path: subset_44/train-* - config_name: subset_45 data_files: - split: train path: subset_45/train-* - config_name: subset_46 data_files: - split: train path: subset_46/train-* - config_name: subset_47 data_files: - split: train path: subset_47/train-* - config_name: subset_48 data_files: - split: train path: subset_48/train-* - config_name: subset_49 data_files: - split: train path: subset_49/train-* - config_name: subset_5 data_files: - split: train path: subset_5/train-* - config_name: subset_50 data_files: - split: train path: subset_50/train-* - config_name: subset_51 data_files: - split: train path: subset_51/train-* - config_name: subset_52 data_files: - split: train path: subset_52/train-* - config_name: subset_53 data_files: - split: train path: subset_53/train-* - config_name: subset_54 data_files: - split: train path: 
subset_54/train-* - config_name: subset_55 data_files: - split: train path: subset_55/train-* - config_name: subset_56 data_files: - split: train path: subset_56/train-* - config_name: subset_57 data_files: - split: train path: subset_57/train-* - config_name: subset_58 data_files: - split: train path: subset_58/train-* - config_name: subset_59 data_files: - split: train path: subset_59/train-* - config_name: subset_6 data_files: - split: train path: subset_6/train-* - config_name: subset_60 data_files: - split: train path: subset_60/train-* - config_name: subset_61 data_files: - split: train path: subset_61/train-* - config_name: subset_62 data_files: - split: train path: subset_62/train-* - config_name: subset_63 data_files: - split: train path: subset_63/train-* - config_name: subset_64 data_files: - split: train path: subset_64/train-* - config_name: subset_65 data_files: - split: train path: subset_65/train-* - config_name: subset_66 data_files: - split: train path: subset_66/train-* - config_name: subset_67 data_files: - split: train path: subset_67/train-* - config_name: subset_68 data_files: - split: train path: subset_68/train-* - config_name: subset_69 data_files: - split: train path: subset_69/train-* - config_name: subset_7 data_files: - split: train path: subset_7/train-* - config_name: subset_70 data_files: - split: train path: subset_70/train-* - config_name: subset_71 data_files: - split: train path: subset_71/train-* - config_name: subset_72 data_files: - split: train path: subset_72/train-* - config_name: subset_73 data_files: - split: train path: subset_73/train-* - config_name: subset_74 data_files: - split: train path: subset_74/train-* - config_name: subset_75 data_files: - split: train path: subset_75/train-* - config_name: subset_76 data_files: - split: train path: subset_76/train-* - config_name: subset_77 data_files: - split: train path: subset_77/train-* - config_name: subset_78 data_files: - split: train path: subset_78/train-* - 
config_name: subset_79 data_files: - split: train path: subset_79/train-* - config_name: subset_8 data_files: - split: train path: subset_8/train-* - config_name: subset_80 data_files: - split: train path: subset_80/train-* - config_name: subset_81 data_files: - split: train path: subset_81/train-* - config_name: subset_82 data_files: - split: train path: subset_82/train-* - config_name: subset_83 data_files: - split: train path: subset_83/train-* - config_name: subset_84 data_files: - split: train path: subset_84/train-* - config_name: subset_85 data_files: - split: train path: subset_85/train-* - config_name: subset_86 data_files: - split: train path: subset_86/train-* - config_name: subset_87 data_files: - split: train path: subset_87/train-* - config_name: subset_88 data_files: - split: train path: subset_88/train-* - config_name: subset_89 data_files: - split: train path: subset_89/train-* - config_name: subset_9 data_files: - split: train path: subset_9/train-* - config_name: subset_90 data_files: - split: train path: subset_90/train-* - config_name: subset_91 data_files: - split: train path: subset_91/train-* - config_name: subset_92 data_files: - split: train path: subset_92/train-* - config_name: subset_93 data_files: - split: train path: subset_93/train-* - config_name: subset_94 data_files: - split: train path: subset_94/train-* - config_name: subset_95 data_files: - split: train path: subset_95/train-* - config_name: subset_96 data_files: - split: train path: subset_96/train-* - config_name: subset_97 data_files: - split: train path: subset_97/train-* - config_name: subset_98 data_files: - split: train path: subset_98/train-* - config_name: subset_99 data_files: - split: train path: subset_99/train-* - config_name: subset_test data_files: - split: train path: subset_test/train-* ---
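The configs block above maps each config name to a `train` split stored under `<config_name>/train-*`. A minimal sketch of enumerating those configs and loading one of them follows. It assumes the Hugging Face `datasets` library is installed and the repository is reachable. The helper names (`config_names`, `data_files_pattern`, `load_subset`) are hypothetical and not part of the dataset card. The count of 144 numbered subsets is read off the configs list above.

```python
# Repository id as shown on this page.
REPO_ID = "asahi417/seamless-align-enA-jaA"
# Highest subset_N that appears in the configs list of the card.
NUM_NUMBERED_SUBSETS = 144


def config_names(include_test=True):
    """Return subset_1..subset_144, optionally followed by subset_test."""
    names = [f"subset_{i}" for i in range(1, NUM_NUMBERED_SUBSETS + 1)]
    if include_test:
        names.append("subset_test")
    return names


def data_files_pattern(config_name):
    """Glob the card declares for a config's train split, e.g. subset_1/train-*."""
    return f"{config_name}/train-*"


def load_subset(config_name="subset_test"):
    """Load the train split of one config (needs `datasets` and network access)."""
    if config_name not in config_names():
        raise ValueError(f"unknown config: {config_name}")
    # Imported lazily so the helpers above work without the library installed.
    from datasets import load_dataset
    return load_dataset(REPO_ID, config_name, split="train")
```

As a usage example, `load_subset("subset_test")` fetches the smallest config (8 examples, roughly 1.5 MB according to the card). Scalar columns such as `line_no` or `enA.laser_score` can then be read from a row by name. Decoding the `enA.audio` and `jaA.audio` columns may need extra audio dependencies.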
Provider:
asahi417
Summary of original information

Dataset overview

Subset configuration information

  1. Subset 1 (config_name: subset_1)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 2081 examples, 392397221.989 bytes
    • Download size: 386957004 bytes
    • Dataset size: 392397221.989 bytes
  2. Subset 10 (config_name: subset_10)

    • Features:
      • jaA.audio: audio
      • enA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1965 examples, 351659087.73 bytes
    • Download size: 347647106 bytes
    • Dataset size: 351659087.73 bytes
  3. Subset 100 (config_name: subset_100)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1763 examples, 347439311.634 bytes
    • Download size: 344710645 bytes
    • Dataset size: 347439311.634 bytes
  4. Subset 101 (config_name: subset_101)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1875 examples, 365547481.5 bytes
    • Download size: 362661933 bytes
    • Dataset size: 365547481.5 bytes
  5. Subset 102 (config_name: subset_102)

    • Features:
      • jaA.audio: audio
      • enA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1881 examples, 381968578.736 bytes
    • Download size: 378806119 bytes
    • Dataset size: 381968578.736 bytes
  6. Subset 103 (config_name: subset_103)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1892 examples, 377107099.288 bytes
    • Download size: 376129169 bytes
    • Dataset size: 377107099.288 bytes
  7. Subset 104 (config_name: subset_104)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1906 examples, 381521507.888 bytes
    • Download size: 373259505 bytes
    • Dataset size: 381521507.888 bytes
  8. Subset 105 (config_name: subset_105)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1883 examples, 368079639.417 bytes
    • Download size: 369775684 bytes
    • Dataset size: 368079639.417 bytes
  9. Subset 106 (config_name: subset_106)

    • Features:
      • jaA.audio: audio
      • enA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1892 examples, 372391250.272 bytes
    • Download size: 371305914 bytes
    • Dataset size: 372391250.272 bytes
  10. Subset 107 (config_name: subset_107)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1858 examples, 367981287.206 bytes
    • Download size: 367316048 bytes
    • Dataset size: 367981287.206 bytes
  11. Subset 108 (config_name: subset_108)

    • Features:
      • jaA.audio: audio
      • enA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1841 examples, 370223944.467 bytes
    • Download size: 372346370 bytes
    • Dataset size: 370223944.467 bytes
  12. Subset 109 (config_name: subset_109)

    • Features:
      • jaA.audio: audio
      • enA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1785 examples, 357015475.43 bytes
    • Download size: 352298722 bytes
    • Dataset size: 357015475.43 bytes
  13. Subset 11 (config_name: subset_11)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1785 examples, 315461317.835 bytes
    • Download size: 317212663 bytes
    • Dataset size: 315461317.835 bytes
  14. Subset 110 (config_name: subset_110)

    • Features:
      • enA.audio: audio
      • jaA.audio: audio
      • line_no: integer
      • enA.id: string
      • enA.url: string
      • enA.duration_start: integer
      • enA.duration_end: integer
      • enA.laser_score: float
      • jaA.id: string
      • jaA.url: string
      • jaA.duration_start: integer
      • jaA.duration_end: integer
      • jaA.laser_score: float
    • Splits:
      • train: 1915 examples, 385144074.155 bytes
    • Download size: 379037748 bytes
    • Dataset size: 385144074.155 bytes
  15. Subset 111 (config_name: subset_111)

    • Features:
      • jaA.audio: audio
      • enA.audio: audio
      • line_no: integer
      • enA.id: string
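
The subset listing above can be turned into a small loading helper. The following is a minimal sketch, not the dataset author's own code: the helper names (`config_name`, `load_subset`, `filter_by_laser_score`) and the score threshold are hypothetical, and the filter assumes that a higher `laser_score` indicates a better-aligned pair, which the listing itself does not state. `load_subset` relies on the Hugging Face `datasets` library and network access; the other helpers work on any iterable of rows with the field names listed above.

```python
from typing import Any, Iterable, Iterator, Mapping

# Repository ID taken from the dataset page.
REPO_ID = "asahi417/seamless-align-enA-jaA"


def config_name(subset: int) -> str:
    """Build the config name for a numbered subset, e.g. 1 -> 'subset_1'."""
    return f"subset_{subset}"


def load_subset(subset: int):
    """Load the train split of one subset (needs `datasets` and network access)."""
    # Imported lazily so the pure helpers in this module work without `datasets`.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config_name(subset), split="train")


def filter_by_laser_score(
    rows: Iterable[Mapping[str, Any]], threshold: float
) -> Iterator[Mapping[str, Any]]:
    """Yield rows whose enA and jaA scores are both at or above `threshold`.

    Assumption: higher scores mean better alignment; the threshold is illustrative.
    """
    for row in rows:
        if (
            row["enA.laser_score"] >= threshold
            and row["jaA.laser_score"] >= threshold
        ):
            yield row
```

For example, `filter_by_laser_score(load_subset(1), 1.1)` would iterate over the rows of `subset_1` that pass a hypothetical threshold of 1.1. Each subset is a separate config of roughly 350 to 390 MB, so loading them one at a time keeps downloads manageable.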