
asahi417/seamless-align-enA-hiA

Source: Hugging Face. Updated 2024-05-30; indexed 2024-06-12.
Download link:
https://hf-mirror.com/datasets/asahi417/seamless-align-enA-hiA
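The sketch below shows one way a single config from this repository might be loaded and filtered. It assumes the Hugging Face `datasets` package is installed and that configs follow the `subset_N` naming seen in the resource description. The column names (`enA.laser_score`, `hiA.laser_score`, `line_no`) are taken from that feature list. The helper names and the score threshold are hypothetical and chosen only for illustration.

```python
from typing import Any, Mapping

# Repository id as it appears in the download link above.
REPO_ID = "asahi417/seamless-align-enA-hiA"


def config_name(index: int) -> str:
    """Return the config name for a 1-based subset index, e.g. 1 -> 'subset_1'."""
    if index < 1:
        raise ValueError("subset index must be >= 1")
    return f"subset_{index}"


def load_subset(index: int, streaming: bool = True):
    """Load the train split of one config (needs the `datasets` package and network access)."""
    # Imported lazily so the pure helpers in this file work without `datasets` installed.
    from datasets import load_dataset

    return load_dataset(REPO_ID, config_name(index), split="train", streaming=streaming)


def keep_pair(row: Mapping[str, Any], min_score: float) -> bool:
    """True if both sides of an aligned pair reach the caller-chosen LASER-score threshold."""
    return row["enA.laser_score"] >= min_score and row["hiA.laser_score"] >= min_score


# Assumed usage (not executed here; 1.1 is an arbitrary example threshold):
#   for row in load_subset(1):
#       if keep_pair(row, min_score=1.1):
#           print(row["line_no"], row["enA.id"], row["hiA.id"])
```

Streaming is used by default here because each config is a few hundred megabytes of audio. Passing `streaming=False` would download the whole config first. To fetch through the mirror host shown in the link, setting the `HF_ENDPOINT` environment variable to that host before importing `datasets` should work, but treat that as an assumption to verify in your own environment.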
Resource description:
--- dataset_info: - config_name: subset_1 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 410483653.543 num_examples: 2297 download_size: 412555784 dataset_size: 410483653.543 - config_name: subset_10 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 271862277.536 num_examples: 2032 download_size: 273365121 dataset_size: 271862277.536 - config_name: subset_11 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 263150120.56 num_examples: 1988 download_size: 261518751 dataset_size: 263150120.56 - config_name: subset_12 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: 
enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 269466728.23 num_examples: 2011 download_size: 262544262 dataset_size: 269466728.23 - config_name: subset_13 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 253581799.08 num_examples: 1940 download_size: 249339076 dataset_size: 253581799.08 - config_name: subset_14 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 267950451.912 num_examples: 1984 download_size: 261855591 dataset_size: 267950451.912 - config_name: subset_15 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - 
name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 247393968.885 num_examples: 1965 download_size: 247129654 dataset_size: 247393968.885 - config_name: subset_16 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 257412810.412 num_examples: 2006 download_size: 256982287 dataset_size: 257412810.412 - config_name: subset_17 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 269596131.776 num_examples: 2028 download_size: 272294201 dataset_size: 269596131.776 - config_name: subset_18 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 253857527.84 num_examples: 1996 download_size: 258901774 dataset_size: 253857527.84 - config_name: 
subset_19 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 261430690.512 num_examples: 1968 download_size: 259714920 dataset_size: 261430690.512 - config_name: subset_2 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 419562006.72 num_examples: 2340 download_size: 401315570 dataset_size: 419562006.72 - config_name: subset_20 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 265881917.419 num_examples: 1977 download_size: 260081233 dataset_size: 265881917.419 - config_name: subset_21 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - 
name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 262436473.241 num_examples: 1979 download_size: 258149267 dataset_size: 262436473.241 - config_name: subset_22 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 266669226.24 num_examples: 2024 download_size: 266112306 dataset_size: 266669226.24 - config_name: subset_23 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 258802972.248 num_examples: 1986 download_size: 255535922 dataset_size: 258802972.248 - config_name: subset_24 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: 
int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 260110000.944 num_examples: 1976 download_size: 259990251 dataset_size: 260110000.944 - config_name: subset_25 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 261131466.28 num_examples: 1980 download_size: 254777016 dataset_size: 261131466.28 - config_name: subset_26 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 257491740.05 num_examples: 1938 download_size: 257527316 dataset_size: 257491740.05 - config_name: subset_27 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 259895156.762 num_examples: 1963 download_size: 256182167 dataset_size: 259895156.762 - config_name: subset_28 features: - name: 
hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 270180442.326 num_examples: 1963 download_size: 264265589 dataset_size: 270180442.326 - config_name: subset_29 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 261116231.93 num_examples: 1955 download_size: 259846823 dataset_size: 261116231.93 - config_name: subset_3 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 382949661.988 num_examples: 2284 download_size: 363390598 dataset_size: 382949661.988 - config_name: subset_30 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end 
dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 257918518.82 num_examples: 1940 download_size: 255822545 dataset_size: 257918518.82 - config_name: subset_31 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 257545460.061 num_examples: 1941 download_size: 254066493 dataset_size: 257545460.061 - config_name: subset_32 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 269109273.799 num_examples: 1969 download_size: 264795977 dataset_size: 269109273.799 - config_name: subset_33 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: 
hiA.laser_score dtype: float64 splits: - name: train num_bytes: 258077454.974 num_examples: 1958 download_size: 255785580 dataset_size: 258077454.974 - config_name: subset_34 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 249061629.604 num_examples: 1876 download_size: 248289822 dataset_size: 249061629.604 - config_name: subset_35 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 255228029.7 num_examples: 1900 download_size: 253679553 dataset_size: 255228029.7 - config_name: subset_36 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 267498234.804 num_examples: 1922 download_size: 262892101 dataset_size: 267498234.804 - config_name: subset_37 features: - name: enA.audio dtype: 
audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 259200364.638 num_examples: 1927 download_size: 261353101 dataset_size: 259200364.638 - config_name: subset_38 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 255339736.1 num_examples: 1900 download_size: 256796714 dataset_size: 255339736.1 - config_name: subset_39 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 256112748.759 num_examples: 1909 download_size: 255075229 dataset_size: 256112748.759 - config_name: subset_4 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: 
enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 342676580.064 num_examples: 2222 download_size: 336637408 dataset_size: 342676580.064 - config_name: subset_40 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 264478247.031 num_examples: 1897 download_size: 261613430 dataset_size: 264478247.031 - config_name: subset_41 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 263047391.246 num_examples: 1938 download_size: 264335335 dataset_size: 263047391.246 - config_name: subset_42 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 
splits: - name: train num_bytes: 271433615.126 num_examples: 1926 download_size: 269093305 dataset_size: 271433615.126 - config_name: subset_43 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 267302333.822 num_examples: 1931 download_size: 262446815 dataset_size: 267302333.822 - config_name: subset_44 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 258089650.374 num_examples: 1907 download_size: 257901062 dataset_size: 258089650.374 - config_name: subset_45 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 267126530.464 num_examples: 1906 download_size: 259715921 dataset_size: 267126530.464 - config_name: subset_46 features: - name: hiA.audio dtype: audio - name: enA.audio 
dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 267397061.684 num_examples: 1894 download_size: 266311610 dataset_size: 267397061.684 - config_name: subset_47 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 261808732.984 num_examples: 1922 download_size: 261447972 dataset_size: 261808732.984 - config_name: subset_48 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 266204953.827 num_examples: 1903 download_size: 262098097 dataset_size: 266204953.827 - config_name: subset_49 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score 
dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 272582963.713 num_examples: 1937 download_size: 272699393 dataset_size: 272582963.713 - config_name: subset_5 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 327466047.367 num_examples: 2207 download_size: 317127305 dataset_size: 327466047.367 - config_name: subset_50 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 269820211.415 num_examples: 1897 download_size: 265934237 dataset_size: 269820211.415 - config_name: subset_51 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: 
train num_bytes: 263331384.278 num_examples: 1867 download_size: 260476626 dataset_size: 263331384.278 - config_name: subset_52 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 273667430.075 num_examples: 1951 download_size: 270497322 dataset_size: 273667430.075 - config_name: subset_53 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 269233156.328 num_examples: 1924 download_size: 263876114 dataset_size: 269233156.328 - config_name: subset_54 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 275202109.892 num_examples: 1972 download_size: 274750233 dataset_size: 275202109.892 - config_name: subset_55 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: 
line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 273027927.16 num_examples: 1905 download_size: 267781879 dataset_size: 273027927.16 - config_name: subset_56 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 266974653.26 num_examples: 1940 download_size: 266937007 dataset_size: 266974653.26 - config_name: subset_57 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 276419581.036 num_examples: 1922 download_size: 271467539 dataset_size: 276419581.036 - config_name: subset_58 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id 
dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 273876030.42 num_examples: 1945 download_size: 271013902 dataset_size: 273876030.42 - config_name: subset_59 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 277266268.083 num_examples: 1913 download_size: 272249948 dataset_size: 277266268.083 - config_name: subset_6 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 293136723.216 num_examples: 2124 download_size: 289237116 dataset_size: 293136723.216 - config_name: subset_60 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 260599219.68 
num_examples: 1878 download_size: 261904714 dataset_size: 260599219.68 - config_name: subset_61 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 272031128.58 num_examples: 1910 download_size: 270271562 dataset_size: 272031128.58 - config_name: subset_62 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 283231593.451 num_examples: 1893 download_size: 276490773 dataset_size: 283231593.451 - config_name: subset_63 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 264515944.95 num_examples: 1866 download_size: 267500412 dataset_size: 264515944.95 - config_name: subset_64 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id 
dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 271157263.404 num_examples: 1901 download_size: 270389115 dataset_size: 271157263.404 - config_name: subset_65 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 263982137.388 num_examples: 1892 download_size: 259853366 dataset_size: 263982137.388 - config_name: subset_66 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 279413856.67 num_examples: 1926 download_size: 275955524 dataset_size: 279413856.67 - config_name: subset_67 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url 
dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 271311546.505 num_examples: 1915 download_size: 277606157 dataset_size: 271311546.505 - config_name: subset_68 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 273278156.474 num_examples: 1943 download_size: 275128892 dataset_size: 273278156.474 - config_name: subset_69 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 285372466.172 num_examples: 1941 download_size: 281447689 dataset_size: 285372466.172 - config_name: subset_7 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 290634380.08 num_examples: 2088 
download_size: 284304489 dataset_size: 290634380.08 - config_name: subset_70 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 271127223.452 num_examples: 1892 download_size: 271987662 dataset_size: 271127223.452 - config_name: subset_71 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 291662957.67 num_examples: 1935 download_size: 281437061 dataset_size: 291662957.67 - config_name: subset_72 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 286940524.725 num_examples: 1925 download_size: 283723212 dataset_size: 286940524.725 - config_name: subset_73 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - 
name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 278124571.71 num_examples: 1905 download_size: 279731683 dataset_size: 278124571.71 - config_name: subset_74 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 274648738.375 num_examples: 1905 download_size: 271511496 dataset_size: 274648738.375 - config_name: subset_75 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 281728075.978 num_examples: 1934 download_size: 277198651 dataset_size: 281728075.978 - config_name: subset_76 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - 
name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 279537195.492 num_examples: 1916 download_size: 277980448 dataset_size: 279537195.492 - config_name: subset_77 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 280652372.07 num_examples: 1930 download_size: 283471189 dataset_size: 280652372.07 - config_name: subset_78 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 280137011.501 num_examples: 1917 download_size: 273227350 dataset_size: 280137011.501 - config_name: subset_79 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 281722137.835 num_examples: 1955 download_size: 284817049 
dataset_size: 281722137.835 - config_name: subset_8 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 275023448.12 num_examples: 2085 download_size: 272189090 dataset_size: 275023448.12 - config_name: subset_80 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 267158670.44 num_examples: 1882 download_size: 270811360 dataset_size: 267158670.44 - config_name: subset_81 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 278817727.146 num_examples: 1906 download_size: 282495649 dataset_size: 278817727.146 - config_name: subset_82 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string 
- name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 279204927.466 num_examples: 1899 download_size: 279159018 dataset_size: 279204927.466 - config_name: subset_83 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 271094408.495 num_examples: 1885 download_size: 275435863 dataset_size: 271094408.495 - config_name: subset_84 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 280637661.348 num_examples: 1909 download_size: 276004263 dataset_size: 280637661.348 - config_name: subset_85 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start 
dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 290199186.01 num_examples: 1990 download_size: 291059518 dataset_size: 290199186.01 - config_name: subset_86 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 288138249.606 num_examples: 1938 download_size: 288830100 dataset_size: 288138249.606 - config_name: subset_87 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 268557700.352 num_examples: 1858 download_size: 267654935 dataset_size: 268557700.352 - config_name: subset_88 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 268882003.15999997 num_examples: 1855 download_size: 269433081 dataset_size: 
268882003.15999997 - config_name: subset_89 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 287281040.752 num_examples: 1904 download_size: 281107117 dataset_size: 287281040.752 - config_name: subset_9 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 276275613.24 num_examples: 2030 download_size: 273610084 dataset_size: 276275613.24 - config_name: subset_90 features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 279146081.983 num_examples: 1901 download_size: 273712476 dataset_size: 279146081.983 - config_name: subset_91 features: - name: hiA.audio dtype: audio - name: enA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: 
enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 270773993.542 num_examples: 1833 download_size: 268404383 dataset_size: 270773993.542 - config_name: subset_test features: - name: enA.audio dtype: audio - name: hiA.audio dtype: audio - name: line_no dtype: int64 - name: enA.id dtype: string - name: enA.url dtype: string - name: enA.duration_start dtype: int64 - name: enA.duration_end dtype: int64 - name: enA.laser_score dtype: float64 - name: hiA.id dtype: string - name: hiA.url dtype: string - name: hiA.duration_start dtype: int64 - name: hiA.duration_end dtype: int64 - name: hiA.laser_score dtype: float64 splits: - name: train num_bytes: 1374410.0 num_examples: 8 download_size: 1381522 dataset_size: 1374410.0 configs: - config_name: subset_1 data_files: - split: train path: subset_1/train-* - config_name: subset_10 data_files: - split: train path: subset_10/train-* - config_name: subset_11 data_files: - split: train path: subset_11/train-* - config_name: subset_12 data_files: - split: train path: subset_12/train-* - config_name: subset_13 data_files: - split: train path: subset_13/train-* - config_name: subset_14 data_files: - split: train path: subset_14/train-* - config_name: subset_15 data_files: - split: train path: subset_15/train-* - config_name: subset_16 data_files: - split: train path: subset_16/train-* - config_name: subset_17 data_files: - split: train path: subset_17/train-* - config_name: subset_18 data_files: - split: train path: subset_18/train-* - config_name: subset_19 data_files: - split: train path: subset_19/train-* - config_name: subset_2 data_files: - split: train path: subset_2/train-* - config_name: subset_20 data_files: - split: train path: subset_20/train-* - config_name: 
subset_21 data_files: - split: train path: subset_21/train-* - config_name: subset_22 data_files: - split: train path: subset_22/train-* - config_name: subset_23 data_files: - split: train path: subset_23/train-* - config_name: subset_24 data_files: - split: train path: subset_24/train-* - config_name: subset_25 data_files: - split: train path: subset_25/train-* - config_name: subset_26 data_files: - split: train path: subset_26/train-* - config_name: subset_27 data_files: - split: train path: subset_27/train-* - config_name: subset_28 data_files: - split: train path: subset_28/train-* - config_name: subset_29 data_files: - split: train path: subset_29/train-* - config_name: subset_3 data_files: - split: train path: subset_3/train-* - config_name: subset_30 data_files: - split: train path: subset_30/train-* - config_name: subset_31 data_files: - split: train path: subset_31/train-* - config_name: subset_32 data_files: - split: train path: subset_32/train-* - config_name: subset_33 data_files: - split: train path: subset_33/train-* - config_name: subset_34 data_files: - split: train path: subset_34/train-* - config_name: subset_35 data_files: - split: train path: subset_35/train-* - config_name: subset_36 data_files: - split: train path: subset_36/train-* - config_name: subset_37 data_files: - split: train path: subset_37/train-* - config_name: subset_38 data_files: - split: train path: subset_38/train-* - config_name: subset_39 data_files: - split: train path: subset_39/train-* - config_name: subset_4 data_files: - split: train path: subset_4/train-* - config_name: subset_40 data_files: - split: train path: subset_40/train-* - config_name: subset_41 data_files: - split: train path: subset_41/train-* - config_name: subset_42 data_files: - split: train path: subset_42/train-* - config_name: subset_43 data_files: - split: train path: subset_43/train-* - config_name: subset_44 data_files: - split: train path: subset_44/train-* - config_name: subset_45 data_files: - 
split: train path: subset_45/train-* - config_name: subset_46 data_files: - split: train path: subset_46/train-* - config_name: subset_47 data_files: - split: train path: subset_47/train-* - config_name: subset_48 data_files: - split: train path: subset_48/train-* - config_name: subset_49 data_files: - split: train path: subset_49/train-* - config_name: subset_5 data_files: - split: train path: subset_5/train-* - config_name: subset_50 data_files: - split: train path: subset_50/train-* - config_name: subset_51 data_files: - split: train path: subset_51/train-* - config_name: subset_52 data_files: - split: train path: subset_52/train-* - config_name: subset_53 data_files: - split: train path: subset_53/train-* - config_name: subset_54 data_files: - split: train path: subset_54/train-* - config_name: subset_55 data_files: - split: train path: subset_55/train-* - config_name: subset_56 data_files: - split: train path: subset_56/train-* - config_name: subset_57 data_files: - split: train path: subset_57/train-* - config_name: subset_58 data_files: - split: train path: subset_58/train-* - config_name: subset_59 data_files: - split: train path: subset_59/train-* - config_name: subset_6 data_files: - split: train path: subset_6/train-* - config_name: subset_60 data_files: - split: train path: subset_60/train-* - config_name: subset_61 data_files: - split: train path: subset_61/train-* - config_name: subset_62 data_files: - split: train path: subset_62/train-* - config_name: subset_63 data_files: - split: train path: subset_63/train-* - config_name: subset_64 data_files: - split: train path: subset_64/train-* - config_name: subset_65 data_files: - split: train path: subset_65/train-* - config_name: subset_66 data_files: - split: train path: subset_66/train-* - config_name: subset_67 data_files: - split: train path: subset_67/train-* - config_name: subset_68 data_files: - split: train path: subset_68/train-* - config_name: subset_69 data_files: - split: train path: 
subset_69/train-* - config_name: subset_7 data_files: - split: train path: subset_7/train-* - config_name: subset_70 data_files: - split: train path: subset_70/train-* - config_name: subset_71 data_files: - split: train path: subset_71/train-* - config_name: subset_72 data_files: - split: train path: subset_72/train-* - config_name: subset_73 data_files: - split: train path: subset_73/train-* - config_name: subset_74 data_files: - split: train path: subset_74/train-* - config_name: subset_75 data_files: - split: train path: subset_75/train-* - config_name: subset_76 data_files: - split: train path: subset_76/train-* - config_name: subset_77 data_files: - split: train path: subset_77/train-* - config_name: subset_78 data_files: - split: train path: subset_78/train-* - config_name: subset_79 data_files: - split: train path: subset_79/train-* - config_name: subset_8 data_files: - split: train path: subset_8/train-* - config_name: subset_80 data_files: - split: train path: subset_80/train-* - config_name: subset_81 data_files: - split: train path: subset_81/train-* - config_name: subset_82 data_files: - split: train path: subset_82/train-* - config_name: subset_83 data_files: - split: train path: subset_83/train-* - config_name: subset_84 data_files: - split: train path: subset_84/train-* - config_name: subset_85 data_files: - split: train path: subset_85/train-* - config_name: subset_86 data_files: - split: train path: subset_86/train-* - config_name: subset_87 data_files: - split: train path: subset_87/train-* - config_name: subset_88 data_files: - split: train path: subset_88/train-* - config_name: subset_89 data_files: - split: train path: subset_89/train-* - config_name: subset_9 data_files: - split: train path: subset_9/train-* - config_name: subset_90 data_files: - split: train path: subset_90/train-* - config_name: subset_91 data_files: - split: train path: subset_91/train-* - config_name: subset_test data_files: - split: train path: subset_test/train-* ---
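The `configs` metadata above lists `subset_1` through `subset_91` plus `subset_test`, each pointing its `train` split at a `<config_name>/train-*` file pattern. The sketch below rebuilds that mapping in Python. The helper names (`config_names`, `data_file_patterns`, `load_subset`) are illustrative and not part of the dataset card, the names come out in numeric order rather than the lexicographic order used in the metadata, and `load_subset` assumes the `datasets` package is installed and the Hub is reachable.

```python
from typing import Dict, List

# Repository id as shown in the download link of this page.
REPO_ID = "asahi417/seamless-align-enA-hiA"


def config_names(last_numbered: int = 91) -> List[str]:
    """Return subset_1 ... subset_<last_numbered>, followed by subset_test."""
    names = [f"subset_{i}" for i in range(1, last_numbered + 1)]
    names.append("subset_test")
    return names


def data_file_patterns(names: List[str]) -> Dict[str, str]:
    """Map each config name to the train-split glob listed under `configs`."""
    return {name: f"{name}/train-*" for name in names}


def load_subset(name: str = "subset_test"):
    """Load the train split of one config (needs `datasets` and network access)."""
    # Imported lazily so the two helpers above can be used offline.
    from datasets import load_dataset

    return load_dataset(REPO_ID, name, split="train")
```

`subset_test` is the default here because, according to the metadata, it holds only 8 examples and is therefore the cheapest config to download for a quick check.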
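Provider:
asahi417
Summary of original information

Dataset overview

This dataset is split into multiple subsets (configs) that share the same features: each row pairs an enA.audio clip with an hiA.audio clip and carries metadata for both sides. It is intended mainly for audio-related research and analysis. Key figures for the subsets covered in this summary are listed below:

Subset 1

  • Features:
    • hiA.audio: audio
    • enA.audio: audio
    • line_no: integer (int64)
    • enA.id: string
    • enA.url: string
    • enA.duration_start: integer (int64)
    • enA.duration_end: integer (int64)
    • enA.laser_score: float (float64)
    • hiA.id: string
    • hiA.url: string
    • hiA.duration_start: integer (int64)
    • hiA.duration_end: integer (int64)
    • hiA.laser_score: float (float64)
  • Splits:
    • Train split: 2297 examples, 410483653.543 bytes
    • Download size: 412555784 bytes
    • Dataset size: 410483653.543 bytes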
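To make the per-side metadata fields concrete, here is a minimal sketch of a record type for one side of a pair, with a helper that measures a segment and another that derives the mean size per example from the split figures above. The class and function names are hypothetical, the sample values are invented for illustration, and the 16 kHz conversion is an assumption: the card does not state the unit of `duration_start` and `duration_end`.

```python
from dataclasses import dataclass


@dataclass
class SegmentMeta:
    """Metadata for one side (enA or hiA) of a paired row."""

    id: str
    url: str
    duration_start: int
    duration_end: int
    laser_score: float

    def span(self) -> int:
        """Segment length, in whatever unit the offsets are stored in."""
        return self.duration_end - self.duration_start

    def span_seconds(self, units_per_second: int = 16_000) -> float:
        # ASSUMPTION: offsets are sample indices at 16 kHz; the card does not confirm this.
        return self.span() / units_per_second


def mean_bytes_per_example(num_bytes: float, num_examples: int) -> float:
    """Average stored size of one example in a split."""
    return num_bytes / num_examples


# Hypothetical values, for illustration only.
seg = SegmentMeta(
    id="example-id",
    url="https://example.com/audio",
    duration_start=32_000,
    duration_end=80_000,
    laser_score=1.1,
)

# Figures for subset 1 as listed above: 410483653.543 bytes over 2297 examples.
subset_1_mean = mean_bytes_per_example(410483653.543, 2297)
```

With the subset 1 figures, the mean works out to roughly 179 KB per example, covering both audio clips and the metadata of a row.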

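Subset 10

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 2032 examples, 271862277.536 bytes
    • Download size: 273365121 bytes
    • Dataset size: 271862277.536 bytes

Subset 11

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 1988 examples, 263150120.56 bytes
    • Download size: 261518751 bytes
    • Dataset size: 263150120.56 bytes

Subset 12

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 2011 examples, 269466728.23 bytes
    • Download size: 262544262 bytes
    • Dataset size: 269466728.23 bytes

Subset 13

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 1940 examples, 253581799.08 bytes
    • Download size: 249339076 bytes
    • Dataset size: 253581799.08 bytes

Subset 14

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 1984 examples, 267950451.912 bytes
    • Download size: 261855591 bytes
    • Dataset size: 267950451.912 bytes

Subset 15

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 1965 examples, 247393968.885 bytes
    • Download size: 247129654 bytes
    • Dataset size: 247393968.885 bytes

Subset 16

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 2006 examples, 257412810.412 bytes
    • Download size: 256982287 bytes
    • Dataset size: 257412810.412 bytes

Subset 17

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 2028 examples, 269596131.776 bytes
    • Download size: 272294201 bytes
    • Dataset size: 269596131.776 bytes

Subset 18

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 1996 examples, 253857527.84 bytes
    • Download size: 258901774 bytes
    • Dataset size: 253857527.84 bytes

Subset 19

  • Features:
    • Same as subset 1
  • Splits:
    • Train split: 1968 examples, 261430690.512 bytes
    • Download size: 259714920 bytes
    • Dataset size: 261430690.512 bytes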

子集2

  • 特征:
    • 同子集1
  • 分割:
    • 训练集: 2340个样本,大小419562006.72字节
    • 下载大小: 401315570字节
    • 数据集大小: 419562006.72字节

子集20

  • 特征:
    • 同子集1
  • 分割:
    • 训练集: 1977个样本,大小265881917.419字节
    • 下载大小: 260081233字节
    • 数据集大小: 265881917.419字节

Subset 21

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1979 examples, 262436473.241 bytes
    • Download size: 258149267 bytes
    • Dataset size: 262436473.241 bytes

Subset 22

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 2024 examples, 266669226.24 bytes
    • Download size: 266112306 bytes
    • Dataset size: 266669226.24 bytes

Subset 23

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1986 examples, 258802972.248 bytes
    • Download size: 255535922 bytes
    • Dataset size: 258802972.248 bytes

Subset 24

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1976 examples, 260110000.944 bytes
    • Download size: 259990251 bytes
    • Dataset size: 260110000.944 bytes

Subset 25

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1980 examples, 261131466.28 bytes
    • Download size: 254777016 bytes
    • Dataset size: 261131466.28 bytes

Subset 26

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1938 examples, 257491740.05 bytes
    • Download size: 257527316 bytes
    • Dataset size: 257491740.05 bytes

Subset 27

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1963 examples, 259895156.762 bytes
    • Download size: 256182167 bytes
    • Dataset size: 259895156.762 bytes

Subset 28

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1963 examples, 270180442.326 bytes
    • Download size: 264265589 bytes
    • Dataset size: 270180442.326 bytes

Subset 29

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1955 examples, 261116231.93 bytes
    • Download size: 259846823 bytes
    • Dataset size: 261116231.93 bytes

Subset 3

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 2284 examples, 382949661.988 bytes
    • Download size: 363390598 bytes
    • Dataset size: 382949661.988 bytes

Subset 30

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1940 examples, 257918518.82 bytes
    • Download size: 255822545 bytes
    • Dataset size: 257918518.82 bytes

Subset 31

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1941 examples, 257545460.061 bytes
    • Download size: 254066493 bytes
    • Dataset size: 257545460.061 bytes

Subset 32

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1969 examples, 269109273.799 bytes
    • Download size: 264795977 bytes
    • Dataset size: 269109273.799 bytes

Subset 33

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1958 examples, 258077454.974 bytes
    • Download size: 255785580 bytes
    • Dataset size: 258077454.974 bytes

Subset 34

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1876 examples, 249061629.604 bytes
    • Download size: 248289822 bytes
    • Dataset size: 249061629.604 bytes

Subset 35

  • Features:
    • Same as Subset 1
  • Splits:
    • Train: 1941 examples, 257545460.061 bytes
    • Download size: 254066493 bytes
    • Dataset size: 257545460.061 bytes
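
The subsets above appear in lexicographic order of their names (1, 10, 11, ..., 19, 2, 20, ...), which is what a plain string sort produces. The following sketch shows one way to put config names into numeric order instead, assuming the `subset_<n>` naming from the dataset card. The function name `subset_index` is hypothetical, and the list contains only a handful of names for illustration.

```python
import re


def subset_index(name: str) -> int:
    """Extract the trailing integer from a config name like 'subset_12'."""
    match = re.fullmatch(r"subset_(\d+)", name)
    if match is None:
        raise ValueError(f"unexpected config name: {name!r}")
    return int(match.group(1))


# A few config names in the lexicographic order used by the listing.
listed = ["subset_1", "subset_10", "subset_11", "subset_2", "subset_20", "subset_3"]

# Sort by the numeric suffix rather than by the raw string.
numeric_order = sorted(listed, key=subset_index)
print(numeric_order)
```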

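Each subset contains audio data together with its metadata, such as each segment's start and end offsets (duration_start, duration_end) and its LASER score (laser_score), and is suited to research on audio processing and analysis.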
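
The metadata fields can be used to filter pairs and estimate segment lengths. The sketch below is illustrative only: the rows are made up, the threshold of 1.1 is arbitrary, and the unit of `duration_start` and `duration_end` is not stated on this page, so the code assumes they are sample indices at 16 kHz. The helpers `segment_seconds` and `keep_confident` are hypothetical names.

```python
from typing import Dict, Iterable, List

SAMPLE_RATE_HZ = 16_000  # ASSUMPTION: offsets are sample indices at 16 kHz


def segment_seconds(row: Dict, side: str, sample_rate: int = SAMPLE_RATE_HZ) -> float:
    """Length of one side's segment ('enA' or 'hiA') in seconds, under the assumption above."""
    start = row[f"{side}.duration_start"]
    end = row[f"{side}.duration_end"]
    return (end - start) / sample_rate


def keep_confident(rows: Iterable[Dict], min_score: float) -> List[Dict]:
    """Keep rows whose enA and hiA LASER scores both reach min_score."""
    return [
        r for r in rows
        if r["enA.laser_score"] >= min_score and r["hiA.laser_score"] >= min_score
    ]


# Made-up rows for illustration only; not taken from the dataset.
rows = [
    {"line_no": 0,
     "enA.duration_start": 0, "enA.duration_end": 32_000, "enA.laser_score": 1.20,
     "hiA.duration_start": 16_000, "hiA.duration_end": 64_000, "hiA.laser_score": 1.20},
    {"line_no": 1,
     "enA.duration_start": 0, "enA.duration_end": 16_000, "enA.laser_score": 1.05,
     "hiA.duration_start": 0, "hiA.duration_end": 16_000, "hiA.laser_score": 1.05},
]

kept = keep_confident(rows, min_score=1.1)
for r in kept:
    print(r["line_no"], segment_seconds(r, "enA"), segment_seconds(r, "hiA"))
```

If the offsets turn out to use a different unit, only `SAMPLE_RATE_HZ` (or the `sample_rate` argument) needs to change.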
