atutej/m_lama

Source: Hugging Face. Updated 2024-03-14. Indexed 2024-03-04.
Download link:
https://hf-mirror.com/datasets/atutej/m_lama
Resource description:
Dataset card metadata (dataset_info): 54 configurations, each with the same features and a single test split.

Features:

  • uuid: string
  • lineid: uint32
  • obj_uri: string
  • obj_label: string
  • sub_uri: string
  • sub_label: string
  • template: string
  • language: string
  • predicate_id: string
  • options: sequence of string

Test split per config_name (dataset_size equals num_bytes in every configuration):

  • af: num_examples 7331, num_bytes 1364986, download_size 544481
  • ar: num_examples 19354, num_bytes 4564504, download_size 1580143
  • az: num_examples 7653, num_bytes 1467465, download_size 578396
  • be: num_examples 8853, num_bytes 2285464, download_size 714406
  • bg: num_examples 12461, num_bytes 3109085, download_size 1013009
  • bn: num_examples 8975, num_bytes 2969863, download_size 748274
  • ca: num_examples 24287, num_bytes 4620850, download_size 1940588
  • ceb: num_examples 6769, num_bytes 1433194, download_size 524854
  • cs: num_examples 15848, num_bytes 2997353, download_size 1246743
  • cy: num_examples 9915, num_bytes 1901684, download_size 769225
  • da: num_examples 19636, num_bytes 3672623, download_size 1535250
  • de: num_examples 32548, num_bytes 6348506, download_size 2613173
  • el: num_examples 12854, num_bytes 3416098, download_size 1074167
  • en: num_examples 37498, num_bytes 7031572, download_size 3023574
  • es: num_examples 31578, num_bytes 6000790, download_size 2542929
  • et: num_examples 9880, num_bytes 1847160, download_size 748222
  • eu: num_examples 11910, num_bytes 2260887, download_size 921424
  • fa: num_examples 18481, num_bytes 4482869, download_size 1497801
  • fi: num_examples 19017, num_bytes 3575879, download_size 1477166
  • fr: num_examples 33872, num_bytes 6553643, download_size 2716208
  • ga: num_examples 13937, num_bytes 2809813, download_size 1076939
  • gl: num_examples 10567, num_bytes 2062413, download_size 817987
  • he: num_examples 14769, num_bytes 3273282, download_size 1165490
  • hi: num_examples 8570, num_bytes 2750247, download_size 707213
  • hr: num_examples 9322, num_bytes 1766612, download_size 714362
  • hu: num_examples 18850, num_bytes 3629786, download_size 1485748
  • hy: num_examples 10030, num_bytes 2580835, download_size 809063
  • id: num_examples 14183, num_bytes 2693872, download_size 1103155
  • it: num_examples 27648, num_bytes 5287655, download_size 2198936
  • ja: num_examples 25356, num_bytes 6105411, download_size 2091964
  • ka: num_examples 8099, num_bytes 2649721, download_size 647390
  • ko: num_examples 16327, num_bytes 3526211, download_size 1309593
  • la: num_examples 8061, num_bytes 1581833, download_size 612760
  • lt: num_examples 9560, num_bytes 1835683, download_size 736354
  • lv: num_examples 8474, num_bytes 1649860, download_size 643807
  • ms: num_examples 9146, num_bytes 1768627, download_size 702211
  • nl: num_examples 32423, num_bytes 6221612, download_size 2597145
  • pl: num_examples 20727, num_bytes 4013247, download_size 1644648
  • pt: num_examples 21023, num_bytes 4044269, download_size 1653658
  • ro: num_examples 12886, num_bytes 2523121, download_size 1007651
  • ru: num_examples 25335, num_bytes 6405438, download_size 2129105
  • sk: num_examples 10205, num_bytes 1942547, download_size 788723
  • sl: num_examples 18091, num_bytes 3455705, download_size 1406987
  • sq: num_examples 12586, num_bytes 2404246, download_size 956395
  • sr: num_examples 12477, num_bytes 3104514, download_size 1027773
  • sv: num_examples 24240, num_bytes 4536924, download_size 1905031
  • ta: num_examples 7223, num_bytes 2546658, download_size 599177
  • th: num_examples 9786, num_bytes 3451558, download_size 851558
  • tr: num_examples 14209, num_bytes 2701219, download_size 1101256
  • transliterated-hi: num_examples 8570, num_bytes 1619992, download_size 646087
  • uk: num_examples 18035, num_bytes 4528716, download_size 1523846
  • ur: num_examples 7279, num_bytes 1774430, download_size 576108
  • vi: num_examples 11350, num_bytes 2331103, download_size 893519
  • zh: num_examples 21449, num_bytes 4178875, download_size 1747217

Data files (configs): the test split of each configuration is read from <config_name>/test-*, for example af/test-*.

Extension/Modification of the original m_lama dataset
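Provider:
atutej
Summary of original information

Dataset overview

The dataset contains configurations for multiple languages. Every language configuration has the same features and a single test split. Details for each language configuration follow:

Language configurations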

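  • af (Afrikaans)
  • ar (Arabic)
  • az (Azerbaijani)
  • be (Belarusian)
  • bg (Bulgarian)
  • bn (Bengali)
  • ca (Catalan)
  • ceb (Cebuano)
  • cs (Czech)
  • cy (Welsh)
  • da (Danish)
  • de (German)
  • el (Greek)
  • en (English)
  • es (Spanish)
  • et (Estonian)
  • eu (Basque)
  • fa (Persian)
  • fi (Finnish)
  • fr (French)
  • ga (Irish)
  • gl (Galician)
  • he (Hebrew)
  • hi (Hindi)
  • hr (Croatian)
  • hu (Hungarian)
  • hy (Armenian)
  • id (Indonesian)
  • it (Italian)
  • ja (Japanese)
  • ka (Georgian)
  • ko (Korean)
  • la (Latin)
  • lt (Lithuanian)
  • lv (Latvian)
  • ms (Malay)
  • nl (Dutch)
  • pl (Polish)
  • pt (Portuguese)
  • ro (Romanian)
  • ru (Russian)
  • sk (Slovak)
  • sl (Slovenian)
  • sq (Albanian)
  • sr (Serbian)
  • sv (Swedish)
  • ta (Tamil)
  • th (Thai)
  • tr (Turkish)
  • transliterated-hi (transliterated Hindi)
  • uk (Ukrainian)
  • ur (Urdu)
  • vi (Vietnamese)
  • zh (Chinese)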

Common features

Every language configuration contains the following features:

  • uuid: string
  • lineid: 32-bit unsigned integer
  • obj_uri: string
  • obj_label: string
  • sub_uri: string
  • sub_label: string
  • template: string
  • language: string
  • predicate_id: string
  • options: sequence of strings
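
The feature list above can be turned into a small row check. The following is a minimal sketch using only the Python standard library. The helper name `validate_row` and the sample row are hypothetical and invented for illustration; the sample values are not taken from the dataset.

```python
from typing import Any

# Column names as listed above; all of these are plain strings.
STRING_FIELDS = (
    "uuid", "obj_uri", "obj_label", "sub_uri", "sub_label",
    "template", "language", "predicate_id",
)
UINT32_MAX = 2**32 - 1  # largest value a uint32 column can hold


def validate_row(row: dict[str, Any]) -> list[str]:
    """Return a list of schema problems; an empty list means the row conforms."""
    problems = []
    for name in STRING_FIELDS:
        if not isinstance(row.get(name), str):
            problems.append(f"{name}: expected string")
    lineid = row.get("lineid")
    # bool is a subclass of int in Python, so it is excluded explicitly.
    if (
        not isinstance(lineid, int)
        or isinstance(lineid, bool)
        or not 0 <= lineid <= UINT32_MAX
    ):
        problems.append("lineid: expected unsigned 32-bit integer")
    options = row.get("options")
    if not isinstance(options, (list, tuple)) or not all(
        isinstance(option, str) for option in options
    ):
        problems.append("options: expected sequence of strings")
    return problems


# Hypothetical row: every value here is made up for illustration only.
example_row = {
    "uuid": "00000000-0000-0000-0000-000000000000",
    "lineid": 0,
    "obj_uri": "Q0",
    "obj_label": "object",
    "sub_uri": "Q1",
    "sub_label": "subject",
    "template": "placeholder template text",
    "language": "en",
    "predicate_id": "P0",
    "options": ["object", "another option"],
}

print(validate_row(example_row))
```

A check like this is mainly useful when rows are exported to JSON or CSV and read back, where the integer and sequence types can be lost.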

Data splits

Every language configuration contains a single test split:

  • test: reports the data size in bytes and the number of examples
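
As a sketch of how one configuration's test split might be loaded, assuming the Hugging Face `datasets` library is installed and the repository `atutej/m_lama` is reachable over the network: the helper names `load_test_split` and `data_files_pattern` are hypothetical, while the `<config_name>/test-*` path pattern comes from the dataset card metadata.

```python
DATASET_ID = "atutej/m_lama"  # repository name shown at the top of this page


def data_files_pattern(config_name: str) -> str:
    """Path pattern of a configuration's test files, as declared in the card metadata."""
    return f"{config_name}/test-*"


def load_test_split(config_name: str):
    """Load the test split of one language configuration.

    Requires the `datasets` package and network access; the import is kept
    inside the function so this module can be imported without either.
    """
    from datasets import load_dataset

    return load_dataset(DATASET_ID, config_name, split="test")


# Usage (not executed here because it downloads data):
#   af_test = load_test_split("af")
#   print(af_test.column_names)
print(data_files_pattern("af"))
```

Because each configuration has only a test split, passing `split="test"` returns a single dataset object rather than a dictionary of splits.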

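Data sizes

The download size and dataset size of each language configuration are as follows:

  • af: download size 544481 bytes, dataset size 1364986 bytes
  • ar: download size 1580143 bytes, dataset size 4564504 bytes
  • az: download size 578396 bytes, dataset size 1467465 bytes
  • be: download size 714406 bytes, dataset size 2285464 bytes
  • bg: download size 1013009 bytes, dataset size 3109085 bytes
  • bn: download size 748274 bytes, dataset size 2969863 bytes
  • ca: download size 1940588 bytes, dataset size 4620850 bytes
  • ceb: download size 524854 bytes, dataset size 1433194 bytes
  • cs: download size 1246743 bytes, dataset size 2997353 bytes
  • cy: download size 769225 bytes, dataset size 1901684 bytes
  • da: download size 1535250 bytes, dataset size 3672623 bytes
  • de: download size 2613173 bytes, dataset size 6348506 bytes
  • el: download size 1074167 bytes, dataset size 3416098 bytes
  • en: download size 3023574 bytes, dataset size 7031572 bytes
  • es: download size 2542929 bytes, dataset size 6000790 bytes
  • et: download size 748222 bytes, dataset size 1847160 bytes
  • eu: download size 921424 bytes, dataset size 2260887 bytes
  • fa: download size 1497801 bytes, dataset size 4482869 bytes
  • fi: download size 1477166 bytes, dataset size 3575879 bytes
  • fr: download size 2716208 bytes, dataset size 6553643 bytes
  • ga: download size 1076939 bytes, dataset size 2809813 bytes
  • gl: download size 817987 bytes, dataset size 2062413 bytes
  • he: download size 1165490 bytes, dataset size 3273282 bytes
  • hi: download size 707213 bytes, dataset size 2750247 bytes
  • hr: download size 714362 bytes, dataset size 1766612 bytes
  • hu: download size 1485748 bytes, dataset size 3629786 bytes
  • hy: download size 809063 bytes, dataset size 2580835 bytes
  • id: download size 1103155 bytes, dataset size 2693872 bytes
  • it: download size 2198936 bytes, dataset size 5287655 bytes
  • ja: download size 2091964 bytes, dataset size 6105411 bytes
  • ka: download size 647390 bytes, dataset size 2649721 bytes
  • ko: download size 1309593 bytes, dataset size 3526211 bytes
  • la: download size 612760 bytes, dataset size 1581833 bytes
  • lt: download size 736354 bytes, dataset size 1835683 bytes
  • lv: download size 643807 bytes, dataset size 1649860 bytes
  • ms: download size 702211 bytes, dataset size 1768627 bytes
  • nl: download size 2597145 bytes, dataset size 6221612 bytes
  • pl: download size 1644648 bytes, dataset size 4013247 bytes
  • pt: download size 1653658 bytes, dataset size 4044269 bytes
  • ro: download size 1007651 bytes, dataset size 2523121 bytes
  • ru: download size 2129105 bytes, dataset size 6405438 bytes
  • sk: download size 788723 bytes, dataset size 1942547 bytes
  • sl: download size 1406987 bytes, dataset size 3455705 bytes
  • sq: download size 956395 bytes, dataset size 2404246 bytes
  • sr: download size 1027773 bytes, dataset size 3104514 bytes
  • sv: download size 1905031 bytes, dataset size 4536924 bytes
  • ta: download size 599177 bytes, dataset size 2546658 bytes
  • th: download size 851558 bytes, dataset size 3451558 bytes
  • tr: download size 1101256 bytes, dataset size 2701219 bytes
  • transliterated-hi: download size 646087 bytes, dataset size 1619992 bytes
  • uk: download size 1523846 bytes, dataset size 4528716 bytes
  • ur: download size 576108 bytes, dataset size 1774430 bytes
  • vi: download size 893519 bytes, dataset size 2331103 bytes
  • zh: download size 1747217 bytes, dataset size 4178875 bytes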
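
To make the byte counts easier to compare, the sketch below converts sizes to MiB and computes the ratio of download size to dataset size for three configurations (af, en, hi) whose figures are copied from the list above. It assumes, as is usual for Hugging Face dataset metadata, that the download size refers to the stored files and the dataset size to the loaded data. The helper names are hypothetical.

```python
# Sizes in bytes for three configurations, copied from the list above.
SIZES = {
    "af": {"download": 544481, "dataset": 1364986},
    "en": {"download": 3023574, "dataset": 7031572},
    "hi": {"download": 707213, "dataset": 2750247},
}


def download_ratio(config_name: str) -> float:
    """Download size divided by dataset size for one configuration."""
    entry = SIZES[config_name]
    return entry["download"] / entry["dataset"]


def to_mib(num_bytes: int) -> float:
    """Convert a byte count to mebibytes (1 MiB = 1024 * 1024 bytes)."""
    return num_bytes / (1024 * 1024)


for name, entry in SIZES.items():
    print(
        f"{name}: dataset {to_mib(entry['dataset']):.2f} MiB, "
        f"download {to_mib(entry['download']):.2f} MiB, "
        f"ratio {download_ratio(name):.1%}"
    )
```

Among these three, hi has the lowest ratio, which is consistent with the other configurations in non-Latin scripts in the list (for example bn, ka, ta, th).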