atutej/m_lama
收藏Hugging Face2024-03-14 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/atutej/m_lama
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: af
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1364986
num_examples: 7331
download_size: 544481
dataset_size: 1364986
- config_name: ar
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 4564504
num_examples: 19354
download_size: 1580143
dataset_size: 4564504
- config_name: az
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1467465
num_examples: 7653
download_size: 578396
dataset_size: 1467465
- config_name: be
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2285464
num_examples: 8853
download_size: 714406
dataset_size: 2285464
- config_name: bg
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3109085
num_examples: 12461
download_size: 1013009
dataset_size: 3109085
- config_name: bn
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2969863
num_examples: 8975
download_size: 748274
dataset_size: 2969863
- config_name: ca
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 4620850
num_examples: 24287
download_size: 1940588
dataset_size: 4620850
- config_name: ceb
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1433194
num_examples: 6769
download_size: 524854
dataset_size: 1433194
- config_name: cs
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2997353
num_examples: 15848
download_size: 1246743
dataset_size: 2997353
- config_name: cy
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1901684
num_examples: 9915
download_size: 769225
dataset_size: 1901684
- config_name: da
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3672623
num_examples: 19636
download_size: 1535250
dataset_size: 3672623
- config_name: de
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 6348506
num_examples: 32548
download_size: 2613173
dataset_size: 6348506
- config_name: el
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3416098
num_examples: 12854
download_size: 1074167
dataset_size: 3416098
- config_name: en
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 7031572
num_examples: 37498
download_size: 3023574
dataset_size: 7031572
- config_name: es
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 6000790
num_examples: 31578
download_size: 2542929
dataset_size: 6000790
- config_name: et
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1847160
num_examples: 9880
download_size: 748222
dataset_size: 1847160
- config_name: eu
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2260887
num_examples: 11910
download_size: 921424
dataset_size: 2260887
- config_name: fa
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 4482869
num_examples: 18481
download_size: 1497801
dataset_size: 4482869
- config_name: fi
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3575879
num_examples: 19017
download_size: 1477166
dataset_size: 3575879
- config_name: fr
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 6553643
num_examples: 33872
download_size: 2716208
dataset_size: 6553643
- config_name: ga
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2809813
num_examples: 13937
download_size: 1076939
dataset_size: 2809813
- config_name: gl
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2062413
num_examples: 10567
download_size: 817987
dataset_size: 2062413
- config_name: he
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3273282
num_examples: 14769
download_size: 1165490
dataset_size: 3273282
- config_name: hi
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2750247
num_examples: 8570
download_size: 707213
dataset_size: 2750247
- config_name: hr
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1766612
num_examples: 9322
download_size: 714362
dataset_size: 1766612
- config_name: hu
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3629786
num_examples: 18850
download_size: 1485748
dataset_size: 3629786
- config_name: hy
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2580835
num_examples: 10030
download_size: 809063
dataset_size: 2580835
- config_name: id
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2693872
num_examples: 14183
download_size: 1103155
dataset_size: 2693872
- config_name: it
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 5287655
num_examples: 27648
download_size: 2198936
dataset_size: 5287655
- config_name: ja
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 6105411
num_examples: 25356
download_size: 2091964
dataset_size: 6105411
- config_name: ka
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2649721
num_examples: 8099
download_size: 647390
dataset_size: 2649721
- config_name: ko
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3526211
num_examples: 16327
download_size: 1309593
dataset_size: 3526211
- config_name: la
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1581833
num_examples: 8061
download_size: 612760
dataset_size: 1581833
- config_name: lt
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1835683
num_examples: 9560
download_size: 736354
dataset_size: 1835683
- config_name: lv
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1649860
num_examples: 8474
download_size: 643807
dataset_size: 1649860
- config_name: ms
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1768627
num_examples: 9146
download_size: 702211
dataset_size: 1768627
- config_name: nl
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 6221612
num_examples: 32423
download_size: 2597145
dataset_size: 6221612
- config_name: pl
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 4013247
num_examples: 20727
download_size: 1644648
dataset_size: 4013247
- config_name: pt
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 4044269
num_examples: 21023
download_size: 1653658
dataset_size: 4044269
- config_name: ro
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2523121
num_examples: 12886
download_size: 1007651
dataset_size: 2523121
- config_name: ru
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 6405438
num_examples: 25335
download_size: 2129105
dataset_size: 6405438
- config_name: sk
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1942547
num_examples: 10205
download_size: 788723
dataset_size: 1942547
- config_name: sl
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3455705
num_examples: 18091
download_size: 1406987
dataset_size: 3455705
- config_name: sq
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2404246
num_examples: 12586
download_size: 956395
dataset_size: 2404246
- config_name: sr
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3104514
num_examples: 12477
download_size: 1027773
dataset_size: 3104514
- config_name: sv
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 4536924
num_examples: 24240
download_size: 1905031
dataset_size: 4536924
- config_name: ta
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2546658
num_examples: 7223
download_size: 599177
dataset_size: 2546658
- config_name: th
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 3451558
num_examples: 9786
download_size: 851558
dataset_size: 3451558
- config_name: tr
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2701219
num_examples: 14209
download_size: 1101256
dataset_size: 2701219
- config_name: transliterated-hi
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1619992
num_examples: 8570
download_size: 646087
dataset_size: 1619992
- config_name: uk
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 4528716
num_examples: 18035
download_size: 1523846
dataset_size: 4528716
- config_name: ur
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 1774430
num_examples: 7279
download_size: 576108
dataset_size: 1774430
- config_name: vi
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 2331103
num_examples: 11350
download_size: 893519
dataset_size: 2331103
- config_name: zh
features:
- name: uuid
dtype: string
- name: lineid
dtype: uint32
- name: obj_uri
dtype: string
- name: obj_label
dtype: string
- name: sub_uri
dtype: string
- name: sub_label
dtype: string
- name: template
dtype: string
- name: language
dtype: string
- name: predicate_id
dtype: string
- name: options
sequence: string
splits:
- name: test
num_bytes: 4178875
num_examples: 21449
download_size: 1747217
dataset_size: 4178875
configs:
- config_name: af
data_files:
- split: test
path: af/test-*
- config_name: ar
data_files:
- split: test
path: ar/test-*
- config_name: az
data_files:
- split: test
path: az/test-*
- config_name: be
data_files:
- split: test
path: be/test-*
- config_name: bg
data_files:
- split: test
path: bg/test-*
- config_name: bn
data_files:
- split: test
path: bn/test-*
- config_name: ca
data_files:
- split: test
path: ca/test-*
- config_name: ceb
data_files:
- split: test
path: ceb/test-*
- config_name: cs
data_files:
- split: test
path: cs/test-*
- config_name: cy
data_files:
- split: test
path: cy/test-*
- config_name: da
data_files:
- split: test
path: da/test-*
- config_name: de
data_files:
- split: test
path: de/test-*
- config_name: el
data_files:
- split: test
path: el/test-*
- config_name: en
data_files:
- split: test
path: en/test-*
- config_name: es
data_files:
- split: test
path: es/test-*
- config_name: et
data_files:
- split: test
path: et/test-*
- config_name: eu
data_files:
- split: test
path: eu/test-*
- config_name: fa
data_files:
- split: test
path: fa/test-*
- config_name: fi
data_files:
- split: test
path: fi/test-*
- config_name: fr
data_files:
- split: test
path: fr/test-*
- config_name: ga
data_files:
- split: test
path: ga/test-*
- config_name: gl
data_files:
- split: test
path: gl/test-*
- config_name: he
data_files:
- split: test
path: he/test-*
- config_name: hi
data_files:
- split: test
path: hi/test-*
- config_name: hr
data_files:
- split: test
path: hr/test-*
- config_name: hu
data_files:
- split: test
path: hu/test-*
- config_name: hy
data_files:
- split: test
path: hy/test-*
- config_name: id
data_files:
- split: test
path: id/test-*
- config_name: it
data_files:
- split: test
path: it/test-*
- config_name: ja
data_files:
- split: test
path: ja/test-*
- config_name: ka
data_files:
- split: test
path: ka/test-*
- config_name: ko
data_files:
- split: test
path: ko/test-*
- config_name: la
data_files:
- split: test
path: la/test-*
- config_name: lt
data_files:
- split: test
path: lt/test-*
- config_name: lv
data_files:
- split: test
path: lv/test-*
- config_name: ms
data_files:
- split: test
path: ms/test-*
- config_name: nl
data_files:
- split: test
path: nl/test-*
- config_name: pl
data_files:
- split: test
path: pl/test-*
- config_name: pt
data_files:
- split: test
path: pt/test-*
- config_name: ro
data_files:
- split: test
path: ro/test-*
- config_name: ru
data_files:
- split: test
path: ru/test-*
- config_name: sk
data_files:
- split: test
path: sk/test-*
- config_name: sl
data_files:
- split: test
path: sl/test-*
- config_name: sq
data_files:
- split: test
path: sq/test-*
- config_name: sr
data_files:
- split: test
path: sr/test-*
- config_name: sv
data_files:
- split: test
path: sv/test-*
- config_name: ta
data_files:
- split: test
path: ta/test-*
- config_name: th
data_files:
- split: test
path: th/test-*
- config_name: tr
data_files:
- split: test
path: tr/test-*
- config_name: transliterated-hi
data_files:
- split: test
path: transliterated-hi/test-*
- config_name: uk
data_files:
- split: test
path: uk/test-*
- config_name: ur
data_files:
- split: test
path: ur/test-*
- config_name: vi
data_files:
- split: test
path: vi/test-*
- config_name: zh
data_files:
- split: test
path: zh/test-*
---
Extension/Modification of the original m_lama dataset
提供机构:
atutej
原始信息汇总
数据集概述
该数据集包含多种语言的配置,每种语言配置下包含相同的特征和测试数据分割。以下是各语言配置的详细信息:
语言配置列表
- af (南非荷兰语)
- ar (阿拉伯语)
- az (阿塞拜疆语)
- be (白俄罗斯语)
- bg (保加利亚语)
- bn (孟加拉语)
- ca (加泰罗尼亚语)
- ceb (宿务语)
- cs (捷克语)
- cy (威尔士语)
- da (丹麦语)
- de (德语)
- el (希腊语)
- en (英语)
- es (西班牙语)
- et (爱沙尼亚语)
- eu (巴斯克语)
- fa (波斯语)
- fi (芬兰语)
- fr (法语)
- ga (爱尔兰语)
- gl (加利西亚语)
- he (希伯来语)
- hi (印地语)
- hr (克罗地亚语)
- hu (匈牙利语)
- hy (亚美尼亚语)
- id (印度尼西亚语)
- it (意大利语)
- ja (日语)
- ka (格鲁吉亚语)
- ko (韩语)
- la (拉丁语)
- lt (立陶宛语)
- lv (拉脱维亚语)
- ms (马来语)
- nl (荷兰语)
- pl (波兰语)
- pt (葡萄牙语)
共同特征
每种语言配置包含以下特征:
- uuid: 字符串类型
- lineid: 32位无符号整数类型
- obj_uri: 字符串类型
- obj_label: 字符串类型
- sub_uri: 字符串类型
- sub_label: 字符串类型
- template: 字符串类型
- language: 字符串类型
- predicate_id: 字符串类型
- options: 字符串序列类型
数据分割
每种语言配置包含一个测试数据分割:
- test: 包含数据字节数和示例数量
数据大小
每种语言配置的下载大小和数据集大小如下:
- af: 下载大小 544481 字节,数据集大小 1364986 字节
- ar: 下载大小 1580143 字节,数据集大小 4564504 字节
- az: 下载大小 578396 字节,数据集大小 1467465 字节
- be: 下载大小 714406 字节,数据集大小 2285464 字节
- bg: 下载大小 1013009 字节,数据集大小 3109085 字节
- bn: 下载大小 748274 字节,数据集大小 2969863 字节
- ca: 下载大小 1940588 字节,数据集大小 4620850 字节
- ceb: 下载大小 524854 字节,数据集大小 1433194 字节
- cs: 下载大小 1246743 字节,数据集大小 2997353 字节
- cy: 下载大小 769225 字节,数据集大小 1901684 字节
- da: 下载大小 1535250 字节,数据集大小 3672623 字节
- de: 下载大小 2613173 字节,数据集大小 6348506 字节
- el: 下载大小 1074167 字节,数据集大小 3416098 字节
- en: 下载大小 3023574 字节,数据集大小 7031572 字节
- es: 下载大小 2542929 字节,数据集大小 6000790 字节
- et: 下载大小 748222 字节,数据集大小 1847160 字节
- eu: 下载大小 921424 字节,数据集大小 2260887 字节
- fa: 下载大小 1497801 字节,数据集大小 4482869 字节
- fi: 下载大小 1477166 字节,数据集大小 3575879 字节
- fr: 下载大小 2716208 字节,数据集大小 6553643 字节
- ga: 下载大小 1076939 字节,数据集大小 2809813 字节
- gl: 下载大小 817987 字节,数据集大小 2062413 字节
- he: 下载大小 1165490 字节,数据集大小 3273282 字节
- hi: 下载大小 707213 字节,数据集大小 2750247 字节
- hr: 下载大小 714362 字节,数据集大小 1766612 字节
- hu: 下载大小 1485748 字节,数据集大小 3629786 字节
- hy: 下载大小 809063 字节,数据集大小 2580835 字节
- id: 下载大小 1103155 字节,数据集大小 2693872 字节
- it: 下载大小 2198936 字节,数据集大小 5287655 字节
- ja: 下载大小 2091964 字节,数据集大小 6105411 字节
- ka: 下载大小 647390 字节,数据集大小 2649721 字节
- ko: 下载大小 1309593 字节,数据集大小 3526211 字节
- la: 下载大小 612760 字节,数据集大小 1581833 字节
- lt: 下载大小 736354 字节,数据集大小 1835683 字节
- lv: 下载大小 643807 字节,数据集大小 1649860 字节
- ms: 下载大小 702211 字节,数据集大小 1768627 字节
- nl: 下载大小 2597145 字节,数据集大小 6221612 字节
- pl: 下载大小 1644648 字节,数据集大小 4013247 字节
- pt: 下载大小 1580143 字节,数据集大小 4564504 字节



