liu-nlp/estonian-blimp-single-error
收藏Hugging Face2025-12-04 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/liu-nlp/estonian-blimp-single-error
下载链接
链接失效反馈官方服务:
资源简介:
---
dataset_info:
- config_name: gen-pl-to-ill-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 148916
num_examples: 500
download_size: 104561
dataset_size: 148916
- config_name: gen-pl-to-nom-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 146952
num_examples: 500
download_size: 104092
dataset_size: 146952
- config_name: gen-pl-to-part-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 147097
num_examples: 500
download_size: 104193
dataset_size: 147097
- config_name: gen-sg-to-gen-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 495831
num_examples: 1793
download_size: 326422
dataset_size: 495831
- config_name: gen-sg-to-ill-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 438414
num_examples: 1581
download_size: 291428
dataset_size: 438414
- config_name: gen-sg-to-nom-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 380416
num_examples: 1337
download_size: 256308
dataset_size: 380416
- config_name: gen-sg-to-part-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 409475
num_examples: 1470
download_size: 271842
dataset_size: 409475
- config_name: ind-prs-3pl-to-1pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 27748
num_examples: 93
download_size: 22331
dataset_size: 27748
- config_name: ind-prs-3pl-to-2pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 27749
num_examples: 93
download_size: 22310
dataset_size: 27749
- config_name: ind-prs-3sg-to-1sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 57178
num_examples: 195
download_size: 43510
dataset_size: 57178
- config_name: ind-prs-3sg-to-2sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 57178
num_examples: 195
download_size: 43496
dataset_size: 57178
- config_name: ind-pst-3pl-to-1pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 35797
num_examples: 103
download_size: 28386
dataset_size: 35797
- config_name: ind-pst-3pl-to-2pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 35799
num_examples: 103
download_size: 28348
dataset_size: 35799
- config_name: ind-pst-3sg-to-1sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 191237
num_examples: 653
download_size: 132497
dataset_size: 191237
- config_name: ind-pst-3sg-to-2sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 191236
num_examples: 653
download_size: 132427
dataset_size: 191236
- config_name: nom-pl-to-gen-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 125921
num_examples: 452
download_size: 87765
dataset_size: 125921
- config_name: nom-pl-to-ill-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 127276
num_examples: 452
download_size: 88020
dataset_size: 127276
- config_name: nom-pl-to-part-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 125641
num_examples: 452
download_size: 87685
dataset_size: 125641
- config_name: nom-sg-to-gen-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 301018
num_examples: 1092
download_size: 206240
dataset_size: 301018
- config_name: nom-sg-to-ill-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 358539
num_examples: 1316
download_size: 243096
dataset_size: 358539
- config_name: nom-sg-to-nom-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 357969
num_examples: 1316
download_size: 242787
dataset_size: 357969
- config_name: nom-sg-to-part-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 326801
num_examples: 1184
download_size: 223657
dataset_size: 326801
- config_name: part-pl-to-gen-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 73051
num_examples: 241
download_size: 53746
dataset_size: 73051
- config_name: part-pl-to-ill-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 73772
num_examples: 241
download_size: 53953
dataset_size: 73772
- config_name: part-sg-to-gen-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 149312
num_examples: 536
download_size: 104106
dataset_size: 149312
- config_name: part-sg-to-ill-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 133173
num_examples: 461
download_size: 93284
dataset_size: 133173
- config_name: part-sg-to-nom-sg
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 182777
num_examples: 649
download_size: 127580
dataset_size: 182777
- config_name: part-sg-to-part-pl
features:
- name: correct
dtype: string
- name: incorrect
dtype: string
splits:
- name: train
num_bytes: 200139
num_examples: 706
download_size: 139232
dataset_size: 200139
configs:
- config_name: gen-pl-to-ill-pl
data_files:
- split: train
path: gen-pl-to-ill-pl/train-*
- config_name: gen-pl-to-nom-pl
data_files:
- split: train
path: gen-pl-to-nom-pl/train-*
- config_name: gen-pl-to-part-pl
data_files:
- split: train
path: gen-pl-to-part-pl/train-*
- config_name: gen-sg-to-gen-pl
data_files:
- split: train
path: gen-sg-to-gen-pl/train-*
- config_name: gen-sg-to-ill-sg
data_files:
- split: train
path: gen-sg-to-ill-sg/train-*
- config_name: gen-sg-to-nom-sg
data_files:
- split: train
path: gen-sg-to-nom-sg/train-*
- config_name: gen-sg-to-part-sg
data_files:
- split: train
path: gen-sg-to-part-sg/train-*
- config_name: ind-prs-3pl-to-1pl
data_files:
- split: train
path: ind-prs-3pl-to-1pl/train-*
- config_name: ind-prs-3pl-to-2pl
data_files:
- split: train
path: ind-prs-3pl-to-2pl/train-*
- config_name: ind-prs-3sg-to-1sg
data_files:
- split: train
path: ind-prs-3sg-to-1sg/train-*
- config_name: ind-prs-3sg-to-2sg
data_files:
- split: train
path: ind-prs-3sg-to-2sg/train-*
- config_name: ind-pst-3pl-to-1pl
data_files:
- split: train
path: ind-pst-3pl-to-1pl/train-*
- config_name: ind-pst-3pl-to-2pl
data_files:
- split: train
path: ind-pst-3pl-to-2pl/train-*
- config_name: ind-pst-3sg-to-1sg
data_files:
- split: train
path: ind-pst-3sg-to-1sg/train-*
- config_name: ind-pst-3sg-to-2sg
data_files:
- split: train
path: ind-pst-3sg-to-2sg/train-*
- config_name: nom-pl-to-gen-pl
data_files:
- split: train
path: nom-pl-to-gen-pl/train-*
- config_name: nom-pl-to-ill-pl
data_files:
- split: train
path: nom-pl-to-ill-pl/train-*
- config_name: nom-pl-to-part-pl
data_files:
- split: train
path: nom-pl-to-part-pl/train-*
- config_name: nom-sg-to-gen-sg
data_files:
- split: train
path: nom-sg-to-gen-sg/train-*
- config_name: nom-sg-to-ill-sg
data_files:
- split: train
path: nom-sg-to-ill-sg/train-*
- config_name: nom-sg-to-nom-pl
data_files:
- split: train
path: nom-sg-to-nom-pl/train-*
- config_name: nom-sg-to-part-sg
data_files:
- split: train
path: nom-sg-to-part-sg/train-*
- config_name: part-pl-to-gen-pl
data_files:
- split: train
path: part-pl-to-gen-pl/train-*
- config_name: part-pl-to-ill-pl
data_files:
- split: train
path: part-pl-to-ill-pl/train-*
- config_name: part-sg-to-gen-sg
data_files:
- split: train
path: part-sg-to-gen-sg/train-*
- config_name: part-sg-to-ill-sg
data_files:
- split: train
path: part-sg-to-ill-sg/train-*
- config_name: part-sg-to-nom-sg
data_files:
- split: train
path: part-sg-to-nom-sg/train-*
- config_name: part-sg-to-part-pl
data_files:
- split: train
path: part-sg-to-part-pl/train-*
---
数据集信息:
- 配置名称:属格复数(genitive plural, GEN-PL)到向格复数(illative plural, ILL-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:148916,样本总数:500
下载体积:104561,数据集存储体积:148916
- 配置名称:属格复数(genitive plural, GEN-PL)到主格复数(nominative plural, NOM-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:146952,样本总数:500
下载体积:104092,数据集存储体积:146952
- 配置名称:属格复数(genitive plural, GEN-PL)到部分格复数(partitive plural, PART-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:147097,样本总数:500
下载体积:104193,数据集存储体积:147097
- 配置名称:属格单数(genitive singular, GEN-SG)到属格复数(genitive plural, GEN-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:495831,样本总数:1793
下载体积:326422,数据集存储体积:495831
- 配置名称:属格单数(genitive singular, GEN-SG)到向格单数(illative singular, ILL-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:438414,样本总数:1581
下载体积:291428,数据集存储体积:438414
- 配置名称:属格单数(genitive singular, GEN-SG)到主格单数(nominative singular, NOM-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:380416,样本总数:1337
下载体积:256308,数据集存储体积:380416
- 配置名称:属格单数(genitive singular, GEN-SG)到部分格单数(partitive singular, PART-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:409475,样本总数:1470
下载体积:271842,数据集存储体积:409475
- 配置名称:现在时直陈式第三人称复数(indicative present 3rd-person plural, IND-PRS-3PL)到第一人称复数(1st-person plural, 1PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:27748,样本总数:93
下载体积:22331,数据集存储体积:27748
- 配置名称:现在时直陈式第三人称复数(indicative present 3rd-person plural, IND-PRS-3PL)到第二人称复数(2nd-person plural, 2PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:27749,样本总数:93
下载体积:22310,数据集存储体积:27749
- 配置名称:现在时直陈式第三人称单数(indicative present 3rd-person singular, IND-PRS-3SG)到第一人称单数(1st-person singular, 1SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:57178,样本总数:195
下载体积:43510,数据集存储体积:57178
- 配置名称:现在时直陈式第三人称单数(indicative present 3rd-person singular, IND-PRS-3SG)到第二人称单数(2nd-person singular, 2SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:57178,样本总数:195
下载体积:43496,数据集存储体积:57178
- 配置名称:过去时直陈式第三人称复数(indicative past 3rd-person plural, IND-PST-3PL)到第一人称复数(1st-person plural, 1PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:35797,样本总数:103
下载体积:28386,数据集存储体积:35797
- 配置名称:过去时直陈式第三人称复数(indicative past 3rd-person plural, IND-PST-3PL)到第二人称复数(2nd-person plural, 2PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:35799,样本总数:103
下载体积:28348,数据集存储体积:35799
- 配置名称:过去时直陈式第三人称单数(indicative past 3rd-person singular, IND-PST-3SG)到第一人称单数(1st-person singular, 1SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:191237,样本总数:653
下载体积:132497,数据集存储体积:191237
- 配置名称:过去时直陈式第三人称单数(indicative past 3rd-person singular, IND-PST-3SG)到第二人称单数(2nd-person singular, 2SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:191236,样本总数:653
下载体积:132427,数据集存储体积:191236
- 配置名称:主格复数(nominative plural, NOM-PL)到属格复数(genitive plural, GEN-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:125921,样本总数:452
下载体积:87765,数据集存储体积:125921
- 配置名称:主格复数(nominative plural, NOM-PL)到向格复数(illative plural, ILL-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:127276,样本总数:452
下载体积:88020,数据集存储体积:127276
- 配置名称:主格复数(nominative plural, NOM-PL)到部分格复数(partitive plural, PART-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:125641,样本总数:452
下载体积:87685,数据集存储体积:125641
- 配置名称:主格单数(nominative singular, NOM-SG)到属格单数(genitive singular, GEN-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:301018,样本总数:1092
下载体积:206240,数据集存储体积:301018
- 配置名称:主格单数(nominative singular, NOM-SG)到向格单数(illative singular, ILL-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:358539,样本总数:1316
下载体积:243096,数据集存储体积:358539
- 配置名称:主格单数(nominative singular, NOM-SG)到主格复数(nominative plural, NOM-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:357969,样本总数:1316
下载体积:242787,数据集存储体积:357969
- 配置名称:主格单数(nominative singular, NOM-SG)到部分格单数(partitive singular, PART-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:326801,样本总数:1184
下载体积:223657,数据集存储体积:326801
- 配置名称:部分格复数(partitive plural, PART-PL)到属格复数(genitive plural, GEN-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:73051,样本总数:241
下载体积:53746,数据集存储体积:73051
- 配置名称:部分格复数(partitive plural, PART-PL)到向格复数(illative plural, ILL-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:73772,样本总数:241
下载体积:53953,数据集存储体积:73772
- 配置名称:部分格单数(partitive singular, PART-SG)到属格单数(genitive singular, GEN-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:149312,样本总数:536
下载体积:104106,数据集存储体积:149312
- 配置名称:部分格单数(partitive singular, PART-SG)到向格单数(illative singular, ILL-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:133173,样本总数:461
下载体积:93284,数据集存储体积:133173
- 配置名称:部分格单数(partitive singular, PART-SG)到主格单数(nominative singular, NOM-SG)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:182777,样本总数:649
下载体积:127580,数据集存储体积:182777
- 配置名称:部分格单数(partitive singular, PART-SG)到部分格复数(partitive plural, PART-PL)
特征字段:
- 字段名:`correct`,数据类型:字符串(string)
- 字段名:`incorrect`,数据类型:字符串(string)
数据集划分:
- 划分名称:train(训练集),字节占用量:200139,样本总数:706
下载体积:139232,数据集存储体积:200139
配置项:
每个配置项对应上述数据集划分,数据文件均仅包含训练集划分,路径格式为`[配置名称]/train-*`
提供机构:
liu-nlp



