five

liu-nlp/estonian-blimp-single-error

收藏
Hugging Face2025-12-04 更新2025-12-20 收录
下载链接:
https://hf-mirror.com/datasets/liu-nlp/estonian-blimp-single-error
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: - config_name: gen-pl-to-ill-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 148916 num_examples: 500 download_size: 104561 dataset_size: 148916 - config_name: gen-pl-to-nom-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 146952 num_examples: 500 download_size: 104092 dataset_size: 146952 - config_name: gen-pl-to-part-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 147097 num_examples: 500 download_size: 104193 dataset_size: 147097 - config_name: gen-sg-to-gen-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 495831 num_examples: 1793 download_size: 326422 dataset_size: 495831 - config_name: gen-sg-to-ill-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 438414 num_examples: 1581 download_size: 291428 dataset_size: 438414 - config_name: gen-sg-to-nom-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 380416 num_examples: 1337 download_size: 256308 dataset_size: 380416 - config_name: gen-sg-to-part-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 409475 num_examples: 1470 download_size: 271842 dataset_size: 409475 - config_name: ind-prs-3pl-to-1pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 27748 num_examples: 93 download_size: 22331 dataset_size: 27748 - config_name: ind-prs-3pl-to-2pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 27749 num_examples: 93 download_size: 22310 dataset_size: 27749 - config_name: ind-prs-3sg-to-1sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 57178 num_examples: 195 download_size: 43510 dataset_size: 57178 - config_name: ind-prs-3sg-to-2sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 57178 num_examples: 195 download_size: 43496 dataset_size: 57178 - config_name: ind-pst-3pl-to-1pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 35797 num_examples: 103 download_size: 28386 dataset_size: 35797 - config_name: ind-pst-3pl-to-2pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 35799 num_examples: 103 download_size: 28348 dataset_size: 35799 - config_name: ind-pst-3sg-to-1sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 191237 num_examples: 653 download_size: 132497 dataset_size: 191237 - config_name: ind-pst-3sg-to-2sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 191236 num_examples: 653 download_size: 132427 dataset_size: 191236 - config_name: nom-pl-to-gen-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 125921 num_examples: 452 download_size: 87765 dataset_size: 125921 - config_name: nom-pl-to-ill-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 127276 num_examples: 452 download_size: 88020 dataset_size: 127276 - config_name: nom-pl-to-part-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 125641 num_examples: 452 download_size: 87685 dataset_size: 125641 - config_name: nom-sg-to-gen-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 301018 num_examples: 1092 download_size: 206240 dataset_size: 301018 - config_name: nom-sg-to-ill-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 358539 num_examples: 1316 download_size: 243096 dataset_size: 358539 - config_name: nom-sg-to-nom-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 357969 num_examples: 1316 download_size: 242787 dataset_size: 357969 - config_name: nom-sg-to-part-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 326801 num_examples: 1184 download_size: 223657 dataset_size: 326801 - config_name: part-pl-to-gen-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 73051 num_examples: 241 download_size: 53746 dataset_size: 73051 - config_name: part-pl-to-ill-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 73772 num_examples: 241 download_size: 53953 dataset_size: 73772 - config_name: part-sg-to-gen-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 149312 num_examples: 536 download_size: 104106 dataset_size: 149312 - config_name: part-sg-to-ill-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 133173 num_examples: 461 download_size: 93284 dataset_size: 133173 - config_name: part-sg-to-nom-sg features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 182777 num_examples: 649 download_size: 127580 dataset_size: 182777 - config_name: part-sg-to-part-pl features: - name: correct dtype: string - name: incorrect dtype: string splits: - name: train num_bytes: 200139 num_examples: 706 download_size: 139232 dataset_size: 200139 configs: - config_name: gen-pl-to-ill-pl data_files: - split: train path: gen-pl-to-ill-pl/train-* - config_name: gen-pl-to-nom-pl data_files: - split: train path: gen-pl-to-nom-pl/train-* - config_name: gen-pl-to-part-pl data_files: - split: train path: gen-pl-to-part-pl/train-* - config_name: gen-sg-to-gen-pl data_files: - split: train path: gen-sg-to-gen-pl/train-* - config_name: gen-sg-to-ill-sg data_files: - split: train path: gen-sg-to-ill-sg/train-* - config_name: gen-sg-to-nom-sg data_files: - split: train path: gen-sg-to-nom-sg/train-* - config_name: gen-sg-to-part-sg data_files: - split: train path: gen-sg-to-part-sg/train-* - config_name: ind-prs-3pl-to-1pl data_files: - split: train path: ind-prs-3pl-to-1pl/train-* - config_name: ind-prs-3pl-to-2pl data_files: - split: train path: ind-prs-3pl-to-2pl/train-* - config_name: ind-prs-3sg-to-1sg data_files: - split: train path: ind-prs-3sg-to-1sg/train-* - config_name: ind-prs-3sg-to-2sg data_files: - split: train path: ind-prs-3sg-to-2sg/train-* - config_name: ind-pst-3pl-to-1pl data_files: - split: train path: ind-pst-3pl-to-1pl/train-* - config_name: ind-pst-3pl-to-2pl data_files: - split: train path: ind-pst-3pl-to-2pl/train-* - config_name: ind-pst-3sg-to-1sg data_files: - split: train path: ind-pst-3sg-to-1sg/train-* - config_name: ind-pst-3sg-to-2sg data_files: - split: train path: ind-pst-3sg-to-2sg/train-* - config_name: nom-pl-to-gen-pl data_files: - split: train path: nom-pl-to-gen-pl/train-* - config_name: nom-pl-to-ill-pl data_files: - split: train path: nom-pl-to-ill-pl/train-* - config_name: nom-pl-to-part-pl data_files: - split: train path: nom-pl-to-part-pl/train-* - config_name: nom-sg-to-gen-sg data_files: - split: train path: nom-sg-to-gen-sg/train-* - config_name: nom-sg-to-ill-sg data_files: - split: train path: nom-sg-to-ill-sg/train-* - config_name: nom-sg-to-nom-pl data_files: - split: train path: nom-sg-to-nom-pl/train-* - config_name: nom-sg-to-part-sg data_files: - split: train path: nom-sg-to-part-sg/train-* - config_name: part-pl-to-gen-pl data_files: - split: train path: part-pl-to-gen-pl/train-* - config_name: part-pl-to-ill-pl data_files: - split: train path: part-pl-to-ill-pl/train-* - config_name: part-sg-to-gen-sg data_files: - split: train path: part-sg-to-gen-sg/train-* - config_name: part-sg-to-ill-sg data_files: - split: train path: part-sg-to-ill-sg/train-* - config_name: part-sg-to-nom-sg data_files: - split: train path: part-sg-to-nom-sg/train-* - config_name: part-sg-to-part-pl data_files: - split: train path: part-sg-to-part-pl/train-* ---

数据集信息: - 配置名称:属格复数(genitive plural, GEN-PL)到向格复数(illative plural, ILL-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:148916,样本总数:500 下载体积:104561,数据集存储体积:148916 - 配置名称:属格复数(genitive plural, GEN-PL)到主格复数(nominative plural, NOM-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:146952,样本总数:500 下载体积:104092,数据集存储体积:146952 - 配置名称:属格复数(genitive plural, GEN-PL)到部分格复数(partitive plural, PART-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:147097,样本总数:500 下载体积:104193,数据集存储体积:147097 - 配置名称:属格单数(genitive singular, GEN-SG)到属格复数(genitive plural, GEN-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:495831,样本总数:1793 下载体积:326422,数据集存储体积:495831 - 配置名称:属格单数(genitive singular, GEN-SG)到向格单数(illative singular, ILL-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:438414,样本总数:1581 下载体积:291428,数据集存储体积:438414 - 配置名称:属格单数(genitive singular, GEN-SG)到主格单数(nominative singular, NOM-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:380416,样本总数:1337 下载体积:256308,数据集存储体积:380416 - 配置名称:属格单数(genitive singular, GEN-SG)到部分格单数(partitive singular, PART-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:409475,样本总数:1470 下载体积:271842,数据集存储体积:409475 - 配置名称:现在时直陈式第三人称复数(indicative present 3rd-person plural, IND-PRS-3PL)到第一人称复数(1st-person plural, 1PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:27748,样本总数:93 下载体积:22331,数据集存储体积:27748 - 配置名称:现在时直陈式第三人称复数(indicative present 3rd-person plural, IND-PRS-3PL)到第二人称复数(2nd-person plural, 2PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:27749,样本总数:93 下载体积:22310,数据集存储体积:27749 - 配置名称:现在时直陈式第三人称单数(indicative present 3rd-person singular, IND-PRS-3SG)到第一人称单数(1st-person singular, 1SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:57178,样本总数:195 下载体积:43510,数据集存储体积:57178 - 配置名称:现在时直陈式第三人称单数(indicative present 3rd-person singular, IND-PRS-3SG)到第二人称单数(2nd-person singular, 2SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:57178,样本总数:195 下载体积:43496,数据集存储体积:57178 - 配置名称:过去时直陈式第三人称复数(indicative past 3rd-person plural, IND-PST-3PL)到第一人称复数(1st-person plural, 1PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:35797,样本总数:103 下载体积:28386,数据集存储体积:35797 - 配置名称:过去时直陈式第三人称复数(indicative past 3rd-person plural, IND-PST-3PL)到第二人称复数(2nd-person plural, 2PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:35799,样本总数:103 下载体积:28348,数据集存储体积:35799 - 配置名称:过去时直陈式第三人称单数(indicative past 3rd-person singular, IND-PST-3SG)到第一人称单数(1st-person singular, 1SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:191237,样本总数:653 下载体积:132497,数据集存储体积:191237 - 配置名称:过去时直陈式第三人称单数(indicative past 3rd-person singular, IND-PST-3SG)到第二人称单数(2nd-person singular, 2SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:191236,样本总数:653 下载体积:132427,数据集存储体积:191236 - 配置名称:主格复数(nominative plural, NOM-PL)到属格复数(genitive plural, GEN-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:125921,样本总数:452 下载体积:87765,数据集存储体积:125921 - 配置名称:主格复数(nominative plural, NOM-PL)到向格复数(illative plural, ILL-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:127276,样本总数:452 下载体积:88020,数据集存储体积:127276 - 配置名称:主格复数(nominative plural, NOM-PL)到部分格复数(partitive plural, PART-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:125641,样本总数:452 下载体积:87685,数据集存储体积:125641 - 配置名称:主格单数(nominative singular, NOM-SG)到属格单数(genitive singular, GEN-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:301018,样本总数:1092 下载体积:206240,数据集存储体积:301018 - 配置名称:主格单数(nominative singular, NOM-SG)到向格单数(illative singular, ILL-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:358539,样本总数:1316 下载体积:243096,数据集存储体积:358539 - 配置名称:主格单数(nominative singular, NOM-SG)到主格复数(nominative plural, NOM-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:357969,样本总数:1316 下载体积:242787,数据集存储体积:357969 - 配置名称:主格单数(nominative singular, NOM-SG)到部分格单数(partitive singular, PART-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:326801,样本总数:1184 下载体积:223657,数据集存储体积:326801 - 配置名称:部分格复数(partitive plural, PART-PL)到属格复数(genitive plural, GEN-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:73051,样本总数:241 下载体积:53746,数据集存储体积:73051 - 配置名称:部分格复数(partitive plural, PART-PL)到向格复数(illative plural, ILL-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:73772,样本总数:241 下载体积:53953,数据集存储体积:73772 - 配置名称:部分格单数(partitive singular, PART-SG)到属格单数(genitive singular, GEN-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:149312,样本总数:536 下载体积:104106,数据集存储体积:149312 - 配置名称:部分格单数(partitive singular, PART-SG)到向格单数(illative singular, ILL-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:133173,样本总数:461 下载体积:93284,数据集存储体积:133173 - 配置名称:部分格单数(partitive singular, PART-SG)到主格单数(nominative singular, NOM-SG) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:182777,样本总数:649 下载体积:127580,数据集存储体积:182777 - 配置名称:部分格单数(partitive singular, PART-SG)到部分格复数(partitive plural, PART-PL) 特征字段: - 字段名:`correct`,数据类型:字符串(string) - 字段名:`incorrect`,数据类型:字符串(string) 数据集划分: - 划分名称:train(训练集),字节占用量:200139,样本总数:706 下载体积:139232,数据集存储体积:200139 配置项: 每个配置项对应上述数据集划分,数据文件均仅包含训练集划分,路径格式为`[配置名称]/train-*`
提供机构:
liu-nlp
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作