coref-data/corefud_indiscrim
收藏Hugging Face2024-01-21 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/coref-data/corefud_indiscrim
下载链接
链接失效反馈官方服务:
资源简介:
该数据集包含多个配置,每个配置对应不同的语言和来源。每个配置包括句子、词元、共指链和元数据等特征。数据集分为训练集和验证集,并提供了每个分区的字节数和示例数的详细信息。该数据集设计用于共指消解和语言分析任务。
该数据集包含多个配置,每个配置对应不同的语言和来源。每个配置包括句子、词元、共指链和元数据等特征。数据集分为训练集和验证集,并提供了每个分区的字节数和示例数的详细信息。该数据集设计用于共指消解和语言分析任务。
提供机构:
coref-data
原始信息汇总
数据集概述
数据集配置
ca_ancora-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: float64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 38341803num_examples: 1011
validation:num_bytes: 5660530num_examples: 131
- 下载大小: 7906331
- 数据集大小: 44002333
cs_pcedt-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: float64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 149583151num_examples: 1875
validation:num_bytes: 26160516num_examples: 337
- 下载大小: 31260936
- 数据集大小: 175743667
cs_pdt-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: float64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 109542424num_examples: 2533
validation:num_bytes: 14886840num_examples: 316
- 下载大小: 23982751
- 数据集大小: 124429264
de_parcorfull-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: int64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 1035732num_examples: 15
validation:num_bytes: 132412num_examples: 2
- 下载大小: 273217
- 数据集大小: 1168144
de_potsdamcc-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: int64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 3999054num_examples: 142
validation:num_bytes: 511557num_examples: 17
- 下载大小: 859121
- 数据集大小: 4510611
en_gum-corefud
- 特征:
sentences:id: int64speaker: stringtext: stringtokens:deprel: stringfeats: stringhead: int64id: float64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 17919310num_examples: 151
validation:num_bytes: 2369056num_examples: 22
- 下载大小: 4234788
- 数据集大小: 20288366
en_parcorfull-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: int64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 899917num_examples: 15
validation:num_bytes: 115587num_examples: 2
- 下载大小: 259976
- 数据集大小: 1015504
es_ancora-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: float64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 43242148num_examples: 1080
validation:num_bytes: 5404400num_examples: 131
- 下载大小: 8758107
- 数据集大小: 48646548
fr_democrat-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: int64lemma: stringmisc: stringtext: stringupos: stringxpos: null
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 23704875num_examples: 50
validation:num_bytes: 2914195num_examples: 46
- 下载大小: 5011046
- 数据集大小: 26619070
hu_korkor-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: float64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 2358029num_examples: 76
validation:num_bytes: 305829num_examples: 9
- 下载大小: 644899
- 数据集大小: 2663858
hu_szegedkoref-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: float64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 11618556num_examples: 320
validation:num_bytes: 1365657num_examples: 40
- 下载大小: 2509790
- 数据集大小: 12984213
lt_lcc-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: int64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 3908009num_examples: 80
validation:num_bytes: 435994num_examples: 10
- 下载大小: 802890
- 数据集大小: 4344003
no_bokmaalnarc-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: int64lemma: stringmisc: stringtext: stringupos: stringxpos: null
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 21847333num_examples: 284
validation:num_bytes: 2319889num_examples: 31
- 下载大小: 4979662
- 数据集大小: 24167222
no_nynorsknarc-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: int64lemma: stringmisc: stringtext: stringupos: stringxpos: null
id: stringtext: stringcoref_chains: sequence of sequence of sequence of int64genre: nullmeta_data:comment: string
- 分割:
train:num_bytes: 18472313num_examples: 336
validation:num_bytes: 1904614num_examples: 28
- 下载大小: 4209149
- 数据集大小: 20376927
pl_pcc-corefud
- 特征:
sentences:id: int64speaker: nulltext: stringtokens:deprel: stringfeats: stringhead: int64id: float64lemma: stringmisc: stringtext: stringupos: stringxpos: string
id: stringtext: string- `core



