Aclairs/ALBERTFINALYEAR
AutoNLP Dataset for project: ALBERTFINALYEAR
Dataset Description
- Language: The language identifier of the dataset is unk.
Dataset Structure
Data Instances
Sample records from the dataset look as follows:

```json
[
  {
    "context": "Hasidic or Chasidic Judaism overlaps significantly with Haredi Judaism in its engagement with the se[...]",
    "question": "What overlaps significantly with Haredi Judiasm?",
    "answers.text": [
      "Chasidic Judaism"
    ],
    "answers.answer_start": [
      11
    ]
  },
  {
    "context": "Data compression can be viewed as a special case of data differencing: Data differencing consists of[...]",
    "question": "What can classified as data differencing with empty source data?",
    "answers.text": [
      "Data compression",
      "data compression"
    ],
    "answers.answer_start": [
      0,
      400
    ]
  }
]
```
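In these records each entry of `answers.answer_start` appears to be a character offset into `context` at which the matching entry of `answers.text` begins. The following is a minimal sketch of how that alignment could be checked. The helper name `check_answer_spans` is hypothetical, and the `context` string is only the visible prefix of the first sample, since the card truncates the full text with `[...]`.

```python
from typing import Any, Dict, List, Tuple


def check_answer_spans(record: Dict[str, Any]) -> List[Tuple[str, int, bool]]:
    """Return (answer text, start offset, matches) for every answer in a record.

    Assumes answer_start is a 0-based character offset into context.
    """
    context = record["context"]
    results = []
    for text, start in zip(record["answers.text"], record["answers.answer_start"]):
        # Slice the context at the stated offset and compare with the answer text.
        matches = context[start:start + len(text)] == text
        results.append((text, start, matches))
    return results


# Visible prefix of the first sample only; the full context is truncated in the card.
record = {
    "context": "Hasidic or Chasidic Judaism overlaps significantly with Haredi Judaism",
    "question": "What overlaps significantly with Haredi Judiasm?",
    "answers.text": ["Chasidic Judaism"],
    "answers.answer_start": [11],
}

print(check_answer_spans(record))
```

The second sample cannot be checked the same way from the card alone, because its offset of 400 falls beyond the truncated context shown here.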
Data Fields
The dataset has the following fields:

```json
{
  "context": "Value(dtype=string, id=None)",
  "question": "Value(dtype=string, id=None)",
  "answers.text": "Sequence(feature=Value(dtype=string, id=None), length=-1, id=None)",
  "answers.answer_start": "Sequence(feature=Value(dtype=int32, id=None), length=-1, id=None)"
}
```
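The field types listed above can be mirrored in a small plain-Python validator. This is a sketch, not part of the dataset tooling: the function name `validate_record` is hypothetical, and the rule that both answer sequences have the same length is an assumption inferred from the samples, since the schema only declares them as variable-length (`length=-1`).

```python
from typing import Any, Dict, List

# Bounds of a signed 32-bit integer, matching dtype=int32 for answer_start.
INT32_MIN, INT32_MAX = -2**31, 2**31 - 1


def validate_record(record: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the record fits the schema."""
    problems = []

    # context and question are plain strings.
    for key in ("context", "question"):
        if not isinstance(record.get(key), str):
            problems.append(f"{key} must be a string")

    # answers.text is a variable-length sequence of strings.
    texts = record.get("answers.text")
    if not (isinstance(texts, list) and all(isinstance(t, str) for t in texts)):
        problems.append("answers.text must be a list of strings")

    # answers.answer_start is a variable-length sequence of int32 values.
    starts = record.get("answers.answer_start")
    starts_ok = isinstance(starts, list) and all(
        isinstance(s, int) and not isinstance(s, bool) and INT32_MIN <= s <= INT32_MAX
        for s in starts
    )
    if not starts_ok:
        problems.append("answers.answer_start must be a list of int32 values")

    # Assumption: each answer text is paired with exactly one start offset.
    if isinstance(texts, list) and isinstance(starts, list) and len(texts) != len(starts):
        problems.append("answers.text and answers.answer_start differ in length")

    return problems


example = {
    "context": "Data compression can be viewed as a special case of data differencing",
    "question": "What can classified as data differencing with empty source data?",
    "answers.text": ["Data compression"],
    "answers.answer_start": [0],
}
print(validate_record(example))
```

Returning a list of messages instead of raising on the first error makes it easier to report every problem in a record at once.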
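Data Splits
The dataset is divided into a training split and a validation split. The split sizes are as follows:
| Split name | Number of samples |
|---|---|
| Training | 87433 |
| Validation | 10544 |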



