abaki/autotrain-data-testproject
收藏AutoTrain Dataset for project: testproject
数据集描述
本数据集由AutoTrain自动处理,用于项目testproject。
语言
数据集的语言BCP-47代码为en。
数据集结构
数据实例
数据集样本示例如下:
json [ { "context": "", "question": "Identify which instrument is string or percussion: Vibraslap, Inanga", "answers.text": [ "Vibraslap is percussion, Inanga is string." ], "answers.answer_start": [ 0 ], "feat_category": [ "classification" ] }, { "context": "Crypto AG was a Swiss company specialising in communications and information security founded by Boris Hagelin in 1952. The company was secretly purchased for US $5.75 million and jointly owned by the American Central Intelligence Agency (CIA) and West German Federal Intelligence Service (BND) from 1970 until about 1993, with the CIA continuing as sole owner until about 2018. The mission of breaking encrypted communication using a secretly owned company was known as "Operation Rubikon". With headquarters in Steinhausen, the company was a long-established manufacturer of encryption machines and a wide variety of cipher devices.", "question": "Is data security an illusion?", "answers.text": [ "The long answer is yes." ], "answers.answer_start": [ 0 ], "feat_category": [ "summarization" ] } ]
数据集字段
数据集包含以下字段:
json { "context": "Value(dtype=string, id=None)", "question": "Value(dtype=string, id=None)", "answers.text": "Sequence(feature=Value(dtype=string, id=None), length=-1, id=None)", "answers.answer_start": "Sequence(feature=Value(dtype=int32, id=None), length=-1, id=None)", "feat_category": "Sequence(feature=Value(dtype=string, id=None), length=-1, id=None)" }
数据集分割
数据集分为训练集和验证集,分割大小如下:
| 分割名称 | 样本数量 |
|---|---|
| 训练集 | 15827 |
| 验证集 | 3957 |



