Kaludi/data-eurekaQA
收藏Hugging Face2023-02-03 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/Kaludi/data-eurekaQA
下载链接
链接失效反馈官方服务:
资源简介:
---
language:
- en
---
# Dataset for project: eurekaqa
This dataset has been trained for project eurekaQA.
### Languages
The BCP-47 code for the dataset's language is en.
## Dataset Structure
### Data Instances
A sample from this dataset looks as follows:
```json
[
{
"context": "Colquhoun's utilitarian approach to the problem \u2013 using a cost-benefit argument to obtain support from businesses standing to benefit \u2013 allowed him to achieve what Henry and John Fielding failed for their Bow Street detectives. Unlike the stipendiary system at Bow Street, the river police were full-time, salaried officers prohibited from taking private fees. His other contribution was the concept of preventive policing; his police were to act as a highly visible deterrent to crime by their permanent presence on the Thames. Colquhoun's innovations were a critical development leading up to Robert Peel's \"new\" police three decades later.",
"question": "How did the Thames River Police pay their employees?",
"answers.text": [
"full-time, salaried officers prohibited from taking private fees"
],
"answers.answer_start": [
295
]
},
{
"context": "The small woolen dolls called Maniae, hung on the Compitalia shrines, were thought a symbolic replacement for child-sacrifice to Mania, as Mother of the Lares. The Junii took credit for its abolition by their ancestor L. Junius Brutus, traditionally Rome's Republican founder and first consul. Political or military executions were sometimes conducted in such a way that they evoked human sacrifice, whether deliberately or in the perception of witnesses; Marcus Marius Gratidianus was a gruesome example.",
"question": "Who was Mania in Roman religion?",
"answers.text": [
"Mother of the Lares"
],
"answers.answer_start": [
139
]
}
]
```
### Dataset Fields
The dataset has the following fields (also called "features"):
```json
{
"context": "Value(dtype='string', id=None)",
"question": "Value(dtype='string', id=None)",
"answers.text": "Sequence(feature=Value(dtype='string', id=None), length=-1, id=None)",
"answers.answer_start": "Sequence(feature=Value(dtype='int32', id=None), length=-1, id=None)"
}
```
### Dataset Splits
This dataset is split into a train and validation split. The split sizes are as follow:
| Split name | Num samples |
| ------------ | ------------------- |
| train | 8996 |
| valid | 998 |
提供机构:
Kaludi
原始信息汇总
数据集概述
数据集名称
eurekaQA
语言
- 语言代码:en
数据集结构
数据实例
数据实例包含以下字段:
context: 文本内容question: 问题answers.text: 答案文本answers.answer_start: 答案在文本中的起始位置
数据集字段
数据集包含以下字段:
context: 字符串类型question: 字符串类型answers.text: 字符串序列answers.answer_start: 整数序列
数据集分割
数据集分为训练集和验证集,具体分割如下:
| 分割名称 | 样本数量 |
|---|---|
| 训练集 | 8996 |
| 验证集 | 998 |



