
Aeonai/autotrain-data-demo-2

Source: Hugging Face. Updated: 2023-09-01. Indexed: 2024-03-04.
Download link:
https://hf-mirror.com/datasets/Aeonai/autotrain-data-demo-2
Resource description:
---
language:
- en
---

# AutoTrain Dataset for project: demo-2

## Dataset Description

This dataset has been automatically processed by AutoTrain for project demo-2.

### Languages

The BCP-47 code for the dataset's language is en.

## Dataset Structure

### Data Instances

A sample from this dataset looks as follows:

```json
[
  {
    "context": "In the late 1970s many arterial roads were redesigned as ejes viales; high-volume one-way roads that cross, in theory, Mexico City proper from side to side. The eje vial network is based on a quasi-Cartesian grid, with the ejes themselves being called Eje 1 Poniente, Eje Central, and Eje 1 Oriente, for example, for the north-south roads, and Eje 2 Sur and Eje 3 Norte, for example, for east-west roads. Ring roads are the Circuito Interior (inner ring), Anillo Perif\u00e9rico; the Circuito Exterior Mexiquense (\"State of Mexico outer loop\") toll road skirting the northeastern and eastern edges of the metropolitan area, the Chamapa-La Venta toll road skirting the northwestern edge, and the Arco Norte completely bypassing the metropolitan area in an arc from northwest (Atlacomulco) to north (Tula, Hidalgo) to east (Puebla). A second level (where tolls are charged) of the Perif\u00e9rico, colloquially called the segundo piso (\"second floor\"), was officially opened in 2012, with sections still being completed. The Viaducto Miguel Alem\u00e1n crosses the city east-west from Observatorio to the airport. In 2013 the Superv\u00eda Poniente opened, a toll road linking the new Santa Fe business district with southwestern Mexico City.",
    "question": "When were these second level roads opened?",
    "answers.text": [
      "2012"
    ],
    "answers.answer_start": [
      966
    ],
    "feat_id": [
      "572694bef1498d1400e8e468"
    ],
    "feat_title": [
      "Mexico_City"
    ]
  },
  {
    "context": "The first Code of Canon Law, 1917, was mostly for the Roman Rite, with limited application to the Eastern Churches. After the Second Vatican Council, (1962 - 1965), another edition was published specifically for the Roman Rite in 1983. Most recently, 1990, the Vatican produced the Code of Canons of the Eastern Churches which became the 1st code of Eastern Catholic Canon Law.",
    "question": "For which part of the Roman Catholic Church was the first Code published?",
    "answers.text": [
      "the Roman Rite"
    ],
    "answers.answer_start": [
      50
    ],
    "feat_id": [
      "56e1040ecd28a01900c6743e"
    ],
    "feat_title": [
      "Canon_law"
    ]
  }
]
```

### Dataset Fields

The dataset has the following fields (also called "features"):

```json
{
  "context": "Value(dtype='string', id=None)",
  "question": "Value(dtype='string', id=None)",
  "answers.text": "Sequence(feature=Value(dtype='string', id=None), length=-1, id=None)",
  "answers.answer_start": "Sequence(feature=Value(dtype='int32', id=None), length=-1, id=None)",
  "feat_id": "Sequence(feature=Value(dtype='string', id=None), length=-1, id=None)",
  "feat_title": "Sequence(feature=Value(dtype='string', id=None), length=-1, id=None)"
}
```

### Dataset Splits

This dataset is split into a train and a validation split. The split sizes are as follows:

| Split name | Num samples |
| ---------- | ----------- |
| train      | 87433       |
| valid      | 10546       |
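
The card does not say how `answers.answer_start` should be interpreted. The second sample is consistent with a 0-based character offset into `context`, so a quick sanity check along those lines may be useful before training. The sketch below is only an illustration under that assumption: `span_matches` is a hypothetical helper name, not part of AutoTrain or any library, and the record is copied from the sample shown in the card.

```python
# Second record from the card's sample, copied verbatim.
record = {
    "context": (
        "The first Code of Canon Law, 1917, was mostly for the Roman Rite, "
        "with limited application to the Eastern Churches. After the Second "
        "Vatican Council, (1962 - 1965), another edition was published "
        "specifically for the Roman Rite in 1983. Most recently, 1990, the "
        "Vatican produced the Code of Canons of the Eastern Churches which "
        "became the 1st code of Eastern Catholic Canon Law."
    ),
    "question": (
        "For which part of the Roman Catholic Church was the first Code published?"
    ),
    "answers.text": ["the Roman Rite"],
    "answers.answer_start": [50],
}


def span_matches(rec):
    """Return one bool per answer: True if the text is found at its offset.

    Assumes (not confirmed by the card) that each answer_start is a
    0-based character offset into rec["context"].
    """
    context = rec["context"]
    return [
        context[start:start + len(text)] == text
        for text, start in zip(rec["answers.text"], rec["answers.answer_start"])
    ]


print(span_matches(record))  # [True]
```

Running the same check over every row of the train and valid splits would flag records whose offsets do not line up with their answer text, for example after re-encoding or whitespace normalization.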
Provider:
Aeonai