US National Flight Data 2015 - 2020
收藏www.kaggle.com2021-02-18 更新2025-01-21 收录
下载链接:
https://www.kaggle.com/bingecode/us-national-flight-data-2015-2020
下载链接
链接失效反馈官方服务:
资源简介:
### Context
This data set was retrieved from the [Transtats](https://www.transtats.bts.gov/homepage.asp) webpage of the Bureau of Transportation Statistics of the US Department of Transportation. This data was cleaned and made ready for use in an [university project](https://github.com/BingeCode/columnstore-innodb-monetdb) where the goal was to compare different database engines in terms of performance and more.
NOTE: December 2020 was not included in the data set since it was not made available by the BTS as of today, 18th Feb 2021.
### Content
The data is split into CSV files for each year 2015 to 2020.
flights.csv contains all the data in one file.
The CSV files contain no headers for the columns. The headers are as follows:
```
'YEAR', 'MONTH', 'DAY_OF_MONTH', 'DAY_OF_WEEK', 'OP_UNIQUE_CARRIER', 'ORIGIN_CITY_NAME',
'ORIGIN_STATE_ABR', 'DEST_CITY_NAME', 'DEST_STATE_ABR', 'CRS_DEP_TIME', 'DEP_DELAY_NEW',
'CRS_ARR_TIME', 'ARR_DELAY_NEW', 'CANCELLED', 'CANCELLATION_CODE', 'AIR_TIME', 'DISTANCE'
```
NOTE: The headers were removed due to the requirement of easily importing the data into SQL
### Other
If you have any questions about how I retrieved/cleaned the data or anything about my project, feel free to check out my [Github repository](https://github.com/BingeCode/columnstore-innodb-monetdb) or shoot me a message.
本数据集源自美国交通部统计局(Bureau of Transportation Statistics)的[Transtats](https://www.transtats.bts.gov/homepage.asp)网页。数据经过清洗并准备用于[大学项目](https://github.com/BingeCode/columnstore-innodb-monetdb),项目旨在对比不同数据库引擎在性能等方面的优劣。
注意:由于截至2021年2月18日,美国交通部统计局尚未提供2020年12月的数据,因此该月数据未包含在本数据集中。
### 数据内容
数据按年度分割为CSV文件,范围从2015年至2020年。
flights.csv文件包含了该年度所有数据。
CSV文件中的列无标题,以下列出了标题内容:
'YEAR', 'MONTH', 'DAY_OF_MONTH', 'DAY_OF_WEEK', 'OP_UNIQUE_CARRIER', 'ORIGIN_CITY_NAME',
'ORIGIN_STATE_ABR', 'DEST_CITY_NAME', 'DEST_STATE_ABR', 'CRS_DEP_TIME', 'DEP_DELAY_NEW',
'CRS_ARR_TIME', 'ARR_DELAY_NEW', 'CANCELLED', 'CANCELLATION_CODE', 'AIR_TIME', 'DISTANCE'
注意:由于易于导入SQL的需求,已移除列标题。
### 其他信息
如对数据检索、清洗过程或项目内容有任何疑问,欢迎查阅我的[GitHub仓库](https://github.com/BingeCode/columnstore-innodb-monetdb)或与我联系。
提供机构:
Kaggle



