
US National Flight Data 2015 - 2020

Source: www.kaggle.com | Updated: 2021-02-18 | Indexed: 2025-01-21
Download link:
https://www.kaggle.com/bingecode/us-national-flight-data-2015-2020
Description:
### Context

This data set was retrieved from the [Transtats](https://www.transtats.bts.gov/homepage.asp) webpage of the Bureau of Transportation Statistics (BTS) of the US Department of Transportation. The data was cleaned and prepared for a [university project](https://github.com/BingeCode/columnstore-innodb-monetdb) whose goal was to compare different database engines in terms of performance and more.

NOTE: December 2020 is not included in the data set, since the BTS had not made it available as of 18 February 2021.

### Content

The data is split into one CSV file per year, from 2015 to 2020. flights.csv contains all the data in one file.

The CSV files contain no column headers. The headers are as follows:

```
'YEAR', 'MONTH', 'DAY_OF_MONTH', 'DAY_OF_WEEK', 'OP_UNIQUE_CARRIER',
'ORIGIN_CITY_NAME', 'ORIGIN_STATE_ABR', 'DEST_CITY_NAME', 'DEST_STATE_ABR',
'CRS_DEP_TIME', 'DEP_DELAY_NEW', 'CRS_ARR_TIME', 'ARR_DELAY_NEW',
'CANCELLED', 'CANCELLATION_CODE', 'AIR_TIME', 'DISTANCE'
```

NOTE: The headers were removed so that the data can be imported into SQL easily.

### Other

If you have any questions about how I retrieved or cleaned the data, or anything else about my project, feel free to check out my [GitHub repository](https://github.com/BingeCode/columnstore-innodb-monetdb) or send me a message.
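Because the CSV files ship without a header row, the column names listed under Content have to be supplied by the reader. The following is a minimal sketch of one way to do that with Python's standard `csv` module. The helper name `read_flights`, the sample row, and the assumption that the files are comma-delimited with quoted city names are illustrative and not confirmed by the data set description.

```python
import csv
import io

# Column names as listed in the data set description; the files have no header row.
COLUMNS = [
    "YEAR", "MONTH", "DAY_OF_MONTH", "DAY_OF_WEEK", "OP_UNIQUE_CARRIER",
    "ORIGIN_CITY_NAME", "ORIGIN_STATE_ABR", "DEST_CITY_NAME", "DEST_STATE_ABR",
    "CRS_DEP_TIME", "DEP_DELAY_NEW", "CRS_ARR_TIME", "ARR_DELAY_NEW",
    "CANCELLED", "CANCELLATION_CODE", "AIR_TIME", "DISTANCE",
]


def read_flights(handle):
    """Yield one dict per row, keyed by COLUMNS (hypothetical helper)."""
    # Passing fieldnames tells DictReader to treat the first line as data.
    reader = csv.DictReader(handle, fieldnames=COLUMNS)
    for row in reader:
        yield row


# Made-up sample row for illustration only; it is not taken from the data set.
# Assumes comma-delimited fields with city names quoted.
sample = io.StringIO(
    '2015,1,1,4,AA,"New York, NY",NY,"Los Angeles, CA",CA,'
    "900,0.00,1230,5.00,0.00,,330.00,2475.00\n"
)
rows = list(read_flights(sample))
print(rows[0]["ORIGIN_CITY_NAME"], rows[0]["DISTANCE"])
```

To read a real file, replace the in-memory sample with something like `open("flights.csv", newline="")`. All values come back as strings, so numeric columns such as `DEP_DELAY_NEW` need an explicit conversion before any arithmetic.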

Provider:
Kaggle