NYC Taxi Rides, STAR, AIRLINE
收藏数据集概述
数据集列表
-
Dataset NYC Taxi Rides
- 数据库: nyc_taxi_rides
- 表:
- central_park_weather_observations
- taxi_zones
- tripdata
- 数据源: 远程的‘etalon dataset server’
-
Dataset STAR
- 数据库: star
- 表: starexp
- 数据源: 远程的‘etalon dataset server’
-
Dataset AIRLINE
- 数据库: airline
- 表: ontime
- 数据源: 远程的‘etalon dataset server’
数据集设置步骤
Dataset NYC Taxi Rides
-
数据库创建: bash clickhouse-client -q "CREATE DATABASE IF NOT EXISTS nyc_taxi_rides;"
-
表创建: bash clickhouse-client -q "CREATE TABLE nyc_taxi_rides.central_park_weather_observations (...) ENGINE = MergeTree(...);" clickhouse-client -q "CREATE TABLE nyc_taxi_rides.taxi_zones (...) ENGINE = MergeTree(...);" clickhouse-client -q "CREATE TABLE nyc_taxi_rides.tripdata (...) ENGINE = MergeTree(...);"
-
数据复制: bash clickhouse-client -q "INSERT INTO nyc_taxi_rides.central_park_weather_observations SELECT * FROM remote(127.0.0.1:9999, nyc_taxi_rides.central_park_weather_observations);" clickhouse-client -q "INSERT INTO nyc_taxi_rides.taxi_zones SELECT * FROM remote(127.0.0.1:9999, nyc_taxi_rides.taxi_zones);" clickhouse-client -q "INSERT INTO nyc_taxi_rides.tripdata SELECT * FROM remote(127.0.0.1:9999, nyc_taxi_rides.tripdata);"
-
数据检查: bash clickhouse-client -q "SELECT count() FROM nyc_taxi_rides.central_park_weather_observations;" clickhouse-client -q "SELECT count() FROM nyc_taxi_rides.taxi_zones;" clickhouse-client -q "SELECT count() FROM nyc_taxi_rides.tripdata;"
Dataset STAR
-
数据库创建: bash clickhouse-client -q "CREATE DATABASE IF NOT EXISTS star;"
-
表创建: bash clickhouse-client -q "CREATE TABLE star.starexp (...) ENGINE = MergeTree(...);"
-
数据复制: bash clickhouse-client -q "INSERT INTO star.starexp SELECT * FROM remote(127.0.0.1:9999, star.starexp);"
-
数据检查: bash clickhouse-client -q "SELECT count() FROM star.starexp;"
Dataset AIRLINE
-
数据库创建: bash clickhouse-client -q "CREATE DATABASE IF NOT EXISTS airline;"
-
表创建: bash clickhouse-client -q "CREATE TABLE IF NOT EXISTS airline.ontime (...) ENGINE = MergeTree(...);"
-
数据复制: bash clickhouse-client -q "INSERT INTO airline.ontime SELECT * FROM remote(127.0.0.1:9999, airline.ontime);"
-
数据检查: bash clickhouse-client -q "SELECT count() FROM airline.ontime;"
结论
完成上述步骤后,本地将成功迁移一个或多个来自‘etalon dataset server’的数据集。




