five

bstds/geonames

收藏
Hugging Face2023-02-14 更新2024-03-04 收录
下载链接:
https://hf-mirror.com/datasets/bstds/geonames
下载链接
链接失效反馈
官方服务:
资源简介:
--- dataset_info: features: - name: geonameid dtype: int64 - name: name dtype: string - name: asciiname dtype: string - name: alternatenames dtype: string - name: latitude dtype: float64 - name: longitude dtype: float64 - name: feature_class dtype: string - name: feature_code dtype: string - name: country_code dtype: string - name: cc2 dtype: string - name: admin1_code dtype: string - name: admin2_code dtype: string - name: admin3_code dtype: string - name: admin4_code dtype: string - name: population dtype: int64 - name: elevation dtype: float64 - name: dem dtype: int64 - name: timezone dtype: string - name: modification_date dtype: string splits: - name: train num_bytes: 2378719253 num_examples: 12368001 download_size: 824343833 dataset_size: 2378719253 --- # Dataset Card for "geonames" [More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards) [Source](https://download.geonames.org/export/dump/) ``` "geonameid", # integer id of record in geonames database "name", # name of geographical point (utf8) varchar(200) "asciiname", # name of geographical point in plain ascii characters, varchar(200) "alternatenames", # alternatenames, comma separated, ascii names automatically transliterated, convenience attribute from alternatename table, varchar(10000) "latitude", # latitude in decimal degrees (wgs84) "longitude", # longitude in decimal degrees (wgs84) "feature_class", # see http://www.geonames.org/export/codes.html, char(1) "feature_code", # see http://www.geonames.org/export/codes.html, varchar(10) "country_code", # ISO-3166 2-letter country code, 2 characters "cc2", # alternate country codes, comma separated, ISO-3166 2-letter country code, 200 characters "admin1_code", # fipscode (subject to change to iso code), see exceptions below, see file admin1Codes.txt for display names of this code; varchar(20) "admin2_code", # code for the second administrative division, a county in the US, see file admin2Codes.txt; varchar(80) "admin3_code", # code for third level administrative division, varchar(20) "admin4_code", # code for fourth level administrative division, varchar(20) "population", # bigint (8 byte int) "elevation", # in meters, integer "dem", # digital elevation model, srtm3 or gtopo30, average elevation of 3''x3'' (ca 90mx90m) or 30''x30'' (ca 900mx900m) area in meters, integer. srtm processed by cgiar/ciat. "timezone", # the iana timezone id (see file timeZone.txt) varchar(40) "modification_date", # date of last modification in yyyy-MM-dd format" ```
提供机构:
bstds
原始信息汇总

数据集概述

数据集名称

"geonames"

数据集特征

  • geonameid: 整数类型
  • name: 字符串类型
  • asciiname: 字符串类型
  • alternatenames: 字符串类型
  • latitude: 浮点数类型
  • longitude: 浮点数类型
  • feature_class: 字符串类型
  • feature_code: 字符串类型
  • country_code: 字符串类型
  • cc2: 字符串类型
  • admin1_code: 字符串类型
  • admin2_code: 字符串类型
  • admin3_code: 字符串类型
  • admin4_code: 字符串类型
  • population: 整数类型
  • elevation: 浮点数类型
  • dem: 整数类型
  • timezone: 字符串类型
  • modification_date: 字符串类型

数据集划分

  • train: 12368001个样本,占用2378719253字节

数据集大小

  • 下载大小: 824343833字节
  • 数据集大小: 2378719253字节
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作