bstds/geonames
Hugging Face · Updated 2023-02-14 · Indexed 2024-03-04
Download link:
https://hf-mirror.com/datasets/bstds/geonames
Resource description:
---
dataset_info:
  features:
  - name: geonameid
    dtype: int64
  - name: name
    dtype: string
  - name: asciiname
    dtype: string
  - name: alternatenames
    dtype: string
  - name: latitude
    dtype: float64
  - name: longitude
    dtype: float64
  - name: feature_class
    dtype: string
  - name: feature_code
    dtype: string
  - name: country_code
    dtype: string
  - name: cc2
    dtype: string
  - name: admin1_code
    dtype: string
  - name: admin2_code
    dtype: string
  - name: admin3_code
    dtype: string
  - name: admin4_code
    dtype: string
  - name: population
    dtype: int64
  - name: elevation
    dtype: float64
  - name: dem
    dtype: int64
  - name: timezone
    dtype: string
  - name: modification_date
    dtype: string
  splits:
  - name: train
    num_bytes: 2378719253
    num_examples: 12368001
  download_size: 824343833
  dataset_size: 2378719253
---
# Dataset Card for "geonames"
[More Information needed](https://github.com/huggingface/datasets/blob/main/CONTRIBUTING.md#how-to-contribute-to-the-dataset-cards)
[Source](https://download.geonames.org/export/dump/)
```
"geonameid", # integer id of record in geonames database
"name", # name of geographical point (utf8) varchar(200)
"asciiname", # name of geographical point in plain ascii characters, varchar(200)
"alternatenames",
# alternatenames, comma separated, ascii names automatically transliterated, convenience attribute from alternatename table, varchar(10000)
"latitude", # latitude in decimal degrees (wgs84)
"longitude", # longitude in decimal degrees (wgs84)
"feature_class", # see http://www.geonames.org/export/codes.html, char(1)
"feature_code", # see http://www.geonames.org/export/codes.html, varchar(10)
"country_code", # ISO-3166 2-letter country code, 2 characters
"cc2",
# alternate country codes, comma separated, ISO-3166 2-letter country code, 200 characters
"admin1_code",
# fipscode (subject to change to iso code), see exceptions below, see file admin1Codes.txt for display names of this code; varchar(20)
"admin2_code",
# code for the second administrative division, a county in the US, see file admin2Codes.txt; varchar(80)
"admin3_code", # code for third level administrative division, varchar(20)
"admin4_code", # code for fourth level administrative division, varchar(20)
"population", # bigint (8 byte int)
"elevation", # in meters, integer
"dem",
# digital elevation model, srtm3 or gtopo30, average elevation of 3''x3'' (ca 90mx90m) or 30''x30'' (ca 900mx900m) area in meters, integer. srtm processed by cgiar/ciat.
"timezone", # the iana timezone id (see file timeZone.txt) varchar(40)
"modification_date", # date of last modification in yyyy-MM-dd format
```
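The column list above can be turned into a small row parser. The following is a minimal sketch, assuming each record arrives as one tab-separated line with the 19 fields in the order listed and with empty strings for missing values. The helper names `parse_row` and `alternate_names` and the sample row are hypothetical and made up for illustration; they are not part of the dataset or of any library.

```python
# Column order as given in the dataset card.
COLUMNS = [
    "geonameid", "name", "asciiname", "alternatenames", "latitude",
    "longitude", "feature_class", "feature_code", "country_code", "cc2",
    "admin1_code", "admin2_code", "admin3_code", "admin4_code",
    "population", "elevation", "dem", "timezone", "modification_date",
]

# Types follow the dtypes declared in the card's metadata.
INT_COLUMNS = {"geonameid", "population", "dem"}
FLOAT_COLUMNS = {"latitude", "longitude", "elevation"}


def parse_row(line: str) -> dict:
    """Parse one tab-separated line into a typed dict (assumed layout)."""
    values = line.rstrip("\n").split("\t")
    if len(values) != len(COLUMNS):
        raise ValueError(f"expected {len(COLUMNS)} fields, got {len(values)}")
    record = {}
    for column, raw in zip(COLUMNS, values):
        if raw == "":
            record[column] = None  # treat empty fields as missing
        elif column in INT_COLUMNS:
            record[column] = int(raw)
        elif column in FLOAT_COLUMNS:
            record[column] = float(raw)
        else:
            record[column] = raw
    return record


def alternate_names(record: dict) -> list:
    """Split the comma-separated alternatenames field into a list."""
    raw = record["alternatenames"]
    return raw.split(",") if raw else []


# A made-up row for illustration only; it is not taken from the dataset.
SAMPLE_LINE = "\t".join([
    "1", "Example Town", "Example Town", "Exampleville,Ex Town",
    "12.5", "-45.25", "P", "PPL", "XX", "", "01", "", "", "",
    "1200", "", "35", "Etc/UTC", "2020-01-01",
])

record = parse_row(SAMPLE_LINE)
print(record["name"], record["latitude"], record["longitude"])
print(alternate_names(record))
```

Mapping empty fields to `None` keeps a missing elevation distinct from an elevation of zero, which matters for numeric columns such as `elevation` and `population`.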
Provider:
bstds
Original information summary
Dataset overview
Dataset name
"geonames"
Dataset features
- geonameid: integer
- name: string
- asciiname: string
- alternatenames: string
- latitude: float
- longitude: float
- feature_class: string
- feature_code: string
- country_code: string
- cc2: string
- admin1_code: string
- admin2_code: string
- admin3_code: string
- admin4_code: string
- population: integer
- elevation: float
- dem: integer
- timezone: string
- modification_date: string
Dataset splits
- train: 12,368,001 examples, 2,378,719,253 bytes
Dataset size
- Download size: 824,343,833 bytes
- Dataset size: 2,378,719,253 bytes
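To put the size figures in perspective, the short sketch below derives the average in-memory size per row and the ratio of download size to dataset size. It uses only the three numbers reported in the card; the variable names are illustrative.

```python
# Figures copied from the dataset card metadata.
NUM_EXAMPLES = 12_368_001
DATASET_SIZE_BYTES = 2_378_719_253
DOWNLOAD_SIZE_BYTES = 824_343_833

GIB = 1024 ** 3

# Average size of one record once the dataset is materialized.
bytes_per_example = DATASET_SIZE_BYTES / NUM_EXAMPLES

# Fraction of the full dataset size that has to be downloaded.
download_ratio = DOWNLOAD_SIZE_BYTES / DATASET_SIZE_BYTES

print(f"average bytes per example: {bytes_per_example:.1f}")
print(f"download / dataset size:   {download_ratio:.1%}")
print(f"dataset size:  {DATASET_SIZE_BYTES / GIB:.2f} GiB")
print(f"download size: {DOWNLOAD_SIZE_BYTES / GIB:.2f} GiB")
```

Each row averages a little over 192 bytes, and the download is roughly a third of the materialized size, so plan for about three times the download size in local storage.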



