Historical GIS and Guidebooks: Czechoslovak Tourist Attractions
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7418863
下载链接
链接失效反馈官方服务:
资源简介:
The following dataset of geolocated travel guide toponyms was compiled in the process of a research project Historical GIS and Guidebooks: A Scalable Reading of Czechoslovak Tourist Attractions (Bechmann Pedersen & Johansson, forthcoming) in which we performed a scalable reading of three travel guides of the former Czechoslovak lands as they were cirka 1959. We used a specialized, open-sourced tool CITADEL created for this project, which in itself relies on open-sourced data from GeoNames and WikiData.
Our primary sources were the following three travel guides:
Čedok/Nagel, 1959, Czechoslovakia
Čedok, 1928 Guide to the Czechoslovak Republic
Baedker, 1905 Austria–Hungary including Dalmatia and Bosnia
All files are tab separated values (.tsv) files with utf-8 encoding.
Three lists of positions with associated toponyms
Two lists of clusters
There are three files of the first type:
CITADEL_1905Baedeker_toponyms.tsv
CITADEL_1928Cedok_toponyms.tsv
CITADEL_1959Nagel_toponyms.tsv
Columns for toponym files
Column
Explanation
Toponym_first_used
First toponym recorded in the database that is linked to the position
Toponym_added
The toponym as spellled in the travel guide
Source
A shorthand used to identify the travel guide
PositionID
The unique identifier of a position, using the GeoNames or WikiData identifiers where applicable.
Longitude
-
Latitude
-
Year
The year of the travel guide
Weight
The weight reflects the number of entries the toponym has in the index. Nagel1928 does not use this convention, and the weight is instead based on the relative page count.
There are two files of the second type:
CITADEL_1905,1928,1959_4km_clusters.tsv
CITADEL_1928,1959_4km_clusters.tsv
Columns for the cluster files
Column
Explanation
years
The years represented in the cluster
Latitude
-
Longitude
-
cnt_points
The number of points represented in the cluster
cnt_toponyms
The number of toponyms represented in the cluster
cnt_toponyms_
The number of toponyms from represented in the cluster
本地理标注旅游指南地名(Toponym)数据集,源自研究项目《历史地理信息系统(Geographic Information System,GIS)与旅游指南:捷克斯洛伐克旅游景点的规模化解读》(Bechmann Pedersen & Johansson,即将刊出)的研究过程。该项目对前捷克斯洛伐克地区1959年左右的三部旅游指南进行了规模化文本解读。本研究使用了为本项目专门开发的开源工具CITADEL,该工具依托GeoNames与WikiData的开源数据构建。
本数据集的核心数据源为以下三部旅游指南:
1. Čedok/Nagel,1959,《捷克斯洛伐克》
2. Čedok,1928,《捷克斯洛伐克共和国指南》
3. Baedker,1905,《奥匈帝国及达尔马提亚与波斯尼亚》
所有文件均为采用UTF-8编码的制表符分隔值(Tab-Separated Values,TSV)文件。
数据集包含两类文件:
一、带关联地名的位置列表,共3个文件:
CITADEL_1905Baedeker_toponyms.tsv
CITADEL_1928Cedok_toponyms.tsv
CITADEL_1959Nagel_toponyms.tsv
地名文件各字段说明如下:
| 字段名 | 说明 |
|-----------------------|----------------------------------------------------------------------|
| Toponym_first_used | 数据库中记录的与该位置关联的首个地名 |
| Toponym_added | 旅游指南原文中记载的地名 |
| Source | 用于标识旅游指南的缩写标识 |
| PositionID | 位置唯一标识符,优先采用GeoNames或WikiData的对应标识符 |
| Longitude | 经度 |
| Latitude | 纬度 |
| Year | 对应旅游指南的出版年份 |
| Weight | 权重值反映该地名在索引中的条目数。其中1928年版Čedok未采用该统计规则,其权重值基于相对页面占比计算 |
二、聚类列表,共2个文件:
CITADEL_1905,1928,1959_4km_clusters.tsv
CITADEL_1928,1959_4km_clusters.tsv
聚类文件各字段说明如下:
| 字段名 | 说明 |
|-----------------------|----------------------------------------------------------------------|
| years | 该聚类所涵盖的指南出版年份 |
| Latitude | 纬度 |
| Longitude | 经度 |
| cnt_points | 该聚类所包含的点位总数 |
| cnt_toponyms | 该聚类所涵盖的地名总数 |
| cnt_toponyms_ | 该聚类所涵盖的来自[原文缺失来源信息]的地名数量
创建时间:
2024-01-12



