five

Geoprivacy Open Data 2023/09

收藏
DataCite Commons2023-10-24 更新2025-04-16 收录
下载链接:
https://etsin.fairdata.fi/dataset/e48e671f-5d74-4028-bb9e-675068945e53
下载链接
链接失效反馈
官方服务:
资源简介:
# General Geoprivacy Open Data is an anonymised tracking dataset in gpx format based on voluntary donations of personal human mobility data. Anonymisation means that all direct and indirect references to a person have been removed, while at the same time, the original data is preserved as much as possible. The raw data donations have been done through the Geoprivacy platform operating at [https://geoprivacy.fi/](https://geoprivacy.fi/). The Geoprivacy platform is a service where cyclists, joggers, and pedestrians can donate GPS (Global Positioning System) and other GNSS (Global Navigation Satellite System) tracking data for science. Additionally, the portal offers an open data repository to which users can choose to provide a privacy-protected version of their data. The open data is freely available to urban planners, scientists, and industry, i.e., everyone interested in the innovation potential of detailed personal-level mobility data. The hourly updates of the data are available from the platform, and this data is one of the monthly/yearly frozen versions of live data having exhaustive metadata descriptions, a persistent identifier, and access through the [https://etsin.fairdata.fi](https://etsin.fairdata.fi) service. The platform was born as part of our research on privacy issues related to using precise individual-level location data. A prime example of this is activity tracking data, which citizens are recording using various mobile sports tracking applications. **Our vision is that this tracking data could be used to improve the infrastructure for non-motorized means of travel.** Making cycling and walking more safe and convenient could greatly help reduce the number of cars in cities. Activity tracking data is not easily accessible. Even when tracks are publicly visible on the web, the terms of use usually limit the ways in which the data can be used. Some companies grant specific types of users, e.g., urban planners, access to the data, but only in a heavily processed form. There are good reasons to limit the distribution of the data: Activity tracking data is sensitive data and can reveal surprising personal details. These include, for example, home and workplace, and repeating patterns such as commuting behavior. We started the Geoprivacy platform to request voluntary participants to donate their tracking data for science. The original tracks are used only for research related to location data privacy within our research group. However, the participants have the additional option to donate a processed version of their tracks to an open data repository. With the open, privacy-preserving data repository, we hope to provide the scientific community with a benchmark dataset for non-motorized mobility data, making research in this area more comparable and reproducible. Furthermore, the open repository can serve as a proof-of-concept for a service that allows citizens to share their data directly with urban planning authorities. # Privacy protection The details of the privacy protection method are described at [https://geoprivacy.fi/#/privacy-mechanisms](https://geoprivacy.fi/#/privacy-mechanisms). In addition to the listed mechanisms, the population density of the data area needs to be more than 6 inhabitants per square kilometer, and the data has to be inside Finland. # Versions This dataset is one of the frozen datasets in the series of Geoprivacy Open Data. The other frozen datasets can be found from the Fairdata.fi service [Etsin with the search term "geoprivacy"](https://etsin.fairdata.fi/datasets/geoprivacy?keys=&terms=&p=1&sort=best) . # Format and coordinate system The data is provided in the standard GPX format. The general GPX XML schema is available at [https://www.topografix.com/GPX/1/1/](https://www.topografix.com/GPX/1/1/). Each GPX file starts with definitions of the XML version and character encoding, the GPX version, and the library used for creating the file. The actual location data is given in and elements, where the latitude and longitude values of each track point are given after and attributes using decimal degrees in the WGS84 coordinate system (EPSG: 4326). The time stamp for each point is given in the element in UTC. For day-time critical applications, conversion to EET/EEST taking into account the day-light saving is important (EET = UTC + 2h, EEST = UTC + 3h). # Terms of use The dataset is provided as open data, and its use is controlled by [the Terms of Use for the GeoPrivacy Open Data](https://geoprivacy.fi/#/open-data-terms-of-use). If you use the data in your work, please use the citation >Mäkinen, V., Brauer, A. and Oksanen, J. 2023. GeoPrivacy platform, available at: https://geoprivacy.fi and acknowledge >"We made use of geospatial data provided by the Open Geospatial Information Infrastructure for Research (Geoportti, urn:nbn:fi:research-infras-2016072513) funded by the Academy of Finland, CSC – IT Center for Science, and other Geoportti consortium members." # Acknowledgements The Geoprivacy project and platform have been funded by the Finnish Cultural Foundation and the Academy of Finland. The platform is the service pilot of the Geoportti RI (Open Geospatial Information Infrastructure for Research, urn:nbn:fi:research-infras-2016072513).

# 通用地理隐私开放数据(General Geoprivacy Open Data)是一套基于人类个人移动数据自愿捐赠的匿名化追踪数据集,采用GPX(GPS Exchange Format)格式存储。匿名化处理指移除所有直接或间接指向个人的关联信息,同时尽可能保留原始数据的核心特征。 原始捐赠数据通过运行于[https://geoprivacy.fi/](https://geoprivacy.fi/)的Geoprivacy平台完成收集。Geoprivacy平台是一项面向骑行者、慢跑者与行人的服务,用户可在此捐赠GPS(全球定位系统)及其他GNSS(全球导航卫星系统)追踪数据用于科学研究。此外,该门户还提供开放数据仓储,用户可选择将经过隐私保护处理的个人数据捐赠至该仓储中。 本开放数据面向城市规划者、科研人员与产业界免费开放,即所有关注精细化个人级移动数据创新价值的群体均可获取。该平台每小时更新一次原始动态数据,本数据集为其中一份月度/年度静态冻结版本,附带完整的元数据描述、持久化标识符,可通过[https://etsin.fairdata.fi](https://etsin.fairdata.fi)服务获取访问权限。 该平台的开发初衷是为支撑我们针对高精度个人级位置数据使用场景下的隐私保护问题开展的研究。此类场景的典型案例便是用户通过各类移动运动追踪应用记录的活动轨迹数据。**我们的愿景是,通过此类追踪数据优化非机动化出行的基础设施建设。**提升骑行与步行的安全性与便利性,可有效减少城市内的机动车保有量与出行占比。 活动轨迹数据通常难以获取:即便轨迹数据在网络上公开,其使用条款通常也会严格限制数据的使用方式。部分企业仅向特定群体(如城市规划者)开放数据访问权限,但提供的均为经过重度处理后的脱敏版本。限制此类数据的公开传播具备充分合理性:活动轨迹数据属于敏感信息,可能泄露大量隐私细节,例如用户的住址、工作单位,以及通勤模式这类规律性出行行为。 我们发起Geoprivacy平台的初衷,便是邀请自愿参与者捐赠个人轨迹数据用于科学研究。原始轨迹数据仅用于本研究团队开展的位置数据隐私相关研究。不过,参与者可额外选择将经过处理的轨迹数据捐赠至开放数据仓储。依托该开放隐私保护数据仓储,我们期望为科研社区提供一套非机动化移动数据的基准数据集,推动该领域研究的可对比性与可复现性。此外,该开放仓储还可作为一项概念验证服务原型,验证普通民众可直接向城市规划管理部门共享个人出行数据的可行性。 # 隐私保护 隐私保护方法的详细说明可参见[https://geoprivacy.fi/#/privacy-mechanisms](https://geoprivacy.fi/#/privacy-mechanisms)。除上述机制外,数据集覆盖区域的人口密度需达到每平方公里6人以上,且数据仅限芬兰境内的轨迹信息。 # 版本说明 本数据集为Geoprivacy开放数据系列中的一份静态冻结数据集。其余静态冻结数据集可通过Fairdata.fi平台的[Etsin服务,搜索关键词“geoprivacy”](https://etsin.fairdata.fi/datasets/geoprivacy?keys=&amp;terms=&amp;p=1&amp;sort=best)获取。 # 格式与坐标系 本数据集采用标准GPX格式存储,通用GPX XML模式规范可参见[https://www.topografix.com/GPX/1/1/](https://www.topografix.com/GPX/1/1/)。每份GPX文件的文件头均包含XML版本、字符编码、GPX版本以及生成该文件所使用的工具库的相关定义。实际位置数据存储于<trkpt>与<wpt>元素中,每个轨迹点的纬度与经度值通过对应属性给出,采用WGS84坐标系(EPSG: 4326)的十进制度数格式。每个轨迹点的时间戳通过<time>元素给出,采用UTC(协调世界时)时区。对于涉及时区时效性的应用场景,需根据夏令时规则将UTC时间转换为EET(东欧时区,UTC+2)或EEST(东欧夏令时时区,UTC+3)。 # 使用条款 本数据集以开放数据形式提供,其使用需遵循[GeoPrivacy开放数据使用条款](https://geoprivacy.fi/#/open-data-terms-of-use)。若您在研究工作中使用本数据集,请遵循以下引用规范: > Mäkinen, V., Brauer, A. and Oksanen, J. 2023. GeoPrivacy platform, available at: https://geoprivacy.fi 同时请注明:"We made use of geospatial data provided by the Open Geospatial Information Infrastructure for Research (Geoportti, urn:nbn:fi:research-infras-2016072513) funded by the Academy of Finland, CSC – IT Center for Science, and other Geoportti consortium members." # 致谢 Geoprivacy项目与平台由芬兰文化基金会与芬兰科学院资助。本平台为Geoportti RI(开放地理空间研究基础设施,urn:nbn:fi:research-infras-2016072513)的服务试点项目。
提供机构:
Finnish Geospatial Research Institute, Department of Geoinformatics and Cartography
创建时间:
2023-09-21
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作