mattia-durso/TerraSky3D

Name: mattia-durso/TerraSky3D
Creator: mattia-durso
Published: 2026-04-17 12:19:20
License: 暂无描述

Hugging Face2026-04-17 更新2026-04-26 收录

下载链接：

https://hf-mirror.com/datasets/mattia-durso/TerraSky3D

下载链接

链接失效反馈

官方服务：

资源简介：

--- license: mit task_categories: - image-to-3d - depth-estimation - image-to-image language: - en tags: - 3D - Multi-View-Stereo - SfM - Aerial-Imagery - Aerial-to-ground - European-Landmarks pretty_name: TerraSky3D size_categories: - 10K<n<100K --- <div align="center"> <h1>TerraSky3D: Multi-View Reconstructions of European Landmarks in 4K</h1> <p> <a href="https://mattiadurso.github.io/">Mattia D'Urso</a><sup>1</sup> · <a href="https://scholar.google.com/citations?user=U6yM89YAAAAJ&hl=zh-CN">Yuxi Hu</a><sup>1</sup> · <a href="https://scholar.google.com/citations?user=6uZVF04AAAAJ&hl=en">Christian Sormann</a><sup>2</sup> · <a href="https://scholar.google.com/citations?user=DA3nSvgAAAAJ&hl=en">Mattia Rossi</a><sup>2</sup> · <a href="https://scholar.google.com/citations?user=M0boL5kAAAAJ&hl=en">Friedrich Fraundorfer</a><sup>1</sup> </p> <p> <sup>1</sup>Graz University of Technology · <sup>2</sup>Sony </p> <h3> <a href="https://3dmv.org/2026/">3DMV</a> at CVPR Workshop 2026 </h3> <p align="center"> <a href="https://github.com/mattiadurso/TerraSky3D"> <img src="https://img.shields.io/badge/GitHub-Code-blue?logo=github" alt="GitHub"> </a>   <a href="https://huggingface.co/datasets/mattia-durso/TerraSky3D"> <img src="https://img.shields.io/badge/%F0%9F%A4%97%20Hugging%20Face-Data-yellow" alt="Hugging Face"> </a>   <a href="https://arxiv.org/abs/2603.28287"> <img src="https://img.shields.io/badge/arXiv-Paper-red?logo=arxiv" alt="Paper"> </a> </p> <img src="assets/villalta.png" alt="TerraSky3D Teaser" width="80%"> <br> <p><em>Left: Sparse 3D reconstruction of the Villalta Castle, Italy. Right: Representative images collected from aerial and ground perspectives, shown with their corresponding geometrically and semantically filtered depth maps.</em></p> <div style="display: flex; justify-content: center; gap: 5px;"> <img src="assets/susans.gif" alt="Scene 1 GIF" width="32%"> <img src="assets/gorizia.gif" alt="Scene 2 GIF" width="32%"> <img src="assets/pieve.gif" alt="Scene 3 GIF" width="32%"> </div> <p><em>Examples of 3DGS reconstructions of aerial and ground scenes from TerraSky3D.</em></p> </div> ## Abstract Despite the growing need for data of more and more sophisticated 3D reconstruction pipelines, we can still observe a scarcity of suitable public datasets. Existing 3D datasets are either low resolution, limited to a small amount of scenes, based on images of varying quality because retrieved from the internet, or limited to specific capturing scenarios. Motivated by this lack of suitable 3D datasets, we captured TerraSky3D, a high-resolution large-scale 3D reconstruction dataset comprising 50,000 images divided into 155 ground, aerial, and mixed scenes. The dataset focuses on European landmarks and comes with curated calibration data, camera poses, and depth maps. TerraSky3D tries to answer the need for challenging dataset that can be used to train and evaluate 3D reconstruction-related pipelines. ## 📊 Training/Test Set Statistics (v1.0)  | Metric | Test | / | Train | Description | | :--- | ---: | :---: | :--- | :--- | | **Scenes** | 133 | / | 12 | Number of scenes. | | **Images** | 42,992 | / | 3,018 | Number of high-resolution (4K) images provided in the dataset. | | **Pairs** | 2,624,784 | / | 43,720 | Total stereo/overlapping image pairs captured. | |   ↳ **Ground** | 1,682,869 | / | 32,718 | Pairs captured only from a ground perspective. | |   ↳ **Aerial** | 728,502 | / | 6,308 | Pairs captured only from an aerial perspective. | |   ↳ **Mixed** | 213,413 | / | 4,694 | Pairs captured from both aerial and ground views. | ## 📁 Download & Format The dataset can be downloaded from <a href="https://huggingface.co/datasets/mattia-durso/TerraSky3D"> Hugging Face 🤗 </a>. Use `data_viewer.ipynb` to generate `train_data.json` and visualize examples of pairs of images from all the dataset or just mixed aerial and ground pairs. ------- The dataset follows a structured format for seamless integration into SfM and novel view synthesis pipelines: ```text data/scene/ ├── colmap/ # Structure-from-Motion outputs in COLMAP format │ └── sparse/ │ └── 0/ │ ├── cameras # Intrinsic parameters │ ├── images # Extrinsic parameters / poses │ └── points3D # Sparse point cloud ├── frames/ # Extracted video frames │ ├── cam_0/ │ │ └── frame_000000.jpg # Format: cam_i/frame_*.jpg │ ├── cam_1/ │ │ └── frame_000000.jpg │ └── ... ├── depth/ # Multi-View Stereo depth outputs │ ├── maps/ # Raw depth estimations from APD-MVS │ │ ├── cam_0/ │ │ │ └── frame_000000.h5 │ │ └── ... │ ├── masks_geometric/ # Geometric masks from APD-MVS │ │ ├── cam_0/ │ │ │ └── frame_000000.png │ │ └── ... │ └── masks_semantic/ # Semantic masks from Mask2Former │ ├── cam_0/ │ │ └── frame_000000.png │ └── ... └── train_data.json # Dictionary containing scenes, images, and camera parameters ``` ## 🙏 Acknowledgments We thank Davide Casano, Luca Danelutti, Federico Fattori, Alessandro Menafra, Florian Thaler, Runze Yuan, and Stefano Zorzi for their invaluable contributions to the data collection process. ## 📝 Citation If you find this dataset or code useful in your research, please consider citing: ```bibtex @article{durso202Xterrasky3d, title={TerraSky3D: Multi-View Reconstructions of European Landmarks in 4K}, author={D'Urso, Mattia and Hu, Yuxi and Rossi, Mattia and Sormann, Christian and Fraundorfer, Friedrich}, booktitle={IEEE Conference on Computer Vision and Pattern Recognition}, year={2026} } ```

提供机构：

mattia-durso

5,000+

优质数据集

54 个

任务类型

进入经典数据集