five

Supporting data for "Biospytial: spatial graph-based computing engine for ecological big data"

收藏
Mendeley Data2024-01-31 更新2024-06-29 收录
下载链接:
http://gigadb.org/dataset/100723
下载链接
链接失效反馈
官方服务:
资源简介:
Biospytial is a modular open source knowledge engine designed to import, organise, analyse and visualise big spatial ecological datasets using the power of graph theory. Specifically, it handles species occurrences and their taxonomic classification for performing ecological analysis on biodiversity and species distributions. The engine uses a hybrid graph-relational approach to store and access information. The data are linked with relationships that are stored in a graph database, while tabular and geospatial (vector and raster) data are stored in a relational database management system (RDBMS). The graph data structure provides a scalable design that eases the problem of merging datasets from different sources. The linkage relationships use semantic structures (objects and predicates) to answer scientific questions represented as complex data structures stored in the graph database. In this sense, we used species occurrences, taxonomic classification, and climatic datasets to build a knowledge graph of the Tree of Life embedded in an environmental and geographical grid. Biospytial comprises three interconnected components: i) a Geospatial Processing unit (GPU) supported by a RDBMS with geoprocessing capabilities, ii) a Graph Storage and Querying Unit, and iii) a graph-relational package, called: The Biospytial Computing Engine (BCE) that integrates all the system’s components. It also includes tools like: interactive notebooks (Jupyter), graph analytic libraries (NetworkX) and statistical frameworks (PyMC3). The Biospytial approach reduces the complexity of joining datasets using multiple primary-foreign key relations, a drawback in RDBMS. Applied to ecological data, it allows the discovery and inference of relationships using the interconnected network of taxonomic and spatial relationships. Its modular and scalable design makes it possible to run and distribute several instances simultaneously, allowing fast and efficient handling of big and complex ecological datasets. An example applied to the conservation of threatened species from the IUCN Red List using the co-occurrence of jaguars (Panthera onca) is included. This example demonstrates the engine’s capabilities in performing basic taxonomic trees manipulation, analysis and visualization of taxonomic groups co-occurring in space.
创建时间:
2024-01-31
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作