five

Cultural ecosystem service labels for photos from Flickr and Twitter using artificial intelligence models

收藏
Zenodo2025-07-23 更新2026-05-26 收录
下载链接:
https://zenodo.org/doi/10.5281/zenodo.15806887
下载链接
链接失效反馈
官方服务:
资源简介:
Description: Dataset of photos downloaded from Flickr (241,582 photos) and Twitter-X (1,035,488 photos) labeled by different artificial intelligence models and validated by labels assigned by human experts.  The entire dataset was labeled using different AI models. First, we applied a Large Language Model (GPT-4.1 from OPENAI) and Llava 1.6 (on a subset of the data) to extract semantic labels from the image content based on prompts fine-tuned using prompt engineering. In parallel, we used the base version of DINO (a self-supervised vision transformation model), fine-tuned with a subset of human expert-labeled images from our own dataset, to generate inferences for the entire image collection. We also incorporated labels derived from expert vision models pre-trained on established datasets such as ImageNet, COCO, Places365, and Nature, which provided complementary classification information. The labels used correspond to two categories (Table 1):   Table 1. Categories used in social media photo tagging: Stoten, based on the scientific framework proposed by Moreno-Llorca et al. (2020). Level 3, a hierarchical tagging system developed by our team to provide greater thematic detail, especially suited for the identification of Cultural Ecosystem Services. Stoten Level3 Cultural Accommodation Fauna/Flora Air activities Gastronomy Animals Nature & Landscape Breakwater Not relevant Bridge Recreational Commerce facilities Religious Cities Rural tourism Clouds Sports Dam Sun and beach Dock Urban Fungus   Heritage and culture   Knowledge   Landscapes   Lighthouse   Not relevant   Other abiotic features   Plants   Roads   Shelter   Skies   Spiritual, symbolic and related connotations   Terrestrial activities   Towns and villages   Tracks and trails   Vegetation and habitats   Vehicle   Water activities   Wind farm   Winter activities   Table 2. Table of contents of the dataset Folder       format Description AI models DINO   model .pt and pth Model fine-tuned with a subset of expert-labeled images   Expert models CES_label_tree .csv Equivalence table used to assign labels generated by expert models to our categories of interest (Stoten and Level3)   LLMs GPT and Llava prompts GPT_Label_local_files .py Python script used for labeling photos using OPENAI models (in our case we used the GPT 4.1 model)       Level3_GPT_LLava_7_prompts_used .txt Seven prompts used for photo tagging using GPT 4.1 and Llava 1.6       Stoten_GPT_LLava_7_prompts_used .txt Seven prompts used for photo tagging with Stoten using GPT 4.1 and Llava 1.6       Stoten_Level3_categories .csv Seven prompts used for photo tagging with level 3 using GPT 4.1 and Llava 1.6 Flickr AI based labels DINO Flickr_DINO_all .csv Inferences for all Flickr photos from the DINO model trained with the ground truth     Expert models Flickr_expert_models_all .csv Labels generated by expert models for the entire database     GPT Flickr_GPT_all .csv Database of Flickr photos tagged with CES using OPENAI's GPT-4.1 model.       Flickr_GPT_7_prompts_8192 .csv Subset of the Flickr photo database with CES-related tags assigned by the GPT 4.1 model where 7 prompts are tested for Stoten and Level 3.     Llava 1.6 Flickr_Llava_1-6 .csv Subset of the Flickr photo database with CES-related tags assigned by the Llava 1.6 model where 7 prompts are tested for Stoten and Level 3.   Ground truth Ground Truth labels Flickr_Database_Labeled_1082 .csv Contain labels assigned by human experts and after rounds of review and consensus, for both Stoten and Level 3, from 1082 Flickr photos       Flickr_Database_Labeled_7110 .csv Ground Truth, an archive containing labels assigned by human experts and after rounds of review and consensus, for both Stoten and Level 3, from 7110 Flickr photos       Flickr_Database_Labeled_8192 .csv Union of the two databases labeled above     Ground Truth photos 1082 .jpg/png Photos labeled by human experts, these photos were selected to be representative of different parks, with different levels of protection and representative of different CES       7110 .jpg/png Photos labeled by human experts, these photos were selected to be representative of different parks, with different levels of protection and representative of different CES   Human labels Flickr_DataBase_Labeled_1082_expert1_AS .csv File containing tags assigned by expert 1 for both Stoten and Level 3, from 1082 Flickr photos       Flickr_DataBase_Labeled_1082_expert2_FG .csv File containing tags assigned by expert 2 for both Stoten and Level 3, from 1082 Flickr photos       Flickr_DataBase_Labeled_7110_expert1_CN .csv File containing tags assigned by expert 1 for both Stoten and Level 3, from 7110 Flickr photos Twitter AI based labels DINO Twitter_DINO_all .csv Inferences for all Twitter photos from the DINO model trained with the ground truth     Expert models Twitter_expert_models_all .csv Labels generated by expert models for the entire database     GPT Twitter_GPT_all .csv Database of Twitter photos tagged with CES using OPENAI's GPT-4.1 model.       Twitter_GPT_7_prompts_150 .csv Subset of the Twitter photo database with CES-related tags assigned by the GPT 4.1 model where 7 prompts are tested for Stoten and Level 3.     Llava 1.6 Twitter_Llava_1-6 .csv Subset of the Twitter photo database with CES-related tags assigned by the Llava 1.6 model where 7 prompts are tested for Stoten and Level 3.   Ground truth Ground Truth labels Twitter_Database_Labeled_150 .csv Contain labels assigned by human experts and after rounds of review and consensus, for both Stoten and Level 3, from 150 Twitter photos       Twitter_Database_Labeled_6804 .csv Contain labels assigned by human experts and after rounds of review and consensus, for both Stoten and Level 3, from 6804 Twitter photos     Ground Truth photos 150 .jpg/png Photos labeled by human experts, these photos were selected to be representative of different parks, with different levels of protection and representative of different CES       6804 .jpg/png Photos labeled by human experts, these photos were selected to be representative of different parks, with different levels of protection and representative of different CES   Human labels Flickr_DataBase_Labeled_150_7experts .csv File containing tags assigned by 7 experts  for both Stoten and Level 3, from 150 Twitter photos       Flickr_DataBase_Labeled_6804_expert1_FG .csv File containing tags assigned by expert 2 for both Stoten and Level 3, from 6804 Twitter photos   References:  Moreno-Llorca, R., Méndez, P. F., Ros-Candeira, A., Alcaraz-Segura, D., Santamaría, L., Ramos-Ridao, Á. F., ... & Vaz, A. S. (2020). Evaluating tourist profiles and nature-based experiences in Biosphere Reserves using Flickr: Matches and mismatches between online social surveys and photo content analysis. Science of the Total Environment, 737, 140067. https://doi.org/10.1016/j.scitotenv.2020.140067
提供机构:
Zenodo
创建时间:
2025-07-23
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作