five

Dataset corresponding to the paper "A privacy-preserving approach to identify riot-related footage on social media".

收藏
DataCite Commons2025-12-29 更新2026-01-03 收录
下载链接:
https://data.4tu.nl/datasets/1d26c310-5a5b-48e7-b72d-5540bd6d0b6e/1
下载链接
链接失效反馈
官方服务:
资源简介:
107,674 geolocated visual posts from a social media were collected during and after the 'Nahel Merzouk' riots in the summer 2023 in 7 French cities. These posts were fed to an image-to-text model (BLIP2-OPT-2.7B) to produce textual description of the visual content. This dataset contains those textual descriptions, along with the metadata (date, time, and location). A subset of the posts were also annotated as riot-related or not riot-related to train a BERT model. This subset is also provided in this database (see paper for more details).<br>Tables:<br>1. videos: Contains metadata about each video including location and timestamp information.2. captions: Contains all captions extracted from videos, with frame-level information.3. annotated_captions: Contains a subset of captions that have been manually annotated for riot-related content.4. annotated_videos: Contains manually annotated video-level labels for riot detection.5. split_annotated_videos: Defines the train/test split for annotated videos used in model training and evaluation.<br>
提供机构:
4TU.ResearchData
创建时间:
2025-12-29
二维码
社区交流群
二维码
科研交流群
商业服务