five

Data related to the paper "Studying social unrest through the lens of social media"

收藏
4TU.ResearchData2025-07-28 更新2026-04-23 收录
下载链接:
https://data.4tu.nl/datasets/649e8f5d-8e40-4ab7-9d07-b5ef53d810f0/3
下载链接
链接失效反馈
官方服务:
资源简介:
<br>Dataset corresponding to the paper "Studying social unrest through the lens of social media".<br>107,674 geolocated visual posts from a social media were collected during and after the 'Nahel Merzouk' riots in the summer 2023 in 7 French cities. These posts were fed to a computer vision model with the objective of identifying riot-related posts. This dataset contains the metadata (date, time, and location) of those posts along with the label of the posts (according to the model). Riot-related posts are then clustered into "events", based on their spatiotemporal proximity (see paper for more details).<br>Columns:"timestamp" (TIMESTAMP): Date and time of the posts"latitude" (REAL): Latitude at which the post was published"longitude" (REAL): Longitude at which the post was published"pred_class" (INTEGER): Binary variable with value 1 if it represents a riot, 0 otherwise"event" (TEXT): Event associated to the post, structured as follows:"No event" if the post is not marked as riot-related"day_city_id" with "day" being the day of the month associated to the event, such as "2", "city" being the city in which the event happened, such as "Paris", "id" being an integer. "29_Marseille_0" corresponds to event "0" happening in Marseille on June 29th 2023. If the value of the id is "-1", the post could not be associated to any event.

本数据集对应论文《透过社交媒体视角探析社会动荡》(Studying social unrest through the lens of social media)。 2023年夏季,法国7座城市在“纳赫尔·梅尔祖克”骚乱期间及骚乱后,共收集到107674条携带地理定位信息的社交媒体视觉帖文。 研究团队将上述帖文输入计算机视觉模型,旨在识别与骚乱相关的帖文。 本数据集包含这些帖文的元数据(发布日期、时间与地理位置),以及模型生成的帖文标签。 随后基于时空邻近性,将与骚乱相关的帖文聚类为若干“事件”,详细说明参见原论文。 字段说明: "timestamp" (TIMESTAMP):时间戳:帖文的发布日期与时间 "latitude" (REAL):纬度:帖文发布时的纬度坐标 "longitude" (REAL):经度:帖文发布时的经度坐标 "pred_class" (INTEGER):预测类别:二分类变量,若帖文与骚乱相关则取值为1,否则为0 "event" (TEXT):事件:帖文关联的事件,格式规则如下: 若帖文未被标记为与骚乱相关,则取值为"No event"(无事件); 否则格式为`day_city_id`:其中`day`为关联事件发生当月的日期(如"2"),`city`为事件发生城市(如"Paris(巴黎)"),`id`为整数编号。例如"29_Marseille_0"代表2023年6月29日在马赛发生的编号为0的事件;若`id`为"-1",则该帖文无法被关联至任一事件。
创建时间:
2025-07-28
二维码
社区交流群
二维码
科研交流群
商业服务