ERC Socsemics – Reddit-Gaza Oct 7-23 news/comment hyperedges
收藏DataCite Commons2025-07-07 更新2025-04-16 收录
下载链接:
https://nakala.fr/10.34847/nkl.e115r335
下载链接
链接失效反馈官方服务:
资源简介:
== Context ==
This dataset is derived from all the posts found on Reddit sharing news articles about the Israel-Hamas war, during the two weeks following the initial attack.
== Dataset scope ==
Semantic hypergraph (SH) representations were extracted from the full text of the news articles as well as from the full text of all the comments associated with these articles.
This dataset contains the SH representation of the full text of the news articles as well as of the full text of all the comments associated with these articles. The dataset also includes relevant metadata such as post / comment score, author, id and parent id. Author usernames and post / comment ids are fully anonymized.
The SH representation provided is compatible and can be used with the “Graphbrain” open source tool, which can be found at:
https://graphbrain.net
== Archive content ==
The dataset consists of two files for each subreddit that contains posts matching the search criteria: “[subreddit]_news_articles.csv” for the news articles and “[subreddit]_comments.csv” for the associated comments. Both file types have the same structure, with each row corresponding to a sentence:
• hyperedge: SH representation of a sentence.
• author: author of the post / comment from where the sentence was extracted (anonymized).
• score: score (upvotes minus downvotes) of the post / comment at the moment of retrieval.
• post_id: unique id of the post / comment from where the sentence was extracted (anonymized).
• parent_id: unique id of the parent of the post / comment from where the sentence was extracted (anonymized).
== Acknowledgment of funding == This dataset has been assembled in the framework of the ERC-supported Consolidator Grant “Socsemics” (research performed at CNRS, 2019-24), grant agreement #772743.
提供机构:
NAKALA - https://nakala.fr (Huma-Num - CNRS)
创建时间:
2024-10-30



