Protest-Related Posts on the LIHKG Forum from June 10 to July 11 2019
收藏Figshare2023-05-02 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/_strong_Protest-Related_Posts_on_the_LIHKG_Forum_from_June_10_to_July_11_2019_strong_/22723636
下载链接
链接失效反馈官方服务:
资源简介:
This dataset contains all protest-relevant posts on the LIHKG forum between June 10 and July 11, 2019. The dataset comprises a substantial corpus of 2,389,590 individual posts that are organized into 49,658 threads and were contributed by 12,624 distinct users. Note: all data could be publicly accessible in the LIHKG forum. Data key fields: thread_id: Unique identifier for a thread. cat_id: Identifier for thread category. user_id: User ID who created the thread. item_data_reply_time: Date and time of the reply to the post within the thread data. item_data_user_id: ID of the user who posted within the thread data. post_text_token: Token of the thread data. push_count: Whether contain any of the following terms: "push", "pish", "posh", "pash", "psuh", "up", "tui", "推", or "幫推". issues_pred: Strategic framing identified in the thread by the Bayesian algorithm. topic: Substantive topics identified in the thread by the LDA model.
创建时间:
2023-05-02



