Sep_TD_Tel01
收藏doi.org2025-01-22 收录
下载链接:
http://doi.org/10.17632/372rnwf9pc.1
下载链接
链接失效反馈官方服务:
资源简介:
The Sep_TD_Tel01 dataset was compiled by ComInSyS . This database has been compiled due to the low resource of the Persian language and the high popularity of the Telegram social network in Iran. In this work, an official API published by Telegram is used. To respect the principle of privacy, only data related to public channels and public groups have been collected. \par
This dataset contains 10,209 records of messages sent to public channels and groups in the one month between 1 January 2017 (12 Day 1395) and 31 January 2017 (12 Bahman 95). This database is divided into sixty 12-hour windows, which include two super-hot topics: "The death of Ayatollah Hashemi Rafsanjani" and the second topic, "Plasco building fire" To be able to review and control the performance, nine of these windows have been selected as GT and labeled. These windows are [14, 15, 16, 17, 18, 37, 38, 39, 40]
Sep_TD_Tel01 数据集由 ComInSyS 编制而成。鉴于波斯语资源的匮乏以及伊朗对 Telegram 社交网络的广泛青睐,该数据库得以汇编。本研究中,采用了 Telegram 发布的官方 API。为尊重隐私原则,仅收集了与公开频道和群组相关的数据。
该数据集收录了自 2017 年 1 月 1 日(1395 年 12 月 12 日)至 2017 年 1 月 31 日(95 年 12 巴姆)一个月内发送至公开频道和群组的 10,209 条信息记录。数据库被划分为六十个 12 小时的时间窗口,其中包含两个热点话题:一是“哈梅内伊·拉夫桑贾尼大阿亚图拉逝世”,二是“Plasco 大楼火灾”。为确保可审查和性能监控,从中选出了九个时间窗口作为训练集(Ground Truth)并进行标记。这些窗口分别为 [14, 15, 16, 17, 18, 37, 38, 39, 40]。
提供机构:
Mendeley Data



