five

Webis-WikiDebate-18

收藏
NIAID Data Ecosystem2026-03-13 收录
下载链接:
https://zenodo.org/record/3339135
下载链接
链接失效反馈
官方服务:
资源简介:
Webis-WikiDebate-18 is a large-scale corpus for the argumentation model. The corpus is generated automatically based on the metadata in discussions and then verified partly by an expert. The table has the following attributes: comment-ID discussion-ID reference comment-ID comment (wiki format with tag) comment (plain text without tag) username timestamp hierarchy level in discussion The IDs consist of up to 3 numbers. For example, the comment-ID "3203277-22-11" consists of the page-ID "3203277" with the 23rd discussion and the 12th comment inside the discussion. Please note that the counting starts at 0. The page-ID is from the MediaWiki's internal article ID and can be called by the curid attribute (e.g. http://en.wikipedia.org/?curid=3203277). The article on Wikipedia and the corresponding talk page have two different IDs. Sometimes the value is "\N" when the comment or the structure of the discussion was ill-formed and there was no previous comment, user or timestamp statement. This can happen quite often because no editor checks the meta data and everything has to be managed by the users themselves. The reference comment-ID is the last comment on the higher hierarchy level which the current one refers to with its statement. The hierarchy level of a comment in a discussion is identified by the number of the ":" in wiki format at the beginning of a comment and shows how deep the discussion already involves.
创建时间:
2022-08-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作