Wikipedia Talk Corpus
收藏figshare.com2017-01-23 更新2025-03-27 收录
下载链接:
https://figshare.com/articles/dataset/Wikipedia_Talk_Corpus/4264973/3
下载链接
链接失效反馈官方服务:
资源简介:
We provide a corpus of discussion comments from English Wikipedia talk pages. Comments are grouped into different files by year. Comments are generated by computing diffs over the full revision history and extracting the content added for each revision. See our wiki for documentation of the schema and our research paper for documentation on the data collection and processing methodology.
本数据集收录了来自英语维基百科讨论页面的评论语料库。评论按年度分组存储于不同文件中。通过计算全文修订历史中的差异,并提取每个修订版本中添加的内容,生成这些评论。详情请参阅我们的wiki文档,以及关于数据收集和处理方法的科研论文。
提供机构:
figshare



