Webis-CPC-11
收藏webis.de2025-01-09 收录
下载链接:
https://webis.de/data/Webis-CPC-11
下载链接
链接失效反馈官方服务:
资源简介:
<p>The Webis Crowd Paraphrase Corpus 2011 (Webis-CPC-11) contains 7,859 candidate paraphrases obtained from Mechanical Turk crowdsourcing. The corpus is made up of 4,067 accepted paraphrases, 3,792 rejected non-paraphrases, and the original texts. These samples have formed part of <a href="https://webis.de/data/pan-pc-10">PAN 2010</a> international plagiarism detection competition, but were not previously available separate to rest of the competition data.</p>
《Webis众包释义语料库2011》(Webis-CPC-11)收录了7,859条候选释义,这些释义通过Mechanical Turk众包平台获得。该语料库包括4,067条被接受的释义、3,792条被拒绝的非释义文本以及原始文本。这些样本曾构成《PAN 2010国际抄袭检测竞赛》(PAN 2010 international plagiarism detection competition)的一部分,但在此之前,它们并未作为独立数据集与其他竞赛数据分开提供。
提供机构:
Webis Group



