Hanif15 Originality: Text Alignment
收藏NIAID Data Ecosystem2026-03-11 收录
下载链接:
https://zenodo.org/record/3712718
下载链接
链接失效反馈官方服务:
资源简介:
We provide you with a training corpus that consists of pairs of documents, one of which may contain passages of text reused from the other. The reused text is subject to various kinds of (automatic) obfuscation to hide the fact it has been reused. Enclosed in the evaluation corpora, a file named pairs is found, which lists all pairs of suspicious documents and source documents.
创建时间:
2020-04-21



