five

Urdu Paraphrase Plagiarism Corpus (UPPC)

收藏
DataCite Commons2025-09-23 更新2025-04-17 收录
下载链接:
https://research.lancaster-university.uk/en/datasets/aa3587f7-8046-49fc-9165-f7481545f016
下载链接
链接失效反馈
官方服务:
资源简介:
This corpus contains 160 Urdu text documents in total. 20 documents are original Wikipedia articles on well-known people whereas 140 documents (manually created by volunteers) are paraphrase plagiarise and non-plagiarise versions of the original articles. 75 documents are paraphrased by 5 university students using different paraphrasing techniques. 65 documents are independently written without considering the source article.
提供机构:
Lancaster University
创建时间:
2016-02-29
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作