five

Borrowed Voices, Shared Debt: Plagiarism, Idea Recombination, and the Knowledge Commons in Large Language Models

收藏
Figshare2025-09-16 更新2026-04-28 收录
下载链接:
https://figshare.com/articles/dataset/Borrowed_Voices_Shared_Debt_Plagiarism_Idea_Recombination_and_the_Knowledge_Commons_in_Large_Language_Models/30137422
下载链接
链接失效反馈
官方服务:
资源简介:
Large language models generate fluent text by recombining the language and ideas of prior authors at scale. This process produces plagiarism-like harms in three dimensions: direct wording leakage, imitation of distinctive styles, and appropriation of argument structures or conceptual syntheses without provenance. At the same time, their capacity to provide insight or novel-seeming combinations depends entirely on the accumulated labor of millions of human writers, editors, teachers, and curators who built the knowledge commons. This paper argues that denunciation and recognition must proceed together: the harms of extraction must be exposed, yet the debt to the commons must also be acknowledged. The article proposes a framework that defines the scope of plagiarism in this context, diagnoses the mechanisms of recombination, and sets out operational remedies, including dataset governance, attribution layers, compensation pools, and measurable audit thresholds. The goal is to establish a system that restricts illegitimate appropriation while reinvesting in the infrastructures of shared knowledge that make such synthesis possible.DOIPrimary archive: https://doi.org/10.5281/zenodo.17132004Secondary archive: https://doi.org/10.6084/m9.figshare.30137422SSRN: Pending assignment (ETA: Q3 2025)
创建时间:
2025-09-16
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作