Replication Data for: Political Censorship in Large Language Models Originating from China
收藏NIAID Data Ecosystem2026-05-10 收录
下载链接:
https://doi.org/10.7910/DVN/VQMOJU
下载链接
链接失效反馈官方服务:
资源简介:
A growing body of research on large language models (LLMs) has identified various biases, primarily in contexts where biases reflect societal patterns. This paper focuses on a different source of bias in LLMs---government censorship. By comparing foundation models developed in China and those from outside China, we find substantially higher rates of refusal to respond, shorter responses, and inaccurate responses to a battery of 145 political questions in China-originating models. These disparities diminish for less-sensitive prompts, showing that technological and market disparities cannot fully explain this divergence. While all models exhibit higher refusal to respond rates with Chinese-language prompts than English ones, language differences are less pronounced than disparities between China-originating and non-China-originating models. We caution that our study is observational and cross-sectional and does not establish a causal linkage between regulatory pressures and censorship behaviors of China-originating LLMs, but these results suggest that censorship through government regulation requiring companies to restrict political content may be an important factor contributing to political bias in LLMs.
创建时间:
2026-01-07



