Replication Data for: Political Censorship in Large Language Models Originating from China

NIAID Data Ecosystem, indexed 2026-05-10

Download link:

https://doi.org/10.7910/DVN/VQMOJU

Description:

A growing body of research on large language models (LLMs) has identified various biases, primarily in contexts where biases reflect societal patterns. This paper focuses on a different source of bias in LLMs: government censorship. By comparing foundation models developed in China with those developed outside China, we find substantially higher rates of refusal to respond, shorter responses, and inaccurate responses to a battery of 145 political questions in China-originating models. These disparities diminish for less-sensitive prompts, showing that technological and market disparities cannot fully explain this divergence. While all models exhibit higher refusal rates with Chinese-language prompts than with English ones, language differences are less pronounced than disparities between China-originating and non-China-originating models. We caution that our study is observational and cross-sectional and does not establish a causal link between regulatory pressures and the censorship behaviors of China-originating LLMs, but these results suggest that censorship through government regulation requiring companies to restrict political content may be an important factor contributing to political bias in LLMs.

Created:

2026-01-07
