Neurodiversity Aware or Hyperaware AI? Visual stereotypes of autism spectrum in Janus-Pro-7B, DALL-E, Stable Diffusion, SDXL, FLUX, and Midjourney. Research Protocol

Name: Neurodiversity Aware or Hyperaware AI? Visual stereotypes of autism spectrum in Janus-Pro-7B, DALL-E, Stable Diffusion, SDXL, FLUX, and Midjourney. Research Protocol
Creator: figshare
Published: 2025-08-29 14:16:35
License: 暂无描述

DataCite Commons2025-08-29 更新2025-09-08 收录

下载链接：

https://figshare.com/articles/dataset/Neurodiversity_Aware_or_Hyperaware_AI_Visual_stereotypes_of_autism_spectrum_in_Janus-Pro-7B_DALL-E_Stable_Diffusion_SDXL_FLUX_and_Midjourney_Research_Protocol/30010723

下载链接

链接失效反馈

官方服务：

资源简介：

Avoiding systemic discrimination of neurodiverse individuals is an ongoing challenge in training language models, which often propagate negative stereotypes. This study examined whether six text-to-image models (Janus-Pro-7B VL2 vs. VL3, DALL-E 3 v. April 2024 vs. August 2025, Stable Diffusion v. 1.6 vs. 3.5, SDXL v. April 2024 vs. FLUX.1 Pro, and Midjourney v. 5.1 vs. 7) perpetuate non-rational beliefs regarding autism by comparing images generated in 2024-2025 with controls. 53 prompts aimed at neutrally visualizing concrete objects and abstract concepts related to autism were used against 53 controls (baseline total N=302, follow-up experimental 280 images plus 265 controls). Expert assessment measuring the presence of common autism-related stereotypes employed a framework of 10 deductive codes followed by statistical analysis. Autistic individuals were depicted with striking homogeneity in skin color (white), gender (male), and age (young), often engaged in solitary activities, interacting with objects rather than people, and exhibiting stereotypical emotional expressions such as sadness, anger, or emotional flatness. In contrast, the images of neurotypical individuals were more diverse and lacked such traits. We found significant differences between the models; however, with a moderate effect size (baseline $\eta^2 = 0.05$ and follow-up η = 0.08$), and no differences between baseline and follow-up summary values, with the ratio of stereotypical themes to the number of images similar across all models. The control prompts showed a significantly lower degree of stereotyping with large size effects (DALL·E 3 η = 0.39; Midjourney η = 0.41; FLUX η = 0.20; Stable Diffusion η = 0.34; DeepSeek-VL3 η = 0.45) confirming the hidden biases of the models. In summary, despite improvements in the technical aspects of image generation, the level of reproduction of potentially harmful autism-related stereotypes remained largely unaffected.

提供机构：

figshare

创建时间：

2025-08-29