MARB
收藏DataCite Commons2025-12-12 更新2026-05-05 收录
下载链接:
https://spraakbanken.gu.se/resurser/marb
下载链接
链接失效反馈官方服务:
资源简介:
Reporting bias (the human tendency to not mention obvious or redundant information) and social bias (societal attitudes toward specific demographic groups) have both been shown to propagate from human text data to language models trained on such data. However, the two phenomena have not previously been studied in combination. The MARB dataset was developed to begin to fill this gap by studying the interaction between social biases and reporting bias in language models. Unlike many existing benchmark datasets, MARB does not rely on artificially constructed templates or crowdworkers to create contrasting examples. Instead, the templates used in MARB are based on naturally occurring written language from the 2021 version of the enTenTen corpus (Jakubíček et al., 2013).
提供机构:
Språkbanken Text
创建时间:
2025-08-20



