Fairlex: A multilingual benchmark for evaluating fairness in legal text processing

NIAID Data Ecosystem2026-03-13 收录

下载链接：

https://zenodo.org/record/6322642

下载链接

链接失效反馈

官方服务：

资源简介：

We present a benchmark suite of four datasets for evaluating the fairness of pre-trained legal language models and the techniques used to fine-tune them for downstream tasks. Our benchmarks cover four jurisdictions (European Council, USA, Swiss, and Chinese), five languages (English, German, French, Italian, and Chinese), and fairness across five attributes (gender, age, nationality/region, language, and legal area). In our experiments, we evaluate pre-trained language models using several group-robust fine-tuning techniques and show that performance group disparities are vibrant in many cases, while none of these techniques guarantee fairness, nor consistently mitigate group disparities. Furthermore, we provide a quantitative and qualitative analysis of our results, highlighting open challenges in the development of robustness methods in legal NLP.

创建时间：

2022-03-02

5,000+

优质数据集

54 个

任务类型

进入经典数据集