Bengali Identity Bias Evaluation Dataset (BIBED)
收藏NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://zenodo.org/record/7775520
下载链接
链接失效反馈官方服务:
资源简介:
Critical studies found NLP systems to bias based on gender and racial identities. However, few studies focused on identities defined by cultural factors like religion and nationality. Compared to English, such research efforts are even further limited in major languages like Bengali due to the unavailability of labeled datasets. Our paper (see the reference) describes a process for developing a bias evaluation dataset highlighting cultural influences on identity. We also provide this Bengali dataset as an artifact outcome that can contribute to future critical research.
If you find this dataset useful, please cite the associated paper:
Das, D., Guha, S., & Semaan, B. (2023, May). Toward Cultural Bias Evaluation Datasets: The Case of Bengali Gender, Religious, and National Identity. In Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP) (pp. 68-83).
BibTeX:
@inproceedings{das-etal-2023-toward,
title = "Toward Cultural Bias Evaluation Datasets: The Case of {B}engali Gender, Religious, and National Identity",
author = "Das, Dipto and
Guha, Shion and
Semaan, Bryan",
booktitle = "Proceedings of the First Workshop on Cross-Cultural Considerations in NLP (C3NLP)",
month = may,
year = "2023",
address = "Dubrovnik, Croatia",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2023.c3nlp-1.8",
pages = "68--83",
}
创建时间:
2023-08-07



