A Multifaceted Approach to Gender Bias Detection in Bengali

NIAID Data Ecosystem2026-05-02 收录

下载链接：

https://data.mendeley.com/datasets/dj3745p2cy

下载链接

链接失效反馈

官方服务：

资源简介：

This project explores the critical issue of gender bias in the Bengali language by taking a multidimensional approach to detection. In a world where language reflects and shapes our social realities, it's essential to identify and address biases that can influence perceptions and reinforce stereotypes. By combining techniques from natural language processing (NLP), machine learning, and linguistic analysis, the project aims to uncover both overt and subtle forms of gender bias in Bengali texts—ranging from news articles and literature to social media content. It investigates how language use may differ based on gender representation and aims to build tools or models that can flag biased or discriminatory expressions. The ultimate goal is not only to detect bias but also to raise awareness and contribute to more inclusive and fair language practices in Bengali-speaking communities. Description of each column: ID: Serial number Text: Sentence or phrase in Bengali Label: "Biased" or "Unbiased" Gendered_Word: Word or phrase causing bias (if any) Bias_Type: Stereotype / Occupational Bias / Honorific Bias / Pronoun Bias / Neutral Source: News, Social Media, Literature, etc. Correction_Suggestion: Suggestion to neutralize the bias Structure of the Dataset: Format: CSV Rows: 2,451 (individual Detecting gender bias in Bengali language texts.) Columns: 7 (whether biased or not, the biased word or phrase, type of bias, and the source of the sentence)

创建时间：

2025-07-11

5,000+

优质数据集

54 个

任务类型

进入经典数据集