Bangla-MedER: Bangla Medical Entity Recognition Dataset

Mendeley Data2026-04-09 收录

下载链接：

https://data.mendeley.com/datasets/jt4gywvwtj/2

下载链接

链接失效反馈

官方服务：

资源简介：

The Bangla-MedER dataset is a carefully compiled collection of 2980 annotated Bangla texts, centered on the field of medical entity recognition. This collection has six unique entity types: Medicine/Chemical Name, Organ, Disease, Hormone, Pharmacological Class, and Common Medical Terms. For the Bangla language, which is mostly spoken in Bangladesh and certain parts of India, this dataset intends to support research in natural language processing (NLP) and medical text mining. For training and assessing medical entity identification algorithms, the dataset—which was assembled from a variety of online medical resources, including blogs and websites—is an invaluable tool. This dataset can be used for a range of natural language processing (NLP) applications, including entity extraction, text classification, and information retrieval, which will enhance medical informatics and healthcare data processing. Here we have also provided the English translated dataset of our prepared Bengali dataset in a separate .csv file.

5,000+

优质数据集

54 个

任务类型

进入经典数据集