five

Bangla-MedER: Bangla Medical Entity Recognition Dataset

收藏
Mendeley Data2026-04-09 收录
下载链接:
https://data.mendeley.com/datasets/jt4gywvwtj/2
下载链接
链接失效反馈
官方服务:
资源简介:
The Bangla-MedER dataset is a carefully compiled collection of 2980 annotated Bangla texts, centered on the field of medical entity recognition. This collection has six unique entity types: Medicine/Chemical Name, Organ, Disease, Hormone, Pharmacological Class, and Common Medical Terms. For the Bangla language, which is mostly spoken in Bangladesh and certain parts of India, this dataset intends to support research in natural language processing (NLP) and medical text mining. For training and assessing medical entity identification algorithms, the dataset—which was assembled from a variety of online medical resources, including blogs and websites—is an invaluable tool. This dataset can be used for a range of natural language processing (NLP) applications, including entity extraction, text classification, and information retrieval, which will enhance medical informatics and healthcare data processing. Here we have also provided the English translated dataset of our prepared Bengali dataset in a separate .csv file.
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作