Adverse Drug Reaction (ADR) Text Dataset

Indexed from the NIAID Data Ecosystem on 2026-05-02
Download link:
https://zenodo.org/record/13889330
Description:
This repository contains text data and code for identifying and clustering Adverse Drug Reactions (ADRs) using Sentence-BERT (S-BERT) embeddings and the SS-DBSCAN clustering algorithm. The dataset includes both labeled and unlabeled patient reports extracted from the publicly available MIMIC-III database. The labeled data have been manually annotated to distinguish ADR from non-ADR cases. The unlabeled data are used for unsupervised clustering experiments, particularly to assess clustering performance on high-dimensional data.

New in This Version:
- Added Jupyter Notebook: `mimic-5k_PCA_tSNE_clustering.ipynb`
- Included detailed `README_ADR_Clustering_Task.txt` with step-by-step instructions to reproduce clustering results
- Explained how to scale experiments from 1,000 to the full dataset size
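
To make the clustering idea concrete, here is a minimal sketch of density-based clustering over sentence-embedding-like vectors. It is not the repository's code and it is not SS-DBSCAN: it uses plain DBSCAN with cosine distance as a stand-in. The toy three-dimensional vectors are hypothetical placeholders for S-BERT embeddings, which in practice have hundreds of dimensions, and the `eps` and `min_samples` values are arbitrary assumptions chosen for this toy data.

```python
import math
from collections import deque

NOISE = -1  # label for points that belong to no cluster


def cosine_distance(a, b):
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def region_query(points, i, eps):
    """Indices of all points within eps of points[i] (including i itself)."""
    return [j for j, q in enumerate(points)
            if cosine_distance(points[i], q) <= eps]


def dbscan(points, eps, min_samples):
    """Plain DBSCAN (assumption: stand-in for SS-DBSCAN, not the same algorithm)."""
    labels = [None] * len(points)  # None means "not visited yet"
    cluster_id = -1
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        neighbors = region_query(points, i, eps)
        if len(neighbors) < min_samples:
            labels[i] = NOISE  # may later be re-labeled as a border point
            continue
        cluster_id += 1
        labels[i] = cluster_id
        queue = deque(n for n in neighbors if n != i)
        while queue:
            j = queue.popleft()
            if labels[j] == NOISE:
                labels[j] = cluster_id  # border point: joins, does not expand
                continue
            if labels[j] is not None:
                continue  # already assigned to a cluster
            labels[j] = cluster_id
            j_neighbors = region_query(points, j, eps)
            if len(j_neighbors) >= min_samples:
                queue.extend(j_neighbors)  # core point: keep expanding
    return labels


# Hypothetical toy vectors standing in for S-BERT embeddings of reports.
embeddings = [
    [1.0, 0.1, 0.0], [0.9, 0.2, 0.0], [1.0, 0.0, 0.1],  # one dense group
    [0.0, 1.0, 0.1], [0.1, 0.9, 0.0], [0.0, 1.0, 0.0],  # another dense group
    [0.0, 0.0, 1.0],                                     # isolated point
]

labels = dbscan(embeddings, eps=0.05, min_samples=3)
print(labels)
```

On real data the vectors would come from an S-BERT encoder, and the neighborhood radius would need tuning, since distances in high-dimensional embedding spaces tend to concentrate in a narrow range.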
Created:
2025-04-11