
PMC clinical trial disentangled tables data set

Source: NIAID Data Ecosystem (indexed 2026-03-10)
Download link:
https://data.mendeley.com/datasets/wk53twxddf
Description:
The database was created by processing 6558 clinical trial articles from the PubMed Central (PMC) public sample 2014. The articles were obtained by matching PMC and Medline documents; the selected documents are those whose Medline publication type contains the word "Clinical". The documents were processed with the TableDisentangler tool, which creates the majority of the database. They were then annotated with UMLS/MetaMap, using a script that is part of TableDisentangler and handles communication with MetaMap.

Three information extraction case studies were performed on these data:
- Extraction of patients' age
- Extraction of gender distribution
- Extraction of FEV1 measures (performed for COPD studies only)

The case studies used the TabInOut tool to generate table information extraction rules.

The database schema is available at: https://github.com/nikolamilosevic86/TableDisentangler/wiki/Database-schema

Files included in the dataset:
- Clinicaldata.zip: raw XML clinical documents from PMC
- Database.zip: database with the data processed using TableDisentangler and TabInOut
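
The document selection step described above (keeping articles whose Medline publication type contains the word "Clinical") can be illustrated with a minimal sketch. This is not the authors' code: the record layout, the PMC identifiers, and the case-insensitive substring match are all assumptions made for illustration.

```python
from typing import Iterable


def is_clinical(publication_types: Iterable[str]) -> bool:
    """Return True if any publication type contains the word 'clinical'.

    Assumption: a case-insensitive substring match; the original
    selection procedure may have matched differently.
    """
    return any("clinical" in pt.lower() for pt in publication_types)


# Hypothetical records: made-up PMC IDs mapped to Medline publication types.
records = {
    "PMC0000001": ["Journal Article", "Clinical Trial"],
    "PMC0000002": ["Journal Article", "Review"],
    "PMC0000003": ["Clinical Trial, Phase II"],
}

# Keep only the documents that pass the publication-type filter.
selected = sorted(pmcid for pmcid, types in records.items() if is_clinical(types))
print(selected)
```

In this toy input, only the first and third records carry a publication type containing "Clinical", so only those two would be kept.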
Created:
2017-05-19