five

PMC-Patients Dataset

收藏
NIAID Data Ecosystem2026-05-01 收录
下载链接:
https://figshare.com/articles/dataset/PMC-Patients_Dataset/24504115
下载链接
链接失效反馈
官方服务:
资源简介:
## PMC-Patients Dataset The core file of our dataset, containing the patient summaries, demographics, and relational annotations. ### PMC-Patients.json Patient summaries are presented as a `json` file, which is a list of dictionaries with the following keys: - `patient_id`: string. A continuous id of patients, starting from 0. - `patient_uid`: string. Unique ID for each patient, with format PMID-x, where PMID is the PubMed Identifier of source article of the note and x denotes index of the note in source article. - `PMID`: string. PMID for source article. - `file_path`: string. File path of xml file of source article. - `title`: string. Source article title. - `patient`: string. Patient note. - `age`: list of tuples. Each entry is in format `(value, unit)` where value is a float number and unit is in 'year', 'month', 'week', 'day' and 'hour' indicating age unit. For example, `[[1.0, 'year'], [2.0, 'month']]` indicating the patient is a one-year- and two-month-old infant. - `gender`: 'M' or 'F'. Male or Female. - `relevant_articles`: dict. The key is PMID of the relevant articles and the corresponding value is its relevance score (2 or 1 as defined in the ``Methods'' section). - `similar_patients`: dict. The key is patient_uid of the similar patients and the corresponding value is its similarity score (2 or 1 as defined in the ``Methods'' section).
创建时间:
2023-11-06
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作