
PubMed classification 202401

Source: figshare.com | Updated: 2024-02-13 | Indexed: 2025-01-15
Download link:
https://figshare.com/articles/dataset/PubMed_classification_v1_202102/16601402/7
Description:
The classification contains about 21 million PubMed publications from 1995 onward. It has been created using clustering in a citation network.

The January 2024 update is a complete new version of the classification based on new clustering and labeling.

File description

PMID_cluster_relation_[date].csv contains the relation between PMIDs and clusters. Four levels are included:
Level 1 - Topics - Most granular
Level 2 - Specialties
Level 3 - Disciplines
Level 4 - Discipline group - Most coarse

Labels

For each level there is a table with labels (e.g. labels_lev1_[date].csv), related by an id (e.g. lev1_cluster_id).

Stats

For each level there is a table with statistics (e.g. lev1_stats). The table includes the columns below. For more information about the "Clinical", "Human", "Animal" and "Molecular/Cellular Biology" categories, see https://nih.figshare.com/collections/iCite_Database_Snapshots_NIH_Open_Citation_Collection_/4586573

p - The number of publications in the cluster in the initial clustering.
pct_clinical - The proportion of clinical articles
sum_clinical - The number of clinical articles
pct_human - The average of the fraction of MeSH terms that are in the "Human" category
sum_human - The sum of fraction of MeSH terms that are in the "Human" category
pct_animal - The average of the fraction of MeSH terms that are in the "Animal" category
sum_animal - The sum of fraction of MeSH terms that are in the "Animal" category
pct_molecular_cellular - The average of the fraction of MeSH terms that are in the "Molecular/Cellular Biology" category
sum_molecular_cellular - The sum of fraction of MeSH terms that are in the "Molecular/Cellular Biology" category

Visualizations:
Base map of PubMed 2010-2023
Map of PubMed 2023 - Including hyperlinks to publications
Map colored by % Clinical
Map colored by % Human
Map colored by % Animal
Map colored by % Molecular/Cellular

See the figshare collection for further description.
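
Because the label tables are related to the PMID-cluster relation file by an id, a typical first step is to look up the label for each publication. The following is a minimal sketch of that lookup using only the Python standard library. The sample rows are made up, and the column names `pmid` and `label` are assumptions for illustration; only `lev1_cluster_id` is named in the description, so check the real file headers before adapting it.

```python
import csv
import io

# Tiny made-up samples standing in for the real files. The column names
# "pmid" and "label" are assumptions; only "lev1_cluster_id" is named in
# the dataset description.
RELATION_CSV = """pmid,lev1_cluster_id
101,7
102,7
103,9
"""

LABELS_CSV = """lev1_cluster_id,label
7,example topic A
9,example topic B
"""


def read_rows(handle):
    """Read a CSV file object into a list of dicts keyed by header name."""
    return list(csv.DictReader(handle))


def attach_labels(relation_rows, label_rows,
                  key="lev1_cluster_id", label_col="label"):
    """Map each PMID to its cluster label by joining on the cluster id.

    PMIDs whose cluster id is missing from the label table map to None.
    """
    labels = {row[key]: row[label_col] for row in label_rows}
    return {row["pmid"]: labels.get(row[key]) for row in relation_rows}


relation = read_rows(io.StringIO(RELATION_CSV))
labels = read_rows(io.StringIO(LABELS_CSV))
pmid_to_label = attach_labels(relation, labels)
print(pmid_to_label)
```

To run this against the downloaded files, replace the `io.StringIO(...)` objects with `open(path, newline="")` handles and adjust `key` and `label_col` to the headers actually present. The same function should work for the other levels if their id columns follow the same naming pattern, which is also an assumption.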

Provider:
figshare