iCite Database Snapshot 2022-09
收藏nih.figshare.com2023-05-31 更新2025-03-25 收录
下载链接:
https://nih.figshare.com/articles/dataset/iCite_Database_Snapshot_2022-09/21287247/1
下载链接
链接失效反馈官方服务:
资源简介:
This is a database snapshot of the iCite web service (provided here as a single zipped CSV file, or compressed, tarred JSON files). In addition, citation links in the NIH Open Citation Collection are provided as a two-column CSV table in open_citation_collection.zip. iCite provides bibliometrics and metadata on publications indexed in PubMed, organized into three modules:
Influence: Delivers metrics of scientific influence, field-adjusted and benchmarked to NIH publications as the baseline.
Translation: Measures how Human, Animal, or Molecular/Cellular Biology-oriented each paper is; tracks and predicts citation by clinical articles
Open Cites: Disseminates link-level, public-domain citation data from the NIH Open Citation Collection
Definitions for individual data fields:
pmid: PubMed Identifier, an article ID as assigned in PubMed by the National Library of Medicine
doi: Digital Object Identifier, if available
year: Year the article was published
title: Title of the article
authors: List of author names
journal: Journal name (ISO abbreviation)
is_research_article: Flag indicating whether the Publication Type tags for this article are consistent with that of a primary research article
relative_citation_ratio: Relative Citation Ratio (RCR)--OPA's metric of scientific influence. Field-adjusted, time-adjusted and benchmarked against NIH-funded papers. The median RCR for NIH funded papers in any field is 1.0. An RCR of 2.0 means a paper is receiving twice as many citations per year than the median NIH funded paper in its field and year, while an RCR of 0.5 means that it is receiving half as many citations per year. Calculation details are documented in Hutchins et al., PLoS Biol. 2016;14(9):e1002541.
provisional: RCRs for papers published in the previous two years are flagged as "provisional", to reflect that citation metrics for newer articles are not necessarily as stable as they are for older articles. Provisional RCRs are provided for papers published previous year, if they have received with 5 citations or more, despite being, in many cases, less than a year old. All papers published the year before the previous year receive provisional RCRs. The current year is considered to be the NIH Fiscal Year which starts in October. For example, in July 2019 (NIH Fiscal Year 2019), papers from 2018 receive provisional RCRs if they have 5 citations or more, and all papers from 2017 receive provisional RCRs. In October 2019, at the start of NIH Fiscal Year 2020, papers from 2019 receive provisional RCRs if they have 5 citations or more and all papers from 2018 receive provisional RCRs.
citation_count: Number of unique articles that have cited this one
citations_per_year: Citations per year that this article has received since its publication. If this appeared as a preprint and a published article, the year from the published version is used as the primary publication date. This is the numerator for the Relative Citation Ratio.
field_citation_rate: Measure of the intrinsic citation rate of this paper's field, estimated using its co-citation network.
expected_citations_per_year: Citations per year that NIH-funded articles, with the same Field Citation Rate and published in the same year as this paper, receive. This is the denominator for the Relative Citation Ratio.
nih_percentile: Percentile rank of this paper's RCR compared to all NIH publications. For example, 95% indicates that this paper's RCR is higher than 95% of all NIH funded publications.
human: Fraction of MeSH terms that are in the Human category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)
animal: Fraction of MeSH terms that are in the Animal category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)
molecular_cellular: Fraction of MeSH terms that are in the Molecular/Cellular Biology category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)
x_coord: X coordinate of the article on the Triangle of Biomedicine
y_coord: Y Coordinate of the article on the Triangle of Biomedicine
is_clinical: Flag indicating that this paper meets the definition of a clinical article.
cited_by_clin: PMIDs of clinical articles that this article has been cited by.
apt: Approximate Potential to Translate is a machine learning-based estimate of the likelihood that this publication will be cited in later clinical trials or guidelines. Calculation details are documented in Hutchins et al., PLoS Biol. 2019;17(10):e3000416.
cited_by: PMIDs of articles that have cited this one.
references: PMIDs of articles in this article's reference list.
Large CSV files are zipped using zip version 4.5, which is more recent than the default unzip command line utility in some common Linux distributions. These files can be unzipped with tools that support version 4.5 or later such as 7zip.
Comments and questions can be addressed to iCite@mail.nih.gov
{'database_snapshot': '本数据库为iCite网络服务的数据快照(提供单份压缩的CSV文件或压缩打包的JSON文件)。此外,NIH开放引文集合中的引文链接以两列CSV表格的形式提供于open_citation_collection.zip文件中。', 'iCite': 'iCite平台提供PubMed索引中出版物的计量学和元数据,分为三个模块:', 'modules': {'influence': '影响力模块:提供科学影响力的指标,经领域调整并以NIH出版物为基准进行基准测试。', 'translation': '翻译模块:衡量每篇论文在人类、动物或分子/细胞生物学领域的倾向性;追踪并预测由临床文章产生的引用。', 'open_cites': '开放引文模块:传播来自NIH开放引文集合的链接级、公共领域引文数据。'}, 'data_fields_definitions': {'pmid': 'pmid:PubMed标识符,由国家医学图书馆在PubMed中分配的文章ID。', 'doi': 'doi:数字对象标识符,如有。', 'year': 'year:文章发表的年份。', 'title': 'title:文章标题。', 'authors': 'authors:作者名单。', 'journal': 'journal:期刊名称(ISO缩写)。', 'is_research_article': 'is_research_article:标志,指示该文章的出版类型标签是否与原创研究文章一致。', 'relative_citation_ratio': 'relative_citation_ratio:相对引用率(RCR)——OPA的科学影响力指标。经领域调整、时间调整并与NIH资助的论文进行基准测试。任何领域中NIH资助论文的RCR中位数均为1.0。RCR为2.0表示该论文每年收到的引用数量是同一领域和年份中位NIH资助论文的两倍,而RCR为0.5则表示每年收到的引用数量是其一半。计算细节已在Hutchins等人,PLoS Biol. 2016;14(9):e1002541一文中进行说明。', 'provisional': 'provisional:过去两年内发表的论文的RCR被标记为“临时”,以反映较新文章的引用指标可能不如较老文章稳定。对于上一年的论文,如果它们收到了5次或以上的引用,则会提供临时RCR,尽管在许多情况下,这些论文的发表时间不到一年。前一年的所有论文都会收到临时RCR。当前年份被视为NIH财政年度,该年度从10月开始。例如,在2019年7月(NIH财政年度2019年),如果2018年的论文收到了5次或以上的引用,则会收到临时RCR,而2017年的所有论文都会收到临时RCR。在2019年10月,即NIH财政年度2020年开始时,如果2019年的论文收到了5次或以上的引用,则会收到临时RCR,而2018年的所有论文都会收到临时RCR。', 'citation_count': 'citation_count:引用该文章的独特文章数量。', 'citations_per_year': 'citations_per_year:自出版以来每年收到的引用数量。如果该文章以预印本和已发表文章的形式出现,则使用已发表版本年份作为主要出版日期。这是相对引用率的分子。', 'field_citation_rate': 'field_citation_rate:衡量该论文领域内在引用率的指标,使用其共引网络进行估计。', 'expected_citations_per_year': 'expected_citations_per_year:NIH资助的论文,具有相同的领域引用率和与该论文同年发表的论文,每年收到的引用数量。这是相对引用率的分母。', 'nih_percentile': 'nih_percentile:该论文RCR相对于所有NIH出版物的百分位数排名。例如,95%表示该论文的RCR高于95%的所有NIH资助出版物。', 'human': 'human:属于人类类别的MeSH术语的比例(占该文章MeSH术语中属于人类、动物或分子/细胞生物学类别的术语的比例)。', 'animal': 'animal:属于动物类别的MeSH术语的比例(占该文章MeSH术语中属于人类、动物或分子/细胞生物学类别的术语的比例)。', 'molecular_cellular': 'molecular_cellular:属于分子/细胞生物学类别的MeSH术语的比例(占该文章MeSH术语中属于人类、动物或分子/细胞生物学类别的术语的比例)。', 'x_coord': 'x_coord:文章在生物医药三角上的X坐标。', 'y_coord': 'y_coord:文章在生物医药三角上的Y坐标。', 'is_clinical': 'is_clinical:标志,指示该论文是否符合临床文章的定义。', 'cited_by_clin': 'cited_by_clin:被该论文引用的临床文章的PMID。', 'apt': 'apt:近似翻译潜力,是基于机器学习的估计,表明该出版物将在以后的临床试验或指南中被引用的可能性。计算细节已在Hutchins等人,PLoS Biol. 2019;17(10):e3000416一文中进行说明。', 'cited_by': 'cited_by:引用该论文的论文的PMID。', 'references': 'references:该文章参考文献列表中论文的PMID。'}, 'file_compression': '大型CSV文件使用zip版本4.5进行压缩,该版本比某些常见Linux发行版中默认的unzip命令行实用工具版本更新。这些文件可以使用支持版本4.5或更高版本的工具解压,例如7zip。', 'contact': '有关评论和问题,请联系iCite@mail.nih.gov。'}
提供机构:
nih.figshare.com



