five

iCite Database Snapshot 2025-01

收藏
DataCite Commons2025-02-07 更新2025-04-16 收录
下载链接:
https://nih.figshare.com/articles/dataset/iCite_Database_Snapshot_2025-01/28360103
下载链接
链接失效反馈
官方服务:
资源简介:
This is a database snapshot of the iCite web service (provided here as a single zipped CSV file, or compressed, tarred JSON files). In addition, citation links in the NIH Open Citation Collection are provided as a two-column CSV table in open_citation_collection.zip. iCite provides bibliometrics and metadata on publications indexed in PubMed, organized into three modules:<br>Influence: Delivers metrics of scientific influence, field-adjusted and benchmarked to NIH publications as the baseline.<br>Translation: Measures how Human, Animal, or Molecular/Cellular Biology-oriented each paper is; tracks and predicts citation by clinical articles<br>Open Cites: Disseminates link-level, public-domain citation data from the NIH Open Citation Collection<br>Definitions for individual data fields:<br>pmid: PubMed Identifier, an article ID as assigned in PubMed by the National Library of Medicine<br>doi: Digital Object Identifier, if available<br>year: Year the article was published<br>title: Title of the article<br>authors: List of author names<br>journal: Journal name (ISO abbreviation)<br>is_research_article: Flag indicating whether the Publication Type tags for this article are consistent with that of a primary research article<br>relative_citation_ratio: Relative Citation Ratio (RCR)--OPA's metric of scientific influence. Field-adjusted, time-adjusted and benchmarked against NIH-funded papers. The median RCR for NIH funded papers in any field is 1.0. An RCR of 2.0 means a paper is receiving twice as many citations per year than the median NIH funded paper in its field and year, while an RCR of 0.5 means that it is receiving half as many citations per year. Calculation details are documented in Hutchins et al., PLoS Biol. 2016;14(9):e1002541.<br>provisional: RCRs for papers published in the previous two years are flagged as "provisional", to reflect that citation metrics for newer articles are not necessarily as stable as they are for older articles. Provisional RCRs are provided for papers published previous year, if they have received with 5 citations or more, despite being, in many cases, less than a year old. All papers published the year before the previous year receive provisional RCRs. The current year is considered to be the NIH Fiscal Year which starts in October. For example, in July 2019 (NIH Fiscal Year 2019), papers from 2018 receive provisional RCRs if they have 5 citations or more, and all papers from 2017 receive provisional RCRs. In October 2019, at the start of NIH Fiscal Year 2020, papers from 2019 receive provisional RCRs if they have 5 citations or more and all papers from 2018 receive provisional RCRs.<br>citation_count: Number of unique articles that have cited this one<br>citations_per_year: Citations per year that this article has received since its publication. If this appeared as a preprint and a published article, the year from the published version is used as the primary publication date. This is the numerator for the Relative Citation Ratio.<br>field_citation_rate: Measure of the intrinsic citation rate of this paper's field, estimated using its co-citation network.<br>expected_citations_per_year: Citations per year that NIH-funded articles, with the same Field Citation Rate and published in the same year as this paper, receive. This is the denominator for the Relative Citation Ratio.<br>nih_percentile: Percentile rank of this paper's RCR compared to all NIH publications. For example, 95% indicates that this paper's RCR is higher than 95% of all NIH funded publications.<br>human: Fraction of MeSH terms that are in the Human category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)<br>animal: Fraction of MeSH terms that are in the Animal category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)<br>molecular_cellular: Fraction of MeSH terms that are in the Molecular/Cellular Biology category (out of this article's MeSH terms that fall into the Human, Animal, or Molecular/Cellular Biology categories)<br>x_coord: X coordinate of the article on the Triangle of Biomedicine<br>y_coord: Y Coordinate of the article on the Triangle of Biomedicine<br>is_clinical: Flag indicating that this paper meets the definition of a clinical article.<br>cited_by_clin: PMIDs of clinical articles that this article has been cited by.<br>apt: Approximate Potential to Translate is a machine learning-based estimate of the likelihood that this publication will be cited in later clinical trials or guidelines. Calculation details are documented in Hutchins et al., PLoS Biol. 2019;17(10):e3000416.<br>cited_by: PMIDs of articles that have cited this one.<br>references: PMIDs of articles in this article's reference list.<br>Large CSV files are zipped using zip version 4.5, which is more recent than the default unzip command line utility in some common Linux distributions. These files can be unzipped with tools that support version 4.5 or later such as 7zip.<br>Comments and questions can be addressed to iCite@mail.nih.gov

本数据集为iCite网络服务的数据库快照(以单个压缩CSV文件或压缩的tar包JSON文件形式提供)。此外,美国国立卫生研究院(NIH)开放引文集合中的引文链接以两列CSV表格形式包含在open_citation_collection.zip压缩包中。iCite提供PubMed索引文献的文献计量学数据与元数据,其功能分为三个模块:<br>影响力模块:提供科学影响力指标,该指标经领域调整,并以NIH资助文献为基准进行对标。<br>转化模块:评估每篇论文在人类、动物或分子/细胞生物学方向的侧重程度;追踪并预测临床文章对其的引用情况<br>开放引文模块:传播来自NIH开放引文集合的、链接级别的公共领域引文数据<br>各数据字段定义如下:<br>pmid:PubMed标识符(PubMed Identifier),即美国国家医学图书馆在PubMed中为文献分配的唯一标识<br>doi:数字对象标识符(Digital Object Identifier)(如有)<br>year:文献发表年份<br>title:文献标题<br>authors:作者名单<br>journal:期刊名称(ISO缩写)<br>is_research_article:标志位,指示文献的出版类型标签是否符合原始研究论文的定义<br>relative_citation_ratio:相对引文比率(Relative Citation Ratio,RCR)——OPA的科学影响力指标,经领域调整、时间调整,并以NIH资助文献为对标基准。任一领域中NIH资助文献的RCR中位数为1.0;RCR为2.0意味着该文献每年的被引次数是同领域同年份NIH资助文献中位数的两倍,而RCR为0.5则表示其每年被引次数为中位数的一半。计算细节详见Hutchins等人的研究(PLoS Biol. 2016;14(9):e1002541)。<br>provisional:近两年发表文献的RCR会被标记为“provisional”(临时),以反映新文献的被引指标未必如旧文献稳定。上一年发表的文献若被引次数≥5次(即便多数情况下发表时间不足一年),仍会提供临时RCR;前两年发表的所有文献均会获得临时RCR。当前年份以NIH财年(NIH Fiscal Year)计,该财年始于10月。例如,2019年7月(NIH财年2019)时,2018年发表的文献若被引≥5次则获临时RCR,2017年发表的所有文献均获临时RCR;2019年10月(NIH财年2020启动时),2019年发表的文献若被引≥5次则获临时RCR,2018年发表的所有文献均获临时RCR。<br>citation_count:引用该文献的唯一文献数量<br>citations_per_year:文献自发表以来的年均被引次数。若文献同时以预印本和正式出版物形式存在,则以正式出版物的发表年份为主要发表日期。该指标是相对引文比率的分子。<br>field_citation_rate:文献所属领域的内在被引率,通过共引网络估算<br>expected_citations_per_year:与该文献具有相同领域被引率且同年发表的NIH资助文献的年均被引次数,是相对引文比率的分母<br>nih_percentile:该文献RCR在所有NIH资助文献中的百分位排名。例如,95%表示其RCR高于95%的NIH资助文献<br>human:MeSH术语(MeSH terms)中属于人类类别的占比(基于文献MeSH术语中归属于人类、动物或分子/细胞生物学类别的部分)<br>animal:MeSH术语中属于动物类别的占比(基于文献MeSH术语中归属于人类、动物或分子/细胞生物学类别的部分)<br>molecular_cellular:MeSH术语中属于分子/细胞生物学类别的占比(基于文献MeSH术语中归属于人类、动物或分子/细胞生物学类别的部分)<br>x_coord:文献在生物医学三角图上的X坐标<br>y_coord:文献在生物医学三角图上的Y坐标<br>is_clinical:标志位,指示文献是否符合临床文章的定义<br>cited_by_clin:引用该文献的临床文章的PMID列表<br>apt:近似转化潜力(Approximate Potential to Translate),是基于机器学习的估计值,用于预测该文献被后续临床试验或指南引用的可能性。计算细节详见Hutchins等人的研究(PLoS Biol. 2019;17(10):e3000416)<br>cited_by:引用该文献的所有文献的PMID列表<br>references:该文献参考文献列表中的PMID<br>大型CSV文件采用zip 4.5版本压缩,该版本比部分常见Linux发行版中的默认unzip命令行工具版本更新。这些文件可通过支持4.5及更高版本的工具解压,如7zip。<br>意见与问题可发送至邮箱iCite@mail.nih.gov。
提供机构:
The NIH Figshare Archive
创建时间:
2025-02-07
5,000+
优质数据集
54 个
任务类型
进入经典数据集
二维码
社区交流群

面向社区/商业的数据集话题

二维码
科研交流群

面向高校/科研机构的开源数据集话题

数据驱动未来

携手共赢发展

商业合作