Source Data for Manuscript: Identifying genomic data use with the Data Citation Explorer
收藏NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/12802876
下载链接
链接失效反馈官方服务:
资源简介:
This page contains the source data for the manuscript describing the Data Citation Explorer, currently in review for publication. The preprint version can be found on this page.
Files:
DCE_manual_eval_sample.xlsx:
This file was used to manually evaluate hits generated by the Data Citation Explorer. There are two separate sheets: one with publications returned by searches in PubMed and PubMed Central and another with publications returned by searches in Dimensions. Column descriptions can be found in the file itself. Each row in each evaluation sheet refers to a pair between a JAMO record and a linked publication.
DCE_citation_report.csv
Contains JAMO record IDs and PubMed IDs from the initial 2020 DCE trial run. There are 238,994 unique JAMO IDs and 30,641 unique PubMed IDs. 78,104 JAMO records are linked with publications.
Columns:
jamo_id - unique JAMO record ID
sample_group - Sample strata from which manually evaluated records were pulled
citation_count - Number of citations associated with each record
citations - comma-delimited PubMed IDs for linked publications
sampled - True/False, denoting which records were included in the initial evaluation sample
notes - descriptions for why certain sampled records were excluded from manual evaluation
unprocessed - True/False. These 7,890 records contained anomalous fields that caused them to be rejected for processing. They are represented as zero-length files in the archive.
DCE_source_files.zip:
This folder contains 3 files for each JAMO record in DCE_citation_report.tsv. For each JAMO record listed in the citation report, three files are provided:
JAMO_ID_source.yaml - The fields extracted from the JAMO record that were relevant to the citation search, including any previously known PMIDs (manually curated).
JAMO_ID_expand.yaml - The source record augmented with additional metadata discovered in other resources, including the citations that were discovered based on querying PubMed Central for the values in those metadata fields.
JAMO_ID_audit.json - The audit path as a directed acyclic graph, in JSON.
创建时间:
2024-09-23



