five

Data supporting the Master thesis "Monitoring von Open Data Praktiken - Herausforderungen beim Auffinden von Datenpublikationen am Beispiel der Publikationen von Forschenden der TU Dresden"

收藏
NIAID Data Ecosystem2026-05-02 收录
下载链接:
https://zenodo.org/record/14196538
下载链接
链接失效反馈
官方服务:
资源简介:
Data supporting the Master thesis "Monitoring von Open Data Praktiken - Herausforderungen beim Auffinden von Datenpublikationen am Beispiel der Publikationen von Forschenden der TU Dresden" (Monitoring open data practices - challenges in finding data publications using the example of publications by researchers at TU Dresden) - Katharina Zinke, Institut für Bibliotheks- und Informationswissenschaften, Humboldt-Universität Berlin, 2023 This ZIP-File contains the data the thesis is based on, interim exports of the results and the R script with all pre-processing, data merging and analyses carried out. The documentation of the additional, explorative analysis is also available. The actual PDFs and text files of the scientific papers used are not included as they are published open access. The folder structure is shown below with the file names and a brief description of the contents of each file. For details concerning the analyses approach, please refer to the master's thesis (publication following soon). ## Data sources  Folder 01_SourceData/ - PLOS-Dataset_v2_Mar23.csv (PLOS-OSI dataset) - ScopusSearch_ExportResults.csv (export of Scopus search results from Scopus) - ScopusSearch_ExportResults.ris (export of Scopus search results from Scopus) - Zotero_Export_ScopusSearch.csv (export of the file names and DOIs of the Scopus search results from Zotero) ## Automatic classification  Folder 02_AutomaticClassification/ - (NOT INCLUDED) PDFs folder (Folder for PDFs of all publications identified by the Scopus search, named AuthorLastName_Year_PublicationTitle_Title)  - (NOT INCLUDED) PDFs_to_text folder (Folder for all texts extracted from the PDFs by ODDPub, named  AuthorLastName_Year_PublicationTitle_Title) - PLOS_ScopusSearch_matched.csv (merge of the Scopus search results with the PLOS_OSI dataset for the files contained in both) - oddpub_results_wDOIs.csv (results file of the ODDPub classification) - PLOS_ODDPub.csv (merge of the results file of the ODDPub classification with the PLOS-OSI dataset for the publications contained in both) ## Manual coding  Folder 03_ManualCheck/ - CodeSheet_ManualCheck.txt (Code sheet with descriptions of the variables for manual coding) - ManualCheck_2023-06-08.csv (Manual coding results file) - PLOS_ODDPub_Manual.csv (Merge of the results file of the ODDPub and PLOS-OSI classification with the results file of the manual coding) ## Explorative analysis for the discoverability of open data Folder04_FurtherAnalyses  Proof_of_of_Concept_Open_Data_Monitoring.pdf (Description of the explorative analysis of the discoverability of open data publications using the example of a researcher) - in German ## R-Script  Analyses_MA_OpenDataMonitoring.R (R-Script for preparing, merging and analyzing the data and for performing the ODDPub algorithm)

本数据集为硕士论文《开放数据实践监测——以德国德累斯顿工业大学(TU Dresden)研究者的学术文献为例探讨数据文献查找面临的挑战》(英文标题:"Monitoring open data practices - challenges in finding data publications using the example of publications by researchers at TU Dresden")的支撑数据,作者为Katharina Zinke,隶属于柏林洪堡大学图书馆与信息科学研究所,完成于2023年。 本ZIP压缩包包含该硕士论文所依托的研究数据、结果的中间导出文件,以及涵盖全部预处理、数据合并与分析流程的R脚本。此外还附带补充探索性分析的文档。由于所用学术论文的PDF与原文本均以开放获取形式发布,故未包含其原始文件。 下文将展示文件夹结构、文件名及各文件的内容简介。若需了解分析方法的详细细节,请参阅该硕士论文(即将正式发表)。 ### 数据集来源 #### 01_源数据文件夹(01_SourceData/) - PLOS-Dataset_v2_Mar23.csv(PLOS-OSI数据集) - ScopusSearch_ExportResults.csv(Scopus搜索结果导出文件) - ScopusSearch_ExportResults.ris(Scopus搜索结果导出文件) - Zotero_Export_ScopusSearch.csv(来自Zotero的Scopus搜索结果文件名与DOI导出文件) ### 自动分类 #### 02_自动分类文件夹(02_AutomaticClassification/) - (未包含)PDFs文件夹:存放Scopus检索到的全部学术文献的PDF文件,命名格式为:作者姓氏_年份_文献标题(AuthorLastName_Year_PublicationTitle_Title) - (未包含)PDFs_to_text文件夹:存放由ODDPub从PDF文件中提取的文本文件,命名格式为:作者姓氏_年份_文献标题(AuthorLastName_Year_PublicationTitle_Title) - PLOS_ScopusSearch_matched.csv:Scopus搜索结果与PLOS-OSI数据集的交集合并文件 - oddpub_results_wDOIs.csv:ODDPub分类结果文件 - PLOS_ODDPub.csv:ODDPub分类结果文件与PLOS-OSI数据集的交集合并文件 ### 人工编码 #### 03_人工审核文件夹(03_ManualCheck/) - CodeSheet_ManualCheck.txt:人工编码变量说明编码表 - ManualCheck_2023-06-08.csv:人工编码结果文件 - PLOS_ODDPub_Manual.csv:ODDPub与PLOS-OSI分类结果文件和人工编码结果文件的合并文件 ### 开放数据可发现性探索性分析 #### 04_拓展分析文件夹(Folder04_FurtherAnalyses) - Proof_of_of_Concept_Open_Data_Monitoring.pdf:以某位研究者为例阐述开放数据文献可发现性的探索性分析报告,为德文版本 ### R脚本 - Analyses_MA_OpenDataMonitoring.R:用于数据预处理、合并、分析及执行ODDPub算法的R脚本
创建时间:
2024-11-21
二维码
社区交流群
二维码
科研交流群
商业服务