The data of the article "Database of Eu2+ and Ce3+ Doped Phosphors for Development of Violet-light Excited White LEDs"
收藏DataCite Commons2025-12-26 更新2026-05-05 收录
下载链接:
https://www.scidb.cn/detail?dataSetId=c4873d94fa9a4c269cbcb1a62a8195a9
下载链接
链接失效反馈官方服务:
资源简介:
数据收集及处理工作流数据挖掘:使用 GPT-4 模型通过提示工程(Prompt Engineering)技术,从文献中自动提取化学式、掺杂元素、激发波长和发射波长等关键信息数据规范化:利用 Pymatgen)工具包对提取的化学式进行标准化处理以消除重复项人工干预与清洗:针对同一化学式的多组实验数据,通过人工筛选具有完整表征元数据(如合成条件、衍射图谱、PL/PLE光谱等)的条目,剔除化学逻辑不一致或缺乏重复性证据的数据结构匹配:将清洗后的数据与无机晶体结构数据库(ICSD)进行交叉引用,获取晶体结构文件(CIFs),并过滤掉因随机占位导致结构定义不明确的分数占据(fractional occupancy)化合物时空分布信息数据集涵盖了过去二十余年(从 2000年以前至2024年)的Eu2+、Ce3+稀土掺杂荧光粉全球科研论文数据规模与属性描述该数据集共包含 822 条 经验证的荧光粉数据记录,分为两大部分:Eu2+掺杂体系(455 条)和 Ce3+掺杂体系(367 条)数据表中的核心列标签及其含义如下:Host Material Chemical Formula: 宿主材料的化学式Doping Element: 掺杂的稀土离子(Eu2+或Ce3+)Excitation Wavelength 激发波长,单位为纳米 (nm)Emission Wavelength: 发射波长,单位为纳米 (nm)ICSD: 对应的无机晶体结构数据库编号,用于关联 CIF 文件DOI: 原始研究论文的数字对象标识符Data Collection and Preprocessing Workflow1. Data MiningAutomated Information Extraction: Key parameters, including chemical formulas, doping elements, excitation wavelengths, and emission wavelengths, were automatically extracted from scientific literature using the GPT-4 model via prompt engineering techniques.2. Data NormalizationChemical Formula Standardization: The Pymatgen toolkit was employed to standardize the extracted chemical formulas, ensuring consistency and eliminating duplicate entries.3. Manual Intervention and CleaningData Refinement: For multiple experimental datasets associated with the same chemical formula, manual screening was performed.Inclusion Criteria: Entries with comprehensive characterization metadata (e.g., synthesis conditions, XRD patterns, PL/PLE spectra).Exclusion Criteria: Entries with inconsistent chemical logic or those lacking evidence of reproducibility were removed.4. Structural MatchingDatabase Cross-referencing: The cleaned data were cross-referenced with the Inorganic Crystal Structure Database (ICSD) to retrieve Crystallographic Information Files (CIFs).Filtering: Compounds with fractional occupancy (unclear structural definitions due to random site occupancy) were filtered out to ensure structural precision.Dataset OverviewSpatiotemporal DistributionThe dataset encompasses global research papers on Eu2+ and Ce3+ rare-earth doped phosphors spanning over two decades (from before 2000 to 2024).Data Scale and Attribute DescriptionThe final dataset contains 822 verified phosphor data records, categorized into two main groups:Eu2+ doped systems: 455 entriesCe3+ doped systems: 367 entriesCore Column Definitions:Host Material Chemical Formula: The chemical formula of the host matrix.Doping Element: The doped rare-earth ion (Eu2+ or Ce3+).Excitation Wavelength: The peak excitation wavelength in nanometers.Emission Wavelength: The peak emission wavelength in nanometers.ICSD: Inorganic Crystal Structure Database reference number for CIF file association.DOI: Digital Object Identifier of the original research paper.
提供机构:
Science Data Bank
创建时间:
2025-12-26



