Image collection and supporting data for: An image dataset of cleared, x-rayed, and fossil leaves vetted to plant family for human and machine learning
收藏Mendeley Data2024-06-25 更新2024-06-27 收录
下载链接:
https://plus.figshare.com/articles/dataset/Image_collection_and_supporting_data_for_An_image_dataset_of_cleared_x-rayed_and_fossil_leaves_vetted_to_plant_family_for_human_and_machine_learning/14980698/1
下载链接
链接失效反馈官方服务:
资源简介:
Here we provide the image dataset and supporting data files for the following primary article. Please refer to the primary article and the supporting data (provided here) for all details. Wilf P, SL Wing, HW Meyer, J Rose, R Saha, T Serre, NR Cúneo, MP Donovan, DM Erwin, MA Gandolfo, E González-Akre, F Herrera, S Hu, A Iglesias, KR Johnson, TS Karim, X Zou. 2021. An image dataset of cleared, x-rayed, and fossil leaves vetted to plant family for human and machine learning. PhytoKeys 187: 93–128, doi:10.3897/phytokeys.187.72350 https://doi.org/10.3897/phytokeys.187.72350 Files are provided here as zip archives, as follows. v1.0 is the dataset that corresponds precisely to the published article and will be preserved here. Any future updates provided here will use new version numbers. Extant_Leaves_A-E_v1.0.zipExtant_Leaves_F-O_v1.0.zip Extant_Leaves_P-Z_v1.0.zip Families A–E, F–O, and P–Z, respectively, of cleared and x-rayed leaf images. Florissant_Fossil_v1.0.zip Fossil-leaf image collection from Florissant Fossil Beds National Monument. General_Fossil_v1.0.zip Fossil-leaf image collection from several other sites in North and South America. General_Fossil_uncropped_v1.0.zipReference set of the uncropped image versions for the General Fossil collection, for access to scale bars and other archival information not otherwise available digitally (see main article). Filenames are suffixed with "_uncropped" and may have minor differences in format from the cropped set. supplemental_data_v1.0.zip Archive containing three files: Master_inventory_leavesdb_v1.0 Master inventory file listing all extant and fossil specimens. See details in the main article (esp. table 1) for how to look up additional specimen data, which are easily available on the Web for most of the collections using the catalog numbers listed in this inventory file (also see below). Please note that the catalog numbers listed here may be primary or secondary, as described in the main article (table 1). The "old_Family" field preserves legacy data that can assist in locating physical specimens in the collections, which usually retain their original taxonomic organization (see main text). The other two files are catalogs of specimen data not otherwise available on the Web (see main article). General_fossils_catalog_v1.0.csv Specimen data for the "General fossil" image collection. Wing_x-ray_catalog_v1.0.csv Voucher data for the Wing X-Ray image collection.Technical notes:Catalog number field in the Master Inventory file = negative number + leaf number as listed in this file.Example: "Wing_199-001" in the Master Inventory = negative 199, leaf 1 here = Alphonsea arborea (Annonaceae) = primary voucher US 904529.Some typographical errors in this legacy catalog are left as-is, and identifications are not updated here. Vetted spellings and updated family and order assignments can be found by catalog number (= negative + leaf number) in the Master Inventory file.This file includes some additional records that did not meet criteria for the image dataset.
本研究为下述核心学术论文配套提供图像数据集与辅助数据文件。所有详细信息请参阅核心论文及本文提供的配套数据。
作者列表:Wilf P、SL Wing、HW Meyer、J Rose、R Saha、T Serre、NR Cúneo、MP Donovan、DM Erwin、MA Gandolfo、E González-Akre、F Herrera、S Hu、A Iglesias、KR Johnson、TS Karim、X Zou。该论文发表于2021年的《PhytoKeys》期刊第187卷:93–128页,DOI为10.3897/phytokeys.187.72350,链接:https://doi.org/10.3897/phytokeys.187.72350。
所有文件均以ZIP压缩包形式提供,具体如下。v1.0版本为与已发表论文完全匹配的数据集,将在此处永久留存。后续所有更新版本将使用新的版本号。
1. Extant_Leaves_A-E_v1.0.zip、Extant_Leaves_F-O_v1.0.zip、Extant_Leaves_P-Z_v1.0.zip:分别对应透明化叶片(cleared leaf)与X射线成像叶片图像集中的A–E、F–O及P–Z科现生植物。
2. Florissant_Fossil_v1.0.zip:来自弗洛里森特化石床国家纪念区的化石叶片图像集。
3. General_Fossil_v1.0.zip:来自北美与南美其他多个遗址的化石叶片图像集。
4. General_Fossil_uncropped_v1.0.zip:通用化石数据集的未裁剪图像参考集,用于获取标尺及其他无法通过其他数字化渠道获取的存档信息(详见核心论文)。文件名后缀为"_uncropped",格式与裁剪后的数据集存在细微差异。
5. supplemental_data_v1.0.zip:包含三个文件的归档包:
- Master_inventory_leavesdb_v1.0:主清单文件,列出所有现生与化石标本。详见核心论文(尤其是表1)中关于如何查询额外标本数据的说明;多数馆藏可通过本清单中列出的编号在网络上轻松获取相关数据(另见下文说明)。请注意,此处列出的编号可分为一级与二级编号,详见核心论文表1。"old_Family"字段保留了遗留数据,可辅助定位馆藏中的实体标本,这些馆藏通常保留了原始的分类学组织架构(详见正文)。
- 另外两个文件为网络上无法获取的标本数据目录:
- General_fossils_catalog_v1.0.csv:"通用化石"图像集的标本数据。
- Wing_x-ray_catalog_v1.0.csv:Wing X射线图像集的凭证标本(voucher specimen)数据。
技术说明:主清单文件中的编号字段 = 负数编号 + 本文件中列出的叶片编号。示例:主清单中的"Wing_199-001"对应负数编号199,此处的第1号叶片为Alphonsea arborea(番荔枝科Annonaceae),一级凭证标本编号为US 904529。本遗留目录中的部分排版错误将保留原样,分类鉴定信息亦不做更新。经校验的拼写及更新后的科、目分类信息,可通过主清单文件中的编号(= 负数编号 + 叶片编号)查询。本文件包含部分未达到图像数据集标准的额外记录。
创建时间:
2023-06-28



