Mosses occurrences dataset from the temperate region of the north hemisphere: tempNorthBryo.
收藏DataCite Commons2023-06-12 更新2024-08-26 收录
下载链接:
https://figshare.com/articles/dataset/Mosses_occurrences_dataset_from_the_temperate_region_of_the_north_hemisphere_tempNorthBryo_/23500710
下载链接
链接失效反馈官方服务:
资源简介:
Dataset of publicly available biodiversity information of mosses records from the temperate region of the Northern Hemisphere. It contains 9,195,062 occurrences result of the compilation, cleaning, enrichment and validation from the Global Biodiversity Information Facility (GBIF 2021; phylum=Bryophyta), the Botanical Information and Ecology Network (BIEN), Consortium of North American Bryophyte Herbaria (CNABH) and the Integrated digitized biocollections (iDigBio). This database was created to assess and quantify how the application of different filters and taxonomic standardization databases affect the spatial patterns of species richness. Format: txt file with tab field separators and UTF-8 encoding Spatial coverage: lat = 20.000 -90.000; long = -180.000 - 180.000 Temporal coverage start=1600; end=2021 <br> Detail description of each of the 34 fields contained in the dataset. <br> ID Number(Integer) Unique Identifier of records <br> id Number(Integer) Unique Identifier of cell associated to study area grid. <br> source Text(String) Name of the original database of the record: Global Biodiversity Information Facility "GBIF", Botanical Information and Ecology Network "BIEN", Consortium of North American Bryophyte Herbaria "CNABH", Integrated Digitized Biocollections "iDigBio" <br> region Text(String) 2 letters code corresponding to one of the three regions established for the study: Europe and North Africa "eu", North America "am" and Asia "as". <br> decimalLongitude Darwin Core term. Number(Float) Geographic longitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic center of a Location. Accepted values lie between -180 and 180, inclusive. <br> decimalLatitude Darwin Core term. Number(Float) Geographic latitude (in decimal degrees, using the spatial reference system given in geodeticDatum) of the geographic center of a Location. Accepted values lie between -90 and 90, inclusive. <br> countryCode Darwin Core term. Text(String) 2 letters standard code for the country in which the record was colleted for GBIF database. <br> country Text(String) Name of the country associated to the record's position in idigBio, CNABH or BIEN database. <br> GADM_Location Text(String) Name (in lowercase) of the administrative unit at level 0 associated to the coordinates location by their position. 'sea' correspond to those coordinates located out of any polygon in the shapefile. <br> Coastal_Country Text(String) Name (in lowercase) of the nearest administrative unit at level 0 associated to the coordinates location by their position in a 0.1º buffer. <br> scientificName Darwin Core term. Text(String) Full scientific name of the organism, to the lowest level taxonomic rank that is possible to supply, and including authorship and year of the name where applicable. <br> sppName Text(String) Scientific name (genus + epithet specific) accepted by at least 3 of the taxonomic sources evaluated (GBIF backbone, The Plant List, World Flora Online, Tropicos or Taxonomic Name resolution Service) <br> consensusValue Number(Integer) Number of taxonomic sources considered to generate the species name of the record ('sppName' field). Values accepted: 5, 4 or 3. <br> gbif_species Text(String) Scientific name (genus + epithet specific) obtained after taxonomic standardization with GBIF backbone using script 2. <br> gbif_status Text(String) Taxonomical status of the species name in 'gbif_species' field based on GBIF backbone. Values: "ACCEPTED" "SYNONYM" "DOUBTFUL" <br> TPL_species Text(String) Scientific name (genus + epithet specific) obtained after taxonomic standardization with The Plant List using script 2. <br> TPL_taxonStatus Text(String) Taxonomical status of the species name in 'TPL_species' field based on The Plant List. Values: "accepted" or "unresolved" <br> tnrs_species_noauthors Text(String) Scientific name (genus + epithet specific) obtained after taxonomic standardization with Taxonomic Name resolution Service using script 2. <br> tnrs_status Text(String) Taxonomical status of the species name in 'tnrs_species_noauthors' field based on Taxonomic Name resolution Service. Values: "Accepted", "Synonym", "No opinion" <br> Tropicos_Accepted_species Text(String) Scientific name (genus + epithet specific) obtained after taxonomic standardization with Tropicos using script 2. <br> Tropicos_taxonomic_status Text(String) Taxonomical status of the species name in 'Tropicos_Accepted_species' field based on Tropicos. Values: "Accepted", "Synonym", "No opinion" <br> wfo.species Text(String) Scientific name (genus + epithet specific) obtained after taxonomic standardization with World Flora Online using script 2. <br> wfo_taxonomicStatus Text(String) Taxonomical status of the species name in 'wfo.species' field based on GBIF backbone. Values: "accepted" or "unchecked" <br> basisOfRecord Darwin Core term. Text(String) Darwin Core dataset element. The specific nature of the data record. <br> basisOfRecordRev Text(String) Revised and unified information of 'basisOfRecord' field. Allowed values: ‘PRESERVED_SPECIMEN’; ‘HUMAN_OBSERVATION’; ‘UNKNOWN’; 'LITERATURE'; 'plot'; 'LIVING_SPECIMEN'; 'MATERIAL_SAMPLE'. <br> day Darwin Core term. Number(Integer) Darwin Core dataset element. The integer day of the month on which the Event occurred. <br> month Darwin Core term. Number(Integer) Darwin Core dataset element. The ordinal month in which the Event occurred. <br> year Darwin Core term. Number(Integer) Darwin Core dataset element. The four-digit year in which the Event occurred, according to the Common Era Calendar. <br> eventDate Darwin Core term. Idate Date-time or interval during which the record was collected. <br> dayRev Number(Integer) Revised 'day' field deleting values up to 31 <br> monthRev Number(Integer) Revised 'month' field deleting values up to 12 <br> date Text(String) Valid date of collection. Obtained by concatenating ('/') 'dayRev' + 'monthRev' + 'year' fields. <br> recordedBy Darwin Core term. Text(String) A list (concatenated and separated) of names of people, groups, or organizations responsible for recording the original Occurrence. <br> recBy_Rev Text(String) Manually revised 'recorderBy' field. Strings to lower and delete extra white space.
本数据集收录北半球温带区域苔藓类植物的公开生物多样性记录。数据集包含9,195,062条物种出现记录,其数据源自全球生物多样性信息设施(Global Biodiversity Information Facility, GBIF 2021;门=苔藓植物门(Bryophyta))、植物信息与生态网络(Botanical Information and Ecology Network, BIEN)、北美苔藓植物标本馆联盟(Consortium of North American Bryophyte Herbaria, CNABH)以及集成数字化生物标本库(Integrated Digitized Biocollections, iDigBio),并经过汇编、清洗、富集与验证流程处理。
本数据库旨在评估与量化不同过滤策略及分类学标准化数据库的应用,会如何影响物种丰富度的空间分布格局。
数据格式:采用制表符作为字段分隔符的文本文件(txt),编码格式为UTF-8。
空间覆盖范围:纬度区间为20.000~90.000;经度区间为-180.000~180.000。
时间覆盖范围:起始年份1600年,终止年份2021年。
以下为数据集包含的34个字段的详细说明:
ID Number(整数型):记录的唯一标识符
id Number(整数型):研究区网格关联单元格的唯一标识符
source Text(文本型):记录的原始数据库名称,可选值包括:全球生物多样性信息设施(GBIF)、植物信息与生态网络(BIEN)、北美苔藓植物标本馆联盟(CNABH)、集成数字化生物标本库(iDigBio)
region Text(文本型):本研究划定的三大研究区域对应的2位字母代码,分别为:欧洲与北非("eu")、北美洲("am")、亚洲("as")
decimalLongitude 达尔文核心(Darwin Core)术语:地理经度(浮点型),指某一地理位置中心的地理经度(以十进制度为单位,采用大地基准面(geodeticDatum)指定的空间参考系统),取值范围为-180至180(含边界值)
decimalLatitude 达尔文核心(Darwin Core)术语:地理纬度(浮点型),指某一地理位置中心的地理纬度(以十进制度为单位,采用大地基准面(geodeticDatum)指定的空间参考系统),取值范围为-90至90(含边界值)
countryCode 达尔文核心(Darwin Core)术语:国家编码(文本型),针对GBIF数据库的记录,采用2位标准国家代码,表示记录采集所在国家
country Text(文本型):与idigBio、CNABH或BIEN数据库中记录位置关联的国家名称
GADM_Location Text(文本型):根据坐标位置匹配得到的0级行政单元名称(小写格式),若坐标位于shapefile文件中的所有多边形之外,则标记为"sea"
Coastal_Country Text(文本型):根据坐标位置在0.1°缓冲范围内匹配得到的最近0级行政单元名称(小写格式)
scientificName 达尔文核心(Darwin Core)术语:生物的完整科学名称(文本型),需标注至可提供的最低分类学等级,必要时需包含命名者及命名年份
sppName Text(文本型):经至少3个评估分类学源(GBIF系统发育主干库、The Plant List、世界植物在线(World Flora Online)、Tropicos或分类名称解析服务(Taxonomic Name Resolution Service))认可的科学名称(属名+种加词)
consensusValue 整数型:生成记录物种名称(即`sppName`字段)时所参考的分类学源数量,允许取值为5、4或3
gbif_species Text(文本型):通过脚本2采用GBIF系统发育主干库进行分类学标准化后得到的科学名称(属名+种加词)
gbif_status Text(文本型):基于GBIF系统发育主干库,`gbif_species`字段中物种名称的分类学状态,可选值为:"ACCEPTED"、"SYNONYM"、"DOUBTFUL"
TPL_species Text(文本型):通过脚本2采用The Plant List进行分类学标准化后得到的科学名称(属名+种加词)
TPL_taxonStatus Text(文本型):基于The Plant List,`TPL_species`字段中物种名称的分类学状态,可选值为:"accepted"、"unresolved"
tnrs_species_noauthors Text(文本型):通过脚本2采用分类名称解析服务(Taxonomic Name Resolution Service)进行分类学标准化后得到的科学名称(属名+种加词,不含命名者)
tnrs_status Text(文本型):基于分类名称解析服务,`tnrs_species_noauthors`字段中物种名称的分类学状态,可选值为:"Accepted"、"Synonym"、"No opinion"
Tropicos_Accepted_species Text(文本型):通过脚本2采用Tropicos进行分类学标准化后得到的科学名称(属名+种加词)
Tropicos_taxonomic_status Text(文本型):基于Tropicos,`Tropicos_Accepted_species`字段中物种名称的分类学状态,可选值为:"Accepted"、"Synonym"、"No opinion"
wfo.species Text(文本型):通过脚本2采用世界植物在线(World Flora Online)进行分类学标准化后得到的科学名称(属名+种加词)
wfo_taxonomicStatus Text(文本型):基于GBIF系统发育主干库,`wfo.species`字段中物种名称的分类学状态,可选值为:"accepted"、"unchecked"
basisOfRecord 达尔文核心(Darwin Core)术语:数据记录的具体本质属性(文本型),属于达尔文核心数据集元素
basisOfRecordRev Text(文本型):`basisOfRecord`字段的修订与统一后信息,允许取值为:"PRESERVED_SPECIMEN"(保藏标本)、"HUMAN_OBSERVATION"(人类观测记录)、"UNKNOWN"(未知)、"LITERATURE"(文献记录)、"plot"(样地记录)、"LIVING_SPECIMEN"(活体标本)、"MATERIAL_SAMPLE"(材料样本)
day 达尔文核心(Darwin Core)术语:事件发生所在月份的整数日期(整数型),属于达尔文核心数据集元素
month 达尔文核心(Darwin Core)术语:事件发生所在的序数月份(整数型),属于达尔文核心数据集元素
year 达尔文核心(Darwin Core)术语:事件发生的四位数公历年份(整数型),属于达尔文核心数据集元素
eventDate 达尔文核心(Darwin Core)术语:记录采集的日期-时间或时间区间
dayRev 整数型:修订后的`day`字段,删除超出31的日期值
monthRev 整数型:修订后的`month`字段,删除超出12的月份值
date Text(文本型):有效的采集日期,通过将`dayRev`、`monthRev`与`year`字段以斜杠("/")拼接得到
recordedBy 达尔文核心(Darwin Core)术语:负责记录原始发生记录的人员、团体或组织名称列表(以分隔符拼接)(文本型)
recBy_Rev Text(文本型):经手动修订的`recordedBy`字段,将字符串转换为小写格式并删除多余空白字符
提供机构:
figshare
创建时间:
2023-06-12



