five

Creating, curating, and evaluating a mitogenomic reference database to improve regional species identification using environmental DNA|生物多样性监测数据集|基因组学数据集

收藏
DataONE2023-09-12 更新2024-06-08 收录
生物多样性监测
基因组学
下载链接:
https://search.dataone.org/view/sha256:5688141b906e013f7b2a8edcb9d41ccc6aa6a2de41c89de6b6f5926945e60137
下载链接
链接失效反馈
资源简介:
Species detection using eDNA is revolutionizing global capacity to monitor biodiversity. However, the lack of regional, vouchered, genomic sequence information—especially sequence information that includes intraspecific variation—creates a bottleneck for management agencies wanting to harness the complete power of eDNA to monitor taxa and implement eDNA analyses. eDNA studies depend upon regional databases of mitogenomic sequence information to evaluate the effectiveness of such data to detect and identify taxa. We created the Oregon Biodiversity Genome Project to create a database of complete, nearly error-free mitogenomic sequences for all of Oregon's fishes. We have successfully assembled the complete mitogenomes of 313 specimens of freshwater, anadromous, and estuarine fishes representing 24 families, 55 genera, and 129 species and lineages. Comparative analyses of these sequences illustrate that many regions of the mitogenome are taxonomically informative, that the short (~150 bp) ..., Voucher Specimen and Tissue Collection The study area initially encompassed the state of Oregon—the region of interest for our eDNA monitoring program—and expanded to a few sites in northern California and Washington State (Fig 3). To strategize sample collection, we examined historical location records in fish collections such as the Oregon State Ichthyology Collection and conferred with local biologists to identify current distributions. For cases where we knew or suspected that deeply divergent evolutionary lineages existed in the present concept of a species, we aimed to include representatives of all lineages. We ultimately identified 146 native and nonnative freshwater fish species and lineages that are currently found in Oregon and strategized collections to span watersheds throughout the state (Appendix S1).  To facilitate consistent sampling, we provided sampling kits (Appendix S2, Box S1) to collectors that contained a 500-mL Nalgene bottle filled with 10% formalin, a 2.0 mL c..., Microsoft Excel, LibreOffice, or Microsoft's free XLS Viewer can be used to open the Excel files and an unzip utility such as 7-Zip or WinZip can be used to unzip zipped fastas. For pdfs, use Adobe Acrobat Reader. Open Microsoft Word documents using Microsoft Word, OpenOffice Writer or Google Docs.,
创建时间:
2023-11-29
用户留言
有没有相关的论文或文献参考?
这个数据集是基于什么背景创建的?
数据集的作者是谁?
能帮我联系到这个数据集的作者吗?
这个数据集如何下载?
点击留言
数据主题
具身智能
数据集  4098个
机构  8个
大模型
数据集  439个
机构  10个
无人机
数据集  37个
机构  6个
指令微调
数据集  36个
机构  6个
蛋白质结构
数据集  50个
机构  8个
空间智能
数据集  21个
机构  5个
5,000+
优质数据集
54 个
任务类型
进入经典数据集
热门数据集

FER2013

FER2013数据集是一个广泛用于面部表情识别领域的数据集,包含28,709个训练样本和7,178个测试样本。图像属性为48x48像素,标签包括愤怒、厌恶、恐惧、快乐、悲伤、惊讶和中性。

github 收录

DALY

DALY数据集包含了全球疾病负担研究(Global Burden of Disease Study)中的伤残调整生命年(Disability-Adjusted Life Years, DALYs)数据。该数据集提供了不同国家和地区在不同年份的DALYs指标,用于衡量因疾病、伤害和早逝导致的健康损失。

ghdx.healthdata.org 收录

GME Data

关于2021年GameStop股票活动的数据,包括每日合并的GME短期成交量数据、每日失败交付数据、可借股数、期权链数据以及不同时间框架的开盘/最高/最低/收盘/成交量条形图。

github 收录

Agricultural Pests Dataset

Agricultural Pests Classification

kaggle 收录

CE-CSL

CE-CSL数据集是由哈尔滨工程大学智能科学与工程学院创建的中文连续手语数据集,旨在解决现有数据集在复杂环境下的局限性。该数据集包含5,988个从日常生活场景中收集的连续手语视频片段,涵盖超过70种不同的复杂背景,确保了数据集的代表性和泛化能力。数据集的创建过程严格遵循实际应用导向,通过收集大量真实场景下的手语视频材料,覆盖了广泛的情境变化和环境复杂性。CE-CSL数据集主要应用于连续手语识别领域,旨在提高手语识别技术在复杂环境中的准确性和效率,促进聋人与听人社区之间的无障碍沟通。

arXiv 收录