
Barcoding 100K specimens in a single nanopore run

Source: DataONE. Updated 2024-05-30. Indexed 2024-06-08.
Download link:
https://search.dataone.org/view/sha256:e669d09727d98f09864d2e4caae27a4537922e615c8a92c5ecafc0e1adbdffd9
Resource description:
It is a global priority to better manage the biosphere, but action must be informed by comprehensive data on the abundance and distribution of species. The acquisition of such information is currently constrained by high costs. DNA barcoding can speed the registration of unknown species of animals, the most diverse kingdom of eukaryotes, as the BIN system automates their recognition. However, inexpensive protocols are critical, as the census of all animal species is likely to require the analysis of a billion or more specimens. Barcoding involves DNA extraction followed by PCR and sequencing, with the last step dominating costs until 2017. By enabling the sequencing of highly multiplexed samples, the Sequel platforms from Pacific BioSciences slashed costs by 90%, but these instruments are only deployed in core facilities because of their expense. Sequencers from Oxford Nanopore Technologies provide an escape from high capital and service costs, but their low sequence fidelity has, until rece...

# BARCODE 100K SPECIMENS: IN A SINGLE NANOPORE RUN

[https://doi.org/10.5061/dryad.41ns1rnp1](https://doi.org/10.5061/dryad.41ns1rnp1)

The following files are used for all bioinformatic processes described in the manuscript "BARCODE 100K SPECIMENS: IN A SINGLE NANOPORE RUN" by Hebert et al., 2024.

# Description of the data and file structure

**Raw_Reads_2K.fastq.gz** - This file contains the raw, base-called ONT reads for the 2K dataset. It can be used as the raw data input for ONT.sh.

**Raw_Reads_10K.fastq.gz** - This file contains the raw, base-called ONT reads for the 10K dataset. It can be used as the raw data input for ONT.sh.

**Raw_Reads_100K.fastq.gz** - This file contains the raw, base-called ONT reads for the 100K dataset. It can be used as the raw data input for ONT.sh.

**parameters_2K.xlsx** - This is the parameters file for the 2K dataset (for details, see below). It contains bioinformatic run parameters and the UMI map for this dataset.

**parameters_10K.xlsx** -...
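
Before passing one of the raw read files to ONT.sh, it can be useful to check that the download is intact. The following is a minimal sketch, not part of the authors' pipeline, that counts the reads in a gzipped FASTQ file and reports their mean length. It assumes the common four-line FASTQ layout (header, sequence, `+`, qualities) with no line wrapping. The function names `fastq_read_lengths` and `summarize` are hypothetical helpers introduced here for illustration.

```python
import gzip
import sys
from itertools import islice


def fastq_read_lengths(path):
    """Yield the sequence length of each record in a gzipped FASTQ file.

    Assumption: every record spans exactly four lines (no wrapped sequences).
    """
    with gzip.open(path, "rt") as handle:
        while True:
            # Read one record: header, sequence, '+', qualities.
            record = list(islice(handle, 4))
            if not record:
                break
            if len(record) < 4 or not record[0].startswith("@"):
                raise ValueError("malformed FASTQ record")
            yield len(record[1].rstrip("\n"))


def summarize(path):
    """Return (read_count, mean_read_length) for a gzipped FASTQ file."""
    count = 0
    total = 0
    for length in fastq_read_lengths(path):
        count += 1
        total += length
    mean = total / count if count else 0.0
    return count, mean


if __name__ == "__main__" and len(sys.argv) > 1:
    # Example: python fastq_summary.py Raw_Reads_2K.fastq.gz
    n_reads, mean_len = summarize(sys.argv[1])
    print(f"{n_reads} reads, mean length {mean_len:.1f} bp")
```

The file is streamed record by record, so memory use stays flat even for the 100K dataset.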
Created:
2025-08-01