Barcoding 100K specimens in a single nanopore run
收藏DataONE2024-05-30 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:e669d09727d98f09864d2e4caae27a4537922e615c8a92c5ecafc0e1adbdffd9
下载链接
链接失效反馈官方服务:
资源简介:
It is a global priority to better manage the biosphere, but action must be informed by comprehensive data on the abundance and distribution of species. The acquisition of such information is currently constrained by  high costs. DNA barcoding can speed the registration of unknown animal species, the most diverse kingdom of eukaryotes, as the BIN system automates their recognition. However, inexpensive protocols are critical as the census of all animal species is likely to require the analysis of a billion or more specimens. Barcoding involves DNA extraction followed by PCR and sequencing with the last step dominating costs until 2017. By enabling the sequencing of highly multiplexed samples, the Sequel platforms from Pacific BioSciences slashed costs by 90%, but these instruments are only deployed in core facilities because of their expense. Sequencers from Oxford Nanopore Technologies provide an escape from high capital and service costs, but their low sequence fidelity has, until rece..., , , # BARCODE 100K SPECIMENS: IN A SINGLE NANOPORE RUN
[https://doi.org/10.5061/dryad.41ns1rnp1](https://doi.org/10.5061/dryad.41ns1rnp1)
The following files are used for all bioinformatic processes described in the manuscript \"BARCODE 100K SPECIMENS: IN A SINGLE NANOPORE RUN\" by Hebert et al, 2024.
# Description of the data and file structure
**Raw_Reads_2K.fastq.gz** - This file contains the raw, base-called ONT reads for the 2K dataset. This can be used as the raw data input for ONT.sh.
**Raw_Reads_10K.fastq.gz** - This file contains the raw, base-called ONT reads for the 10K dataset. This can be used as the raw data input for ONT.sh.
**Raw_Reads_100K.fastq.gz** - This file contains the raw, base-called ONT reads for the 100K dataset. This can be used as the raw data input for ONT.sh.
**parameters_2K.xls**x - This is the parameters file for the 2K dataset (for details, see below). It contains bioinformatic run parameters and the UMI map for this dataset.
**parameters_10K.xlsx** -...
创建时间:
2025-08-01



