five

Data from: Detecting aquatic invasive species in bait and pond stores with targeted environmental (e) DNA high-throughput sequencing metabarcode assays: angler, retailer, and manager implications

收藏
Mendeley Data2024-06-25 更新2024-06-29 收录
下载链接:
https://datadryad.org/stash/dataset/doi:10.5061/dryad.4tmpg4f5x
下载链接
链接失效反馈
官方服务:
资源简介:
MetaTrim.py: See README.md SeqTabToFasta.pl: script converts SeqTab format files with results from Dada2 (R package) to FASTA files for each sample. Expects an input file named SeqTab.txt in the CWD. BlastCylce500.pl: script performs BLAST searches for every sequence in files ending with ".fasta" in the CWD. BLAST searches are performed against custom databases for all Actinopterygii on GenBank and only fishes in the Great Lakes. If file name begins with "m" (MiFish), sequences are blasted against 12S databases, otherwise, against Cytochrome b databases. BlastCycle500Remote.pl: script performs BLAST searches for every sequence in files ending with ".fasta" in the CWD. BLAST is performed remotely against the entirety of GenBank. SummarizeBlasteDNAEnvironmentSepASVs.pl: script summarizes BLAST results (files ending in "Res.txt" from BlastCycle500.pl"). Species names for hits with >90% query cover and identity are concatenated if the e value is the lowest for all hits. Returns a txt file with summarized hits per sample. Expects an input file named InReads.txt in the CWD. This file is tab delimited. The first value should be the sample name identical to that used as input in the above script, followed by tab, then the number of reads in that sample. This allows calculation of proportion of read represented by the hit. BOPPSummarize*.py: script removes species hits that do not occur at a portion above the error for a specific marker in a specific sample, unless the occur in multiple markers in that same sample. Hits are then compared against morphological results from bait and pond stores. Several inputs are required and are included in this repository. They include a file showing the error rate calculated from positive controls for each marker on each sequencing run (ErrorCons.txt; if none is supplied, the default value will be 0.001), a shop key file (ShopKeybaitShops.txt), a file indicating the legality of species in each state sampled (SppStatusBait.txt), morphological results from each store at the time of sampling (InMorph.txt), and BLAST results files (summarized by the script above, separated by marker).
创建时间:
2023-06-28
二维码
社区交流群
二维码
科研交流群
商业服务