Data from Readsynth: short-read simulation for consideration of composition-biases in reduced metagenome sequencing approaches
收藏DataONE2024-04-12 更新2024-06-08 收录
下载链接:
https://search.dataone.org/view/sha256:7a8f3d3090cbdc87ad156cac1c83f6f40f88a7a9aeb3ae887617c81c68a9b016
下载链接
链接失效反馈官方服务:
资源简介:
Background
The application of reduced metagenomic sequencing approaches holds promise as a middle ground between targeted amplicon sequencing and whole metagenome sequencing approaches but has not been widely adopted as a technique. A major barrier to adoption is the lack of read simulation software built to handle characteristic features of these novel approaches. Reduced metagenomic sequencing (RMS) produces unique patterns of fragmentation per genome that are sensitive to restriction enzyme choice, and the non-uniform size selection of these fragments may introduce novel challenges to taxonomic assignment as well as relative abundance estimates.
Results
Through the development and application of simulation software, readsynth, we compare simulated metagenomic sequencing libraries with existing RMS data to assess the influence of multiple library preparation and sequencing steps on downstream analytical results. Based on read depth per position, readsynth achieved 0.79 Pearsonâs corre..., Sequence data were collected and aggregated from publicly available NCBI SRA databases for raw sequence data (https://www.ncbi.nlm.nih.gov/sra) and NCBI RefSeq databases for reference genome assemblies (https://www.ncbi.nlm.nih.gov/refseq/).
Downloaded reference genomes have been concatenated and indexed using command line \"cat\" command and the bwa index command., , # readsynth\_analysis
[https://doi.org/10.5061/dryad.nzs7h44zk](https://doi.org/10.5061/dryad.nzs7h44zk)
The dataset contained here provides the necessary raw sequence data to perform analyses for the simulation software [readsynth](https://github.com/ryandkuster/readsynth).
The dataset includes the genomes and databases necessary to reproduce the steps in the github repository [readsynth_analysis](https://github.com/ryandkuster/readsynth_analysis) and correspond with that repository's \"raw_data\" directory.
## Description of the data and file structure
The genome directory \"raw_data\" is broken into the following subdirectories (further descriptions below):
```
.
âââ helius
â  âââ all_2084
â  âââ genomes
â  âââ genomes_combined
âââ kraken_dbs
â  âââ k2_pluspfp_20220607
â  âââ snipen_bei_db
â  â  âââ library
â  â  âââ added
â  âââ sun_atcc_db
â  âââ library
â  âââ added
âââ liu_RMS
â  âââ mock_community_estimate
â  âââ 10M_bracken_profile
â ...
创建时间:
2025-07-30



