Data from: Use of long-read sequencing simulators to assess real-world applications for food safety
收藏agdatacommons.nal.usda.gov2024-02-28 更新2025-03-23 收录
下载链接:
https://agdatacommons.nal.usda.gov/articles/dataset/Data_from_Use_of_long-read_sequencing_simulators_to_assess_real-world_applications_for_food_safety/24857262/1
下载链接
链接失效反馈官方服务:
资源简介:
Shiga toxin-producing Escherichia coli (STEC) and Listeria monocytogenes are responsible for severe foodborne illnesses in the United States. Current identification methods require at least four days to identify STEC and six days for L. monocytogenes. Adoption of long-read, whole genome sequencing for testing could significantly reduce the time needed for identification, but method development costs are high. Therefore, the goal of this project was to use NanoSim-H software to simulate Oxford Nanopore sequencing reads to assess the feasibility of sequencing-based foodborne pathogen detection and guide experimental design. Sequencing reads were simulated for STEC, L. monocytogenes, and a 1:1 combination of STEC and Bos taurus genomes using NanoSim-H. This dataset includes all of the simulated reads generated by the project in fasta format. This dataset can be analyzed bioinformatically or used to test bioinformatic pipelines.
志贺毒素产生的大肠杆菌(STEC)和单核细胞增生李斯特菌是美国严重食源性疾病的罪魁祸首。目前用于鉴定STEC至少需要四天时间,而鉴定李斯特菌则需要六天。采用长读长全基因组测序进行检测有望显著缩短鉴定时间,但方法开发成本高昂。因此,本项目旨在利用NanoSim-H软件模拟牛津纳米孔测序读数,以评估基于测序的食源性病原体检测的可行性并指导实验设计。使用NanoSim-H模拟了STEC、李斯特菌以及STEC和牛基因组1:1混合的测序读数。本数据集包括项目生成的所有模拟读数,格式为fasta。此数据集可用于生物信息学分析或测试生物信息学流程。
提供机构:
agdatacommons.nal.usda.gov



