
GCRSR Proficiency Test

DataCite Commons | Updated: 2026-03-10 | Indexed: 2026-05-04
Download link:
https://data.jrc.ec.europa.eu/dataset/421d3b52-07a7-4dda-a1b2-2db349270aca
Description:
The application of whole-genome sequencing (WGS) technology in regulatory food microbiology provides an unprecedented opportunity to produce highly informative laboratory analyses that support risk assessment and risk management actions. The quality of WGS datasets will have a significant impact on downstream bioinformatics processes; one critical element is the possible presence of adventitious DNA sequences introduced by contamination during sample handling and sequencing operations. This dataset is part of a project that aims to assure WGS data quality by determining how sequencing data contamination events affect downstream analyses such as typing and marker discovery. The goal is to contribute to the development and implementation of harmonized quality protocols for the application of WGS technologies in the international regulatory food microbiology community. This record consists of three in silico datasets that will be used for proficiency tests. They are organized into mock Illumina MiSeq sequencing runs, each consisting of the same 24 Escherichia coli samples. FASTQ files for these samples were created by simulating reads with ART from either MinION/PacBio + Illumina MiSeq hybrid-assembly polished genomes or closed reference genomes downloaded from NCBI. One run was created using the empirical Illumina MiSeq profile, while the other runs were simulated using custom read profiles generated from over-clustered runs. The reference strains represent a diverse cross-section of serotypes, Shiga toxin subtypes, antimicrobial resistance (AMR) profiles, and plasmid profiles.
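The read simulation described above could be driven roughly as follows. This is a minimal sketch, not the project's actual pipeline: the record does not state the ART parameters used, so the read length, coverage, fragment size, file names, and the choice of the built-in `MSv3` profile are all assumptions made for illustration. The helper `build_art_command` is hypothetical; it only assembles an `art_illumina` argument list and does not run the tool.

```python
from typing import List, Optional, Tuple


def build_art_command(
    reference_fasta: str,
    output_prefix: str,
    read_length: int = 250,          # assumed; not stated in the record
    fold_coverage: float = 50.0,     # assumed; not stated in the record
    mean_fragment: int = 500,        # assumed; not stated in the record
    fragment_sd: int = 50,           # assumed; not stated in the record
    custom_profiles: Optional[Tuple[str, str]] = None,
) -> List[str]:
    """Assemble an art_illumina paired-end command as an argument list."""
    cmd = [
        "art_illumina",
        "-p",                        # paired-end simulation
        "-i", reference_fasta,       # input genome (FASTA)
        "-l", str(read_length),      # read length
        "-f", str(fold_coverage),    # fold coverage
        "-m", str(mean_fragment),    # mean fragment size
        "-s", str(fragment_sd),      # fragment size standard deviation
        "-o", output_prefix,         # output file prefix
    ]
    if custom_profiles is None:
        # Built-in MiSeq v3 profile, standing in for the "empirical" run.
        cmd += ["-ss", "MSv3"]
    else:
        # User-supplied quality profiles for read 1 and read 2, standing in
        # for the profiles derived from over-clustered runs.
        read1_profile, read2_profile = custom_profiles
        cmd += ["-1", read1_profile, "-2", read2_profile]
    return cmd


# Hypothetical file names, for illustration only.
empirical = build_art_command("sample01.fasta", "run1/sample01_R")
overclustered = build_art_command(
    "sample01.fasta",
    "run2/sample01_R",
    custom_profiles=("overclustered_R1.txt", "overclustered_R2.txt"),
)
print(" ".join(empirical))
print(" ".join(overclustered))
```

Either list can be passed to `subprocess.run(cmd, check=True)` on a machine where ART is installed. Returning the command as a list rather than executing it keeps the sketch inspectable without the tool present.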
Provider:
European Commission, Joint Research Centre
Created:
2026-03-10