COVID-19 Genome Sequence Dataset
收藏aws亚马逊开源数据集2024-03-07 收录
下载链接:
https://registry.opendata.aws/ncbi-covid-19
下载链接
链接失效反馈官方服务:
资源简介:
This repository within the ACTIV TRACE initiative houses a comprehensive collection of datasets related to SARS-CoV-2. The processing of SARS-CoV-2 Sequence Read Archive (SRA) files has been optimized to identify genetic variations in viral samples. This information is then presented in the Variant Call Format (VCF). Each VCF file corresponds to the SRA parent-run's accession ID. Additionally, the data is available in the parquet format, making it easier to search and filter using the Amazon Athena Service. The SARS-CoV-2 Variant Calling Pipeline is designed to handle new data every six ho...
提供机构:
National Library of Medicine (NLM)



