Markov chain representation dataset of SARS-CoV-2 genome
Source: Mendeley Data (indexed 2026-04-18)
Download link:
https://data.mendeley.com/datasets/8jt93ggv9w
Description:
COVID-19, the disease caused by the SARS-CoV-2 virus, has been spreading aggressively around the world since the end of 2019. The World Health Organization has declared it a pandemic, and as of May 13, 2020, there were more than 4 million cases and more than 250 thousand deaths. This work therefore presents a new dataset containing Markov chain representations of SARS-CoV-2 genome sequences from NCBI (1557 instances). The dataset also provides Markov chain representations of other viruses from the Virus-Host DB (11540 different viruses) and of three Riboviria viruses from NCBI (Betacoronavirus RaTG13, bat-SL-CoVZC45, and bat-SL-CoVZXC21).
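To illustrate what a Markov chain representation of a genome sequence can look like, the following is a minimal sketch that estimates transition probabilities between consecutive nucleotides. It rests on assumptions not stated in the description: a first-order chain, the four-letter alphabet A, C, G, T, and skipping of ambiguous bases such as N. The function name `transition_matrix` and the toy sequence are hypothetical, and the dataset itself may use a different chain order or file layout.

```python
from collections import Counter

ALPHABET = "ACGT"  # assumption: ambiguous bases such as N are skipped


def transition_matrix(sequence, alphabet=ALPHABET):
    """Estimate first-order transition probabilities P(next | current)."""
    seq = sequence.upper()
    # Count every adjacent pair whose two symbols are both in the alphabet.
    counts = Counter(
        (a, b) for a, b in zip(seq, seq[1:]) if a in alphabet and b in alphabet
    )
    matrix = {}
    for a in alphabet:
        row_total = sum(counts[(a, b)] for b in alphabet)
        # Normalize each row; a symbol with no outgoing pairs gets an all-zero row.
        matrix[a] = {
            b: (counts[(a, b)] / row_total if row_total else 0.0)
            for b in alphabet
        }
    return matrix


toy = "ACGTACGGTA"  # toy string for illustration, not a real genome
probs = transition_matrix(toy)
```

Each row of `probs` is the estimated distribution of the next nucleotide given the current one, so a genome of any length reduces to a fixed-size 4 x 4 table that can be compared across viruses.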
Created:
2020-05-13



