
Supporting data for the paper "CREMSA: Compressed Indexing of (Ultra) Large Multiple Sequence Alignments"

Source: NIAID Data Ecosystem (indexed 2026-05-02)
Download link:
https://zenodo.org/record/15100010
Description:
Four files used in the paper "CREMSA: Compressed Indexing of (Ultra) Large Multiple Sequence Alignments" are made available here for reproducibility:
random_datasets_len10000_num30000.zip: An archive of artificial FASTA files generated as described in the paper.
HIV1_ALL_2022_genome_DNA.fasta.xz: A multiple sequence alignment of 5,381 HIV1 genomes, retrieved from the Los Alamos National Laboratory in March 2025.
nextstrain_groups_LANL-HIV-DB_HIV_genome_timetree.jsonl.gz: A JSONL file, as produced by Nextstrain, of the phylogeny of 3,090 HIV genomes among the 5,381 from the previous file.
MFS_1.fasta.xz: A multiple sequence alignment of 214,283 protein sequences of the Major Facilitator Superfamily (MFS), retrieved from Pfam in March 2025.
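To check a downloaded alignment against the counts given in the description, the following minimal sketch streams a FASTA file (optionally xz-compressed) and reports the number of records and the set of distinct sequence lengths. The function names `read_fasta` and `summarize_alignment` are illustrative and are not part of the CREMSA software. The sketch assumes the files are standard FASTA text, and that every row of a multiple sequence alignment has the same length once gap characters are included.

```python
import lzma
from typing import Iterator, Set, TextIO, Tuple


def read_fasta(handle: TextIO) -> Iterator[Tuple[str, str]]:
    """Yield (header, sequence) pairs from an open FASTA text handle."""
    header = None
    chunks = []
    for line in handle:
        line = line.rstrip("\r\n")
        if line.startswith(">"):
            # A new record starts: emit the previous one, if any.
            if header is not None:
                yield header, "".join(chunks)
            header = line[1:]
            chunks = []
        elif header is not None:
            # Sequence data may be wrapped over several lines.
            chunks.append(line.strip())
    if header is not None:
        yield header, "".join(chunks)


def summarize_alignment(path: str) -> Tuple[int, Set[int]]:
    """Return (record count, set of distinct sequence lengths) for a FASTA file.

    Files ending in ".xz" are decompressed on the fly with the standard
    library's lzma module; anything else is read as plain text.
    """
    opener = lzma.open if path.endswith(".xz") else open
    count = 0
    lengths: Set[int] = set()
    with opener(path, "rt") as handle:
        for _, sequence in read_fasta(handle):
            count += 1
            lengths.add(len(sequence))
    return count, lengths
```

Calling `summarize_alignment("HIV1_ALL_2022_genome_DNA.fasta.xz")` on the downloaded file should, if the description holds, report 5,381 records; a set containing a single length would indicate that the rows are aligned.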
Created:
2025-03-28